I am attempting to assemble a genome from an organism I am working on. Initial BLAST analysis of some assembled contigs has led me to believe I am dealing with previously described bacterium “X”. Read mapping in geneious supports this as the majority of my reads are used and I get good coverage (bar a few regions of no coverage).
My de novo assembly however is giving me mixed messages…Though the assembled contigs have a high ANI, >95% when compare to the reference genome, the contigs will not map to the reference. I have a handful of contigs between 300 – 600 kb however, none will map back to the reference. I initially thought this was due to a poor assembly however my read quality is good and I can more-or-less (give or take a few base pairs) replicate the large contigs across two different assemblers (Spades and Abyss) and two different data sets from illumina and ion torrent reads generated from the same organism. DNA was extracted from a known pure culture so it is unlikely to be due to contaminating reads.
Has anyone come across this before or have any idea of where I might have gone wrong?
My de novo assembly however is giving me mixed messages…Though the assembled contigs have a high ANI, >95% when compare to the reference genome, the contigs will not map to the reference. I have a handful of contigs between 300 – 600 kb however, none will map back to the reference. I initially thought this was due to a poor assembly however my read quality is good and I can more-or-less (give or take a few base pairs) replicate the large contigs across two different assemblers (Spades and Abyss) and two different data sets from illumina and ion torrent reads generated from the same organism. DNA was extracted from a known pure culture so it is unlikely to be due to contaminating reads.
Has anyone come across this before or have any idea of where I might have gone wrong?
Comment