Hi everybody!
I'm a little confused with the "why" of the existence of a "sweet spot" coverage when assembling a genome.
for example, i have been reading that the sweet spot to 454 dataset is between 60X ~ 80X, and more coverage could result in missassemblies. I thought that as more coverage, lesser error probability in each base.
I imagine there is some kind of distribution related to the error rate of the sequencing technology used. but i'm not clear.
thanks in advance!
I'm a little confused with the "why" of the existence of a "sweet spot" coverage when assembling a genome.
for example, i have been reading that the sweet spot to 454 dataset is between 60X ~ 80X, and more coverage could result in missassemblies. I thought that as more coverage, lesser error probability in each base.
I imagine there is some kind of distribution related to the error rate of the sequencing technology used. but i'm not clear.
thanks in advance!
Comment