Hi!
I'm looking for approximate statistics of genome assembly errors (SNP, indel) which are introduced due to errors in raw reads.
Most errors are eliminated when coverage is high enough, but some (i.e. homopolymer errors) occur non-randomly and, as a consequence, introduce errors to assembly.
Maybe someone has come across such information (papers, presentations)?
I think it`s a simple but important question which can be answered (approximately) by comparing assemblies made from reads from different sequencing platforms.
Thanks in advance!
Aleksander Manolov
I'm looking for approximate statistics of genome assembly errors (SNP, indel) which are introduced due to errors in raw reads.
Most errors are eliminated when coverage is high enough, but some (i.e. homopolymer errors) occur non-randomly and, as a consequence, introduce errors to assembly.
Maybe someone has come across such information (papers, presentations)?
I think it`s a simple but important question which can be answered (approximately) by comparing assemblies made from reads from different sequencing platforms.
Thanks in advance!
Aleksander Manolov