Hi Guys,
even though I have been working with NGS for a while it is not clear to me why mapping onto a reference genome is preferred to denovo assembly to decipher genomic variations like CNV, inversions, translocations etc.
I guess one issue is the difficulty in confidently assembling the entire sequence from short reads. But given that most of the new data is paired end and the read lengths are increasing steadily, what other problems do you see with denovo assembly? If this issue is resolved, then would denovo assembly identify the genomic variations more accurately and confidently?
Thoughts and insights are welcome, especially in the light of BGI's push in this direction.
even though I have been working with NGS for a while it is not clear to me why mapping onto a reference genome is preferred to denovo assembly to decipher genomic variations like CNV, inversions, translocations etc.
I guess one issue is the difficulty in confidently assembling the entire sequence from short reads. But given that most of the new data is paired end and the read lengths are increasing steadily, what other problems do you see with denovo assembly? If this issue is resolved, then would denovo assembly identify the genomic variations more accurately and confidently?
Thoughts and insights are welcome, especially in the light of BGI's push in this direction.
Comment