Hi,
I apologize for my potentially "silly" question. However I've been playing around with data from the 1000genomes project, and had a look at quite a few of the alignment files that they provide on their FTP link. One thing struck me:
In the SAM files pertaining to the chromosomes of the individuals, the avg. no. of variations between the aligned reads and the reference is between 5-10%. Also, there is a good percentage of reads (greater than 10%) that remain unaligned. However, its said too often that "the similarity between the DNA of any 2 human beings is greater than 99%".
Well, something doesn't add up!
I apologize for my potentially "silly" question. However I've been playing around with data from the 1000genomes project, and had a look at quite a few of the alignment files that they provide on their FTP link. One thing struck me:
In the SAM files pertaining to the chromosomes of the individuals, the avg. no. of variations between the aligned reads and the reference is between 5-10%. Also, there is a good percentage of reads (greater than 10%) that remain unaligned. However, its said too often that "the similarity between the DNA of any 2 human beings is greater than 99%".
Well, something doesn't add up!