SEQanswers

SEQanswers (http://seqanswers.com/forums/index.php)
-   Literature Watch (http://seqanswers.com/forums/forumdisplay.php?f=10)
-   -   Sequence and structural variation in a human genome uncovered by short-read, massivel (http://seqanswers.com/forums/showthread.php?t=1999)

benimmyeo 06-25-2009 02:14 AM

Sequence and structural variation in a human genome uncovered by short-read, massivel
 
Sequence and structural variation in a human genome uncovered by short-read, massively parallel ligation sequencing using two base encoding

Kevin Judd McKernan1,7, Heather E. Peckham1, Gina Costa1, Stephen McLaughlin1, Eric Tsung1, Yutao Fu1, Christopher Clouser1, Cisyla Dunkan1, Jeffrey Ichikawa1, Clarence Lee1, Zheng Zhang1, Andrew Sheridan1, Haoning Fu1, Swati Ranade1, Eileen Dimilanta1, Tanya Sokolsky1, Lei Zhang1, Cynthia Hendrickson2, Bin Li1, Lev Kotler1, Jeremy Stuart1, Joel Malek3, John Manning1, Alena Antipova1, Damon Perez1, Michael Moore1, Kathleen Hayashibara1, Michael Lyons1, Robert Beaudoin1, Brittany Coleman1, Michael Laptewicz1, Adam Sanicandro1, Michael Rhodes1, Francisco De La Vega1, Rajesh K. Gottimukkala1, Fiona Hyland1, Michael Reese1, Shan Yang1, Vineet Bafna4, Ali Bashir4, Andrew MacBride5, Can Aklan6, Jeffrey M Kidd6, Evan E Eichler6 and Alan Phillip Blanchard1
+Author Affiliations



Abstract
We describe the genome sequencing of an anonymous individual of African origin using a novel ligation based sequencing assay that enables a unique form of error correction that improves the raw accuracy of the aligned reads to >99.9% allowing us to accurately call SNPs with as few as 2 reads per allele. We collected several billion mate-pair reads yielding ~18 haploid coverage of aligned sequence and close to 300 clone coverage. Over 98% of the reference genome is covered with at least one uniquely placed read and 99.65% spanned by at least one uniquely placed mate-paired clone. We identify over 3.8 million SNPs, 19% of which are novel. Mate-paired data is used to physically resolve haplotype phases of nearly 2/3 of the genotypes obtained and produce phased segments of up to 210 Kb. We detect 226,529 intra-read indels, 5,590 indels between mate-paired reads, 91 inversions and 4 gene fusions. We use a novel approach for detecting indels between mate-paired reads that are smaller than the standard deviation of the insert size of the library and discover deletions in common with those detected with our intra-read approach. Dozens of previously described disease susceptibility mutations and thousands of novel potentially functional variants, both single-nucleotide and structural, are identified in this individual which suggests a higher than expected load of deleterious variants that can be tolerated in the human genome. There is more genetic variation in the human genome still to be uncovered and we provide guidance for future surveys in populations and cancer biopsies.

http://genome.cshlp.org/content/earl....full.pdf+html


All times are GMT -8. The time now is 12:19 PM.

Powered by vBulletin® Version 3.8.9
Copyright ©2000 - 2020, vBulletin Solutions, Inc.