07-29-2009, 06:00 AM   #1
PubMed: Mapping accuracy of short reads from massively parallel sequencing and the im

Syndicated from PubMed RSS Feeds

Mapping accuracy of short reads from massively parallel sequencing and the implications for quantitative expression profiling.

PLoS One. 2009;4(7):e6323

Authors: Palmieri N, Schlötterer C

BACKGROUND: Massively parallel sequencing offers enormous potential for expression profiling, in particular for interspecific comparisons. Currently, different platforms for massively parallel sequencing are available, which differ in read length and sequencing costs. The 454 technology offers the longest reads. The other sequencing technologies are more cost-effective, at the expense of shorter reads. Reliable expression profiling by massively parallel sequencing depends crucially on the accuracy with which the reads can be mapped to the corresponding genes.

METHODOLOGY/PRINCIPAL FINDINGS: We performed an in silico analysis to evaluate whether incorrect mapping of the sequence reads results in a biased expression pattern. A comparison of six available mapping software tools indicated considerable heterogeneity in mapping speed and accuracy. Independently of the software used to map the reads, we found that for compact genomes both short (35 bp, 50 bp) and long sequence reads (100 bp) result in an almost unbiased expression pattern. In contrast, for species with a larger genome containing more gene families and repetitive DNA, shorter reads (35-50 bp) produced a considerable bias in gene expression. In humans, about 10% of the genes had fewer than 50% of the sequence reads correctly mapped. Sequence polymorphism of up to 9% had almost no effect on the mapping accuracy of 100 bp reads. For 35 bp reads, up to 3% sequence divergence did not strongly affect the mapping accuracy. The effect of indels on the mapping efficiency strongly depends on the mapping software.

CONCLUSIONS/SIGNIFICANCE: In complex genomes, expression profiling by massively parallel sequencing could introduce a considerable bias due to incorrectly mapped sequence reads if the read length is short. Nevertheless, this bias could be accounted for if the genomic sequence is known. Furthermore, sequence polymorphisms and indels also affect the mapping accuracy and may cause a biased gene expression measurement. The choice of the mapping software is highly critical, and the reliability depends on the presence/absence of indels and the divergence between reads and the reference genome. Overall, we found SSAHA2 and CLC to produce the most reliable mapping results.

PMID: 19636379 [PubMed - in process]
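
For readers who want to see what "fraction of reads correctly mapped per gene" means in practice, here is a minimal Python sketch of that kind of bookkeeping. It is not the authors' pipeline: the function names, the (true gene, mapped gene) input format, and the toy reads are all assumptions made up for illustration. In a simulation you know which gene each read came from, so you can compare that with where the mapper placed it.

```python
from collections import Counter

def per_gene_accuracy(read_assignments):
    """Fraction of each gene's simulated reads that mapped back to that gene.

    read_assignments: iterable of (true_gene, mapped_gene) pairs, where
    mapped_gene is None for an unmapped read (hypothetical input format).
    """
    total = Counter()
    correct = Counter()
    for true_gene, mapped_gene in read_assignments:
        total[true_gene] += 1
        if mapped_gene == true_gene:
            correct[true_gene] += 1
    return {gene: correct[gene] / total[gene] for gene in total}

def share_of_genes_below(accuracy_by_gene, threshold=0.5):
    """Share of genes whose per-gene accuracy is below the threshold."""
    if not accuracy_by_gene:
        return 0.0
    below = sum(1 for acc in accuracy_by_gene.values() if acc < threshold)
    return below / len(accuracy_by_gene)

def expression_ratio(read_assignments):
    """Observed read count divided by true read count, per gene.

    A ratio of 1.0 means unbiased; above 1.0 means the gene picked up
    reads that belong elsewhere (for example from a paralog).
    """
    pairs = list(read_assignments)
    true_counts = Counter(true_gene for true_gene, _ in pairs)
    observed = Counter(mapped for _, mapped in pairs if mapped is not None)
    return {gene: observed[gene] / true_counts[gene] for gene in true_counts}

# Toy data, invented for this example: geneB and geneC stand in for
# members of one gene family whose short reads get confused.
reads = [
    ("geneA", "geneA"), ("geneA", "geneA"), ("geneA", "geneA"), ("geneA", "geneB"),
    ("geneB", "geneB"), ("geneB", "geneB"),
    ("geneC", "geneB"), ("geneC", None), ("geneC", "geneC"),
]

accuracy = per_gene_accuracy(reads)
share_below = share_of_genes_below(accuracy)
ratios = expression_ratio(reads)
```

With these toy reads, geneC has only one of its three reads placed correctly, so it falls under the 50% threshold, while geneB ends up with twice as many reads as it truly produced. That is the kind of expression bias the abstract describes for gene families in larger genomes.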

