![]() |
|
![]() |
||||
Thread | Thread Starter | Forum | Replies | Last Post |
percentage similarity between genomes | parulvk | Bioinformatics | 3 | 11-16-2011 11:36 AM |
Percentage of mapped reads ? | zack80.liu | Bioinformatics | 6 | 03-01-2011 10:08 AM |
percentage coverage after alignment | johnsequence | Bioinformatics | 9 | 03-15-2010 02:36 PM |
How big is the data from one lane of solexa? | tujchl | Illumina/Solexa | 2 | 11-11-2009 07:04 PM |
General control lane used by Illumina/Solexa data?! | edge | Illumina/Solexa | 6 | 09-14-2009 04:05 AM |
![]() |
|
Thread Tools |
![]() |
#1 |
Senior Member
Location: Cambridge, MA Join Date: Mar 2009
Posts: 141
|
![]()
Hi all
I'm planning a project to sequence multiple bacterial strains and am trying to calculate how many samples I can multiplex per lane while getting sufficient coverage to accurately determine polymorphisms. According to the VAAL paper from the Broad (http://www.nature.com/nmeth/journal/...meth.1286.html), 1 lane of 36 bp reads for Staph aureus gave ~53x Q20 coverage (table S1). I calculate that this is about 64% of the total sequence produced for the lane. I haven't come across any other similar calculations in the literature. What is your experience with the total amount of sequencing needed to ensure 20-30x Q20 coverage? |
![]() |
![]() |
![]() |
#2 | |
Member
Location: Germany Join Date: May 2008
Posts: 79
|
![]() Quote:
As example for good cases, here are some numbers for a project that went well in terms of quality (5.8m reads, 40mers). Pre-assembly QC gave these numbers:
In assembly:
The theoretical maximum average coverage would have been 55.2x and the achieved coverage was 53.3x. Which equals to ~3.5% loss. Not too shabby. However, the bigger problem for you is the consistency of what you get from sequencing in terms of raw numbers: even good labs with 7 to 9m 40mer-reads per lane have every once in a while a run which yields only 3 to 3.5m reads. Normally they'll redo it for you free of charge, but you have to account for that. B. PS: 53x is way too much coverage |
|
![]() |
![]() |
![]() |
#3 |
Senior Member
Location: Cambridge, MA Join Date: Mar 2009
Posts: 141
|
![]()
Thanks, BaCh, that is very helpful. When you say 53x is way too much coverage, what would you consider sufficient but not excessive coverage for your purposes?
|
![]() |
![]() |
![]() |
#4 |
Member
Location: Germany Join Date: May 2008
Posts: 79
|
![]()
Incidentally, the coverage also depends a bit on the length of the reads. For 36 and 40mers I've run experiments by starting at ~35x and reducing down to 15x:
Please note that I do some hand editing on the assemblies and check everything not only by statistics, but by visual inspection. Your mileage may vary. For 76mers first results let me think I'll get away with a bit less coverage, but I haven't checked thoroughly yet. Please also have a look at this paper from the Sanger Centre in Nature from Nov. last year: http://www.ncbi.nlm.nih.gov/pubmed/18987734 It's a pretty good read and, if I remember correctly, the basic conclusions on coverage are comparable. |
![]() |
![]() |
![]() |
Thread Tools | |
|
|