SEQanswers Hot to calculate coverage on a 454 targeted region sequenced ?
 User Name Remember Me? Password
 Register FAQ Members List Calendar Search Today's Posts Mark Forums Read

 Similar Threads Thread Thread Starter Forum Replies Last Post arendon Bioinformatics 53 08-20-2015 08:23 AM gumbos Bioinformatics 1 01-09-2012 05:01 PM coonya SOLiD 0 12-28-2010 11:14 PM Giorgio C Bioinformatics 2 11-25-2010 01:57 AM litali Bioinformatics 0 08-23-2010 08:45 AM

 03-21-2012, 02:39 AM #1 Giorgio C Member   Location: ITALY Join Date: Oct 2010 Posts: 89 Hot to calculate coverage on a 454 targeted region sequenced ? Hi all, I have a region of 250.000 bp targeted with probes and then sequenced on a 454. How can i make a statistical calculation of the coverage ? Do i have to consider the total number of bp sequenced and number of reads ? Or just for example considering 1.000.000 bp sequenced : Coverage = 100.000/250.000 bp ? Any advices are highly appreciated, Thanks
 03-21-2012, 04:10 AM #2 sphil Senior Member   Location: Stuttgart, Germany Join Date: Apr 2010 Posts: 192 Hi, coverage = (#reads * length of reads) / length of target sequence thus, having 50k reads with length of 250 and 100k target sequence length you get a average coverage of 125. However please keep in mind that some regions can be sequences more easily and all sequences techniques have their own caveats. Keeping that in mind the above approximation is still valid. Best Phil
 03-21-2012, 04:15 AM #3 Giorgio C Member   Location: ITALY Join Date: Oct 2010 Posts: 89 Thank you very much for your reply. So if i undersand well the correct formula is: total numbero of bp obteined * avarege length of all reads / length of targeted region Is that right ?
 03-21-2012, 04:17 AM #4 Giorgio C Member   Location: ITALY Join Date: Oct 2010 Posts: 89 or is : total number of bp obteined * total number of reads / length of targeted region ?
 03-21-2012, 04:29 AM #5 sphil Senior Member   Location: Stuttgart, Germany Join Date: Apr 2010 Posts: 192 it is: total number of bp obtained / length of target region. Since total number of bp obtained == amount of reads * length of reads.
 03-21-2012, 04:51 AM #6 Giorgio C Member   Location: ITALY Join Date: Oct 2010 Posts: 89 Hmm.. i'm sorry but something is not clear to me; If i have this situation: - Number of total Bp sequenced = 7.114.424 - Number of reads = 354 - Length of targeted region = 225.000 bp Is correct only to do 7.114.424 / 225.000 = 31,61 ; Means 31% of coverage ? And i don't need to consider the "Number of reads" (354) ? I hope I was clear, Thanks you for your help
03-21-2012, 05:12 AM   #7
sphil
Senior Member

Location: Stuttgart, Germany

Join Date: Apr 2010
Posts: 192

Quote:
 Originally Posted by Giorgio C Hmm.. i'm sorry but something is not clear to me; If i have this situation: - Number of total Bp sequenced = 7.114.424 - Number of reads = 354 - Length of targeted region = 225.000 bp Is correct only to do 7.114.424 / 225.000 = 31,61 ; Means 31% of coverage ? And i don't need to consider the "Number of reads" (354) ? I hope I was clear, Thanks you for your help
yes, you are right. You do not need to consider the number of redas because this should result from: number of reads * read length. To get the coverage you simply do the calculation above, thus 7.114.424 / 225.000 = 31,61. That does not mean that you ave 31% coverage. It means that every base in your target sequence is covered by 31 bases contained in your reads. So you have a coverage of 31, which is not too bad.

 03-21-2012, 05:21 AM #8 Giorgio C Member   Location: ITALY Join Date: Oct 2010 Posts: 89 Thank you very much for your reply. You have been very helpful !!!
 10-23-2012, 02:06 PM #10 Giorgio C Member   Location: ITALY Join Date: Oct 2010 Posts: 89 Thanks jordipt for your clarification...it's never late for good suggestions