![]() |
|
![]() |
||||
Thread | Thread Starter | Forum | Replies | Last Post |
what program can i use for calculating coverage? | dkrtndhkd | Bioinformatics | 2 | 02-03-2012 06:39 PM |
Gene list for calculating coverage | nseh | Bioinformatics | 1 | 05-22-2011 07:31 AM |
reads and coverage question | seqgirl123 | Illumina/Solexa | 2 | 03-28-2011 09:06 AM |
"nucleotide coverage" to genome feature coverage | sheremey | Bioinformatics | 3 | 11-02-2010 12:24 PM |
low 454 coverage combined with high solexa coverage | strob | Bioinformatics | 7 | 10-07-2010 11:14 AM |
![]() |
|
Thread Tools |
![]() |
#1 |
Junior Member
Location: Germany Join Date: Feb 2012
Posts: 2
|
![]()
Hey guys,
I could not find a precise information of how to calculate the physical coverage from genome coverage, assuming that I have a paired end library, where both reads have an average length of 70bps and the insert size is around 200bp. Does the definition of physical coverage include both reads? I was guessing that it does and calculated it the following way: Physical_Coverage=Genome_Coverage*(2*Read_Length+Insert_Size)/(2*Read_Length) As an example: If I know that my genome coverage is 4, then the phyiscal coverage is 4*(2*70+200)/(2*70)=9.7 Is that correct? Thanks in advance! Tristan Last edited by tristanstoeber; 02-22-2012 at 08:47 AM. |
![]() |
![]() |
![]() |
#2 |
David Eccles (gringer)
Location: Wellington, New Zealand Join Date: May 2011
Posts: 838
|
![]()
What's the difference between physical coverage and genome coverage? Is that coverage including unsequenced insert bases vs coverage only for sequenced bases?
The coverage calculation I am used to is fairly simple: Code:
total number of base pairs sequenced (S) ---------------------------------------- total number of base pairs in genome (G) Code:
both reads have an average length of 70bps and the insert size is around 200bp. For paired-end runs, you could either consider S to include the non-sequenced bases present in the fragment (Sf = fragment length * number of pairs), or only include the sequenced bases (Ss = read length * number of reads, or Ss = 2 * read length * number of pairs). The ratio of these two S sizes is the ratio of total fragment length to sequenced bases, which will be the same as the ratio for any coverage value calculated based on these sizes: Code:
Sf/Ss = (fragment length) / (read length * 2) <=> Sf = (fragment length) * Ss / (read length * 2) <=> Ss = (read length * 2) * Sf / (fragment length) Last edited by gringer; 02-22-2012 at 12:30 PM. Reason: correction -- reads / pairs, relationship equation |
![]() |
![]() |
![]() |
#3 |
Member
Location: Newcastle upon Tyne Join Date: Apr 2013
Posts: 55
|
![]()
Can anyone please explain to me the difference between physical and genome coverage please?
Thank you Kind Regards Parinita |
![]() |
![]() |
![]() |
#4 |
Registered Vendor
Location: Eugene, OR Join Date: May 2013
Posts: 521
|
![]()
For paired end reads, a typical Illumina library would consist of two 100-bp reads from a 500 bp genomic fragment. So if you get 10X coverage of the genome with the sequenced reads, you will have a higher coverage of the genomic fragments used to generate those reads. See here:
Code:
ssssss---------------ssssss ssssss-------------sssssss ssssss---------ssssss GGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGG
__________________
Providing nextRAD genotyping and PacBio sequencing services. http://snpsaurus.com |
![]() |
![]() |
![]() |
#5 |
Member
Location: Newcastle upon Tyne Join Date: Apr 2013
Posts: 55
|
![]()
Thank you. Now I understand the difference.
Kind Regards Parinita. |
![]() |
![]() |
![]() |
Tags |
genome coverage, paired end reads, physical coverage |
Thread Tools | |
|
|