Seqanswers Leaderboard Ad

**gringer** · 02-22-2012, 12:16 PM

What's the difference between physical coverage and genome coverage? Is that coverage including unsequenced insert bases vs coverage only for sequenced bases?

The coverage calculation I am used to is fairly simple:

Code:

total number of base pairs sequenced (S)
----------------------------------------
total number of base pairs in genome (G)

For a single end run of 100M reads, each with 100bp sequenced, that's 10Gbp sequenced bases, which would be a coverage of 5x for a 2Gbp genome.

Code:

both reads have an average length of 70bps and the insert size is around 200bp.

insert size is a confusing term. I much prefer fragment length, because the size selection is most commonly for fragments of a particular size.

For paired-end runs, you could either consider S to include the non-sequenced bases present in the fragment (Sf = fragment length * number of pairs), or only include the sequenced bases (Ss = read length * number of reads, or Ss = 2 * read length * number of pairs).

The ratio of these two S sizes is the ratio of total fragment length to sequenced bases, which will be the same as the ratio for any coverage value calculated based on these sizes:

Code:

    Sf/Ss = (fragment length) / (read length * 2) 
<=> Sf = (fragment length) * Ss / (read length * 2)
<=> Ss = (read length * 2) * Sf / (fragment length)

**pari_89** · 06-24-2013, 09:43 AM

Can anyone please explain to me the difference between physical and genome coverage please?

Thank you

Kind Regards

Parinita

**SNPsaurus** · 06-24-2013, 10:43 AM

For paired end reads, a typical Illumina library would consist of two 100-bp reads from a 500 bp genomic fragment. So if you get 10X coverage of the genome with the sequenced reads, you will have a higher coverage of the genomic fragments used to generate those reads. See here:

Code:

[FONT="Courier New"]          
                  ssssss---------------ssssss
ssssss-------------sssssss        ssssss---------ssssss
GGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGG
[/FONT]

This example has less than 1X sequence coverage "s" of the genome "G". But there is >1X coverage by the genomic fragments in the library. If you are mapping inversions, for example, the physical coverage of the fragments is more important than the sequencing coverage. If you are using mate pairs (two 100 bp reads from 5-10kb fragments), the physical coverage will be much higher than the sequencing coverage.

**pari_89** · 06-24-2013, 10:53 AM

Thank you. Now I understand the difference.

Kind Regards

Parinita.

Topics	Statistics	Last Post
Cancer Metastasis: A Deep Dive into Cellular Plasticity by seqadmin Started by seqadmin, 04-11-2024, 12:08 PM	0 responses 30 views 0 likes	Last Post by seqadmin 04-11-2024, 12:08 PM
Proteogenomic Profiles Offer New Clues in Prostate Cancer by seqadmin Started by seqadmin, 04-10-2024, 10:19 PM	0 responses 32 views 0 likes	Last Post by seqadmin 04-10-2024, 10:19 PM
Novel Diagnostic Assay Enhances Ovarian Cancer Detection by seqadmin Started by seqadmin, 04-10-2024, 09:21 AM	0 responses 28 views 0 likes	Last Post by seqadmin 04-10-2024, 09:21 AM
Evolutionary Dynamics of Centromeres: A Comparative Genomic Analysis by seqadmin Started by seqadmin, 04-04-2024, 09:00 AM	0 responses 52 views 0 likes	Last Post by seqadmin 04-04-2024, 09:00 AM

Seqanswers Leaderboard Ad

Announcement

Newbie Question: Calculating physical coverage from genome coverage

Comment

Comment

Comment

Comment

Latest Articles

ad_right_rmr

News