**amitm** · 03-09-2015, 02:51 AM

hi priya,
You haven't given any specific details. Was the term used in the context of NGS analysis? Was it referring to a reference genome or a de novo assembled one?

**priya** · 03-09-2015, 03:00 AM

I came across this term in a paper describing normalization of ChIP-seq reads.
For context, I have attached a screenshot of the relevant lines from the paper.

Attached Files

article.png (140.9 KB, 33 views)

**amitm** · 03-09-2015, 03:06 AM

hi,
it seems that corrected genome size is the area of the genome covered by all the ChIP-seq reads in that sample.
So, if sample A has 20M reads and they cover 2Gb of hg19, then corrected genome size is 2Gb.

**priya** · 03-09-2015, 04:01 AM

Hi amitm,
Thank you for your reply!

Can you please clarify how to calculate the genome coverage from a sequencing experiment?
For the per-sample read count, I can easily check the alignment logs (e.g. Bowtie log files), which clearly report the number of reads mapped per sample.

**amitm** · 03-09-2015, 06:42 AM

hi,
Once you have mapped the reads, use the resulting BAM file to create a BED file.
Use bedtools bamtobed:

http://bedtools.readthedocs.org/en/latest/content/tools/bamtobed.html

The coordinates returned will overlap, so you need to merge them to create "unique" regions/coordinate intervals.
Use bedtools merge:

http://bedtools.readthedocs.org/en/latest/content/tools/merge.html

Once that is done, add up the lengths of all the merged intervals; that total is the portion of the genome covered, i.e. the corrected genome size.
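
The three steps above could be chained roughly as follows. This is only a sketch: the function names `sum_bed_lengths` and `covered_bases` are hypothetical helpers invented for illustration, and the `sort` step is an assumption added because bedtools merge expects input sorted by chromosome and start position.

```shell
# Sum the lengths of BED intervals read from stdin.
# BED is 0-based and half-open, so each interval's length is end - start.
# printf "%.0f" avoids scientific notation for genome-scale totals in some awks.
sum_bed_lengths() {
    awk 'BEGIN { FS = "\t" } { total += $3 - $2 } END { printf "%.0f\n", total }'
}

# Hypothetical wrapper (requires bedtools on PATH): covered bases for one BAM file.
covered_bases() {
    bedtools bamtobed -i "$1" \
        | sort -k1,1 -k2,2n \
        | bedtools merge -i stdin \
        | sum_bed_lengths
}
```

Running `covered_bases sample.bam` (with `sample.bam` standing in for your own file) would then print a single number, the total count of bases covered by at least one read.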

**priya** · 03-09-2015, 07:14 AM

Hi amitm,
Thanks a lot for your clear explanation. I will try it out.

**AlexReynolds** · 03-25-2015, 01:13 PM

You can use BEDOPS bam2bed to convert from BAM to BED, pipe to bedops to merge overlapping elements, and pipe to bedmap to generate a list of lengths per merged element to sum with awk:

$ bam2bed < foo.bam | bedops --merge - | bedmap --echo-overlap-size - | awk '{s += $1;} END {print s;}' > answer.txt

In this case, bedmap is mapping merged elements against themselves. Merged elements coming out of bedops are guaranteed to be disjoint, so --echo-overlap-size is guaranteed to report the unique length of each merged element.
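
To see why merging first matters, here is a small sketch of the same merge-then-sum logic using only `sort` and `awk` on a BED stream. It is not how BEDOPS or bedtools work internally; `merged_coverage` is a hypothetical name, and the input is assumed to be tab-separated BED with at least three columns.

```shell
# Merge overlapping or touching BED intervals from stdin and print the
# total number of bases they cover, counting each base once.
merged_coverage() {
    sort -k1,1 -k2,2n \
    | awk 'BEGIN { FS = "\t" }
        # New chromosome, or a gap before this interval: close the current run.
        $1 != chrom || $2 > end {
            if (chrom != "") total += end - start
            chrom = $1; start = $2; end = $3
            next
        }
        # Overlapping or touching interval: extend the current run if it reaches further.
        $3 > end { end = $3 }
        END {
            if (chrom != "") total += end - start
            printf "%.0f\n", total
        }'
}
```

For example, feeding it the intervals chr1:0-100 and chr1:50-150 counts the shared bases 50-100 once, whereas summing the raw interval lengths without merging would count them twice.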

Genome size and corrected genome size

Comment

Comment

Comment

Comment

Comment

Comment

Comment

Latest Articles

ad_right_rmr

News