Go Back   SEQanswers > Bioinformatics > Bioinformatics

Similar Threads
Thread Thread Starter Forum Replies Last Post
masked or unmasked genome reference for RNAseq skly RNA Sequencing 2 08-29-2015 08:47 PM
Consensus sequence out of BAM+VCF with low coverage areas masked? Tron Bioinformatics 0 07-12-2013 03:46 AM
Transcript coverage estimates? tboothby General 2 12-14-2011 03:42 AM
Masked/Unmasked Reference Genome ytmnd85 General 5 05-31-2009 04:52 PM
Masked or unmasked genome for ChIP-seq analysis? hbbio Bioinformatics 3 04-07-2009 12:14 PM

Thread Tools
Old 11-04-2016, 12:05 PM   #1
Junior Member
Location: london

Join Date: Jun 2016
Posts: 2
Default Coverage Estimates from Masked/Unmasked Genome

Hi All,

I've carried out a mapping run with Illumina paired-end reads to a genome using BWA. From this I've calculated i) x times coverage and ii) fraction of the reference sequence covered to a depth of at least one read. My supervisor now wants me to get coverage metrics which take into account regions that have no read coverage as they are repetitive elements. In essence he just wants coverage stats for the 'mappable' region.

So, for example, the genome I'm using is c. 2.3Gb in length and c. 50% of this is composed of repeats which reads are unlikely to map to. This will deflate the coverage estimates. So if, say, I have c. 50% of a reference sequence covered at at least 1 read depth; if I minus the 50% of the genome that are repeats then this rises to 100%. What I'm trying to figure out is if I know the annotations info for the repetitive elements can I figure this out with my existing .bam file or will I need to remap to a hard-masked genome or remove the repetitive elements somehow and then figure it out.

I really hope this makes sense (I suspect not!).


Last edited by Magpie101; 11-04-2016 at 12:09 PM.
Magpie101 is offline   Reply With Quote
Old 11-04-2016, 01:03 PM   #2
Location: CA

Join Date: Jul 2013
Posts: 74

The samtools "depth" command should output the coverage at only the bases that have a depth of at least 1 if that's what you want.

But this is conceptually different than filtering out repetitive regions based on the annotations you have.
atcghelix is offline   Reply With Quote
Old 11-04-2016, 01:14 PM   #3
Devon Ryan
Location: Freiburg, Germany

Join Date: Jul 2011
Posts: 3,480

Use GEM to determine the mappable regions and then determine coverage accordingly.
dpryan is offline   Reply With Quote

coverage, genome mapping, hardmask

Thread Tools

Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

BB code is On
Smilies are On
[IMG] code is On
HTML code is Off

All times are GMT -8. The time now is 05:24 AM.

Powered by vBulletin® Version 3.8.9
Copyright ©2000 - 2020, vBulletin Solutions, Inc.
Single Sign On provided by vBSSO