Seqanswers Leaderboard Ad

**maubp** · 04-29-2013, 06:35 AM

You need to read more about the FLAG field in SAM/BAM. Here bits 0x4 and 0x10 will tell you if a read is mapped and to which strand.

You can't just look at column 3 (RNAME), as that can be a reference name if the read's mate was mapped even if the read itself was not.

**NitaC** · 04-29-2013, 08:28 AM

Hi maubp!

It is my understanding that 0x4 tells you that the read was unmapped and that 0x10 indicates that the read is on the reverse strand. I don't think either of those flags really address my issue. I've already filtered for unaligned reads.

And I'm not sure I understand your second response. If I've filtered out unaligned reads and filtered for only the forward strand, ultimately I still need some sort of reference name or gene id to count right? Are you saying that after filtering for forward reads, there may still be reads in my results that have not actually mapped to the forward strand but are listed because their mate did?

As it turns out, my little pipeline actually matches the results using htseq-count so I'm a little more confident in what I did. And my co-worker's counts were from one sample whereas mine were from all the samples combined. That little tidbit would've saved me from a lot of headaches.

**maubp** · 04-29-2013, 08:34 AM

Originally posted by NitaC View Post

Are you saying that after filtering for forward reads, there may still be reads in my results that have not actually mapped to the forward strand but are listed because their mate did?

Yes in general. This is deliberate so that when coordinate sorted the unmapped read is next to its mapped partner in the BAM file. See "Recommended Practise for the SAM format" in the specification. Also "Bit 0x4 is the only reliable place to tell whether the segment is unmapped".

**NitaC** · 04-29-2013, 09:02 AM

Ohhh, good to know. Thank you.

**Simon Anders** · 04-30-2013, 02:55 AM

Also, remember that a read that aligns to multiple places will be listed in multiple lines in the SAM file.

Topics	Statistics	Last Post
Evaluating Genome Sequencing for ECMO Patients in the NICU by seqadmin Started by seqadmin, 12-17-2024, 10:28 AM	0 responses 33 views 0 likes	Last Post by seqadmin 12-17-2024, 10:28 AM
New Genetic Toolkit Refines Studies on Gene Function and Disease by seqadmin Started by seqadmin, 12-13-2024, 08:24 AM	0 responses 49 views 0 likes	Last Post by seqadmin 12-13-2024, 08:24 AM
Study Links Brain Mechanism to Emotional Responses in Animals and Humans by seqadmin Started by seqadmin, 12-12-2024, 07:41 AM	0 responses 34 views 0 likes	Last Post by seqadmin 12-12-2024, 07:41 AM
Study Identifies Ribosomal RNA Fingerprints as Early Cancer Biomarkers by seqadmin Started by seqadmin, 12-11-2024, 07:45 AM	0 responses 46 views 0 likes	Last Post by seqadmin 12-11-2024, 07:45 AM

Seqanswers Leaderboard Ad

Announcement

procedure for extracting raw read counts

Comment

Comment

Comment

Comment

Comment

Latest Articles

ad_right_rmr

News