**Jessica_L** · 12-09-2015, 06:32 AM

Looking at the example you posted, the 4th column is the number of reads, in this case 11. I'm not sure what you mean about 14 reads in reality. I don't see that anywhere in the example, but I only skimmed, so maybe I missed something.

To your second question: the ] character is not a base call quality score. It represents the mapping quality, which is a measure of how well the read aligns to the reference. I think this link explains it better than I can: https://www.biostars.org/p/8371/
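As a rough illustration, assuming the ] in the example follows a ^ read-start marker (in the samtools pileup format, the ASCII code of the character after ^ minus 33 gives the mapping quality), the value can be decoded like this. The function name is made up for the example.

```python
def mapq_from_pileup_char(ch: str) -> int:
    """Decode the character that follows '^' in the pileup read-bases column.

    Assumes the samtools convention: mapping quality = ASCII code - 33.
    """
    return ord(ch) - 33


print(mapq_from_pileup_char("]"))  # 60
```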

**tristan dubos** · 12-09-2015, 06:52 AM

Thank you for the answer. I can explain in more detail, still using this case:

seq2 156 A 11 .$......+2AG.+2AG.+2AGGG <975;:<<<<<

We have: 9 "." + 2 "G" = 11
But the 3 reads with the AG insertion are not counted, which is why I'm saying there are 14 reads covering this position in reality. I'm asking because if a variant caller uses column 4 to filter on coverage depth at the position, it can't find indels ...

OK, I understand the meaning of the ^ better now. Thank you again!

**Jessica_L** · 12-09-2015, 01:10 PM

My understanding of the base read format is that the AG insertions exist on three of the 11 reads that have already been counted: the insertion sits between this reference position (156) and the next position in the reference sequence (157). The reads on which those insertions appear are counted here; in the example they are all matches to the reference (dots). I read them as ".+2AG", which is a match to the reference plus a 2 bp insertion consisting of AG. Treating that read as "." and "+2AG" is double counting it.

If you're saying that filtering on read depth would potentially cause you to toss out some indels, I'd agree with that statement. Filtering can always cost you the ability to see a novel variant, but it's the tradeoff for less noisy data. That's not the same as the variant caller not being able to find indels at all, though.
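To make the counting above concrete, here is a minimal sketch of how the read bases column could be walked so that an indel annotation is attached to the read before it rather than counted as a separate read. It is a simplification, not how any particular variant caller does it, and `count_pileup_reads` is a made-up name.

```python
import re


def count_pileup_reads(bases: str):
    """Count reads and collect indels from a pileup read-bases string."""
    reads = 0
    indels = []  # one (sign, sequence) tuple per read carrying an indel
    i = 0
    while i < len(bases):
        c = bases[i]
        if c == "^":
            # '^' marks a read start; the next character encodes mapping quality
            i += 2
        elif c == "$":
            # '$' marks a read end and belongs to the preceding read
            i += 1
        elif c in "+-":
            # Indel on the preceding read: sign, length digits, then the sequence
            length_digits = re.match(r"\d+", bases[i + 1:]).group()
            start = i + 1 + len(length_digits)
            end = start + int(length_digits)
            indels.append((c, bases[start:end]))
            i = end
        else:
            # '.', ',', a base letter, '*', ...: one read covering the position
            reads += 1
            i += 1
    return reads, indels


reads, indels = count_pileup_reads(".$......+2AG.+2AG.+2AGGG")
print(reads, len(indels))  # 11 3
```

On the example line this gives 11 reads, matching column 4, with three of those reads carrying the +2AG insertion.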

question about mpileup
