**lh3** · 01-28-2010, 04:43 PM

What aligner are you using? Samtools may work well with some aligners but worse with others. Does it do gapped alignment? As for strand bias, people from the Broad Institute found that SNPs supported by reads on one strand only tend to be wrong. This may be caused by duplicates, wrong alignments, context-related error dependency and other factors. You may perform a test to rule out such SNPs. In addition, if you have high depth, you may consider increasing varFilter -d to require higher coverage on SNPs. The miscalls in repetitive regions also look mysterious to me, because the high mapping quality indicates that the alignments are reliable. Were you aligning against the whole genome or a targeted region only?
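The post does not say which test is meant. One common choice for strand bias is a two-sided Fisher's exact test on the 2x2 table of reference/alternate allele counts by strand, so here is a minimal sketch under that assumption. The function name `strand_bias_pvalue` and the idea of filtering on a p-value cutoff are illustrative, not part of samtools.

```python
from math import comb


def strand_bias_pvalue(ref_fwd, ref_rev, alt_fwd, alt_rev):
    """Two-sided Fisher's exact test on a 2x2 strand table.

    Rows are alleles (reference, alternate); columns are strands
    (forward, reverse). A small p-value suggests the alternate allele
    is distributed across strands differently from the reference.
    """
    n_ref = ref_fwd + ref_rev
    n_alt = alt_fwd + alt_rev
    n_fwd = ref_fwd + alt_fwd
    total = n_ref + n_alt
    denom = comb(total, n_fwd)

    def prob(k):
        # Hypergeometric probability of k forward-strand reference reads,
        # with all row and column totals held fixed.
        return comb(n_ref, k) * comb(n_alt, n_fwd - k) / denom

    lo = max(0, n_fwd - n_alt)
    hi = min(n_ref, n_fwd)
    p_obs = prob(ref_fwd)
    # Sum every table that is at least as unlikely as the observed one;
    # the small tolerance guards against floating-point ties.
    p = sum(q for q in map(prob, range(lo, hi + 1)) if q <= p_obs * (1 + 1e-7))
    return min(1.0, p)


# Alternate allele seen on the forward strand only: suspicious.
print(strand_bias_pvalue(10, 10, 8, 0))
# Alternate allele balanced across strands: unremarkable.
print(strand_bias_pvalue(10, 10, 4, 4))
```

Any cutoff applied to the returned p-value is a judgment call and depends on depth; at low coverage the test has little power, so a one-strand-only SNP may not reach significance even when it is wrong.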

**cwivagg** · 02-02-2010, 07:00 AM

This is using the Maq aligner, but using only single-end reads, so if I understand how Maq works correctly, that means it's not doing any gapped alignment? My apologies for omitting this critical piece of information.

In any case, it's my strong suspicion that the SNPs are wrong, and you're right that turning up the varFilter or awking a bit differently would stop these SNPs from showing up. I just thought that the strandedness phenomenon was interesting, and wondered whether it had come up before or whether its cause was known.
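For reference, a rough Python sketch of that kind of filter: it counts how many reads support a given alternate base on each strand, working from the read-bases column of a pileup line. It assumes the usual pileup encoding (uppercase for forward-strand mismatches, lowercase for reverse-strand mismatches, `^` plus one mapping-quality character at read starts, `$` at read ends, `+N`/`-N` followed by N bases for indels). The function names `strand_counts` and `one_strand_only` are made up for illustration.

```python
import re


def strand_counts(read_bases, alt):
    """Return (forward, reverse) counts of reads supporting base `alt`."""
    # Remove read-start markers: '^' plus the mapping-quality character.
    s = re.sub(r"\^.", "", read_bases)
    # Remove read-end markers.
    s = s.replace("$", "")

    # Skip indel annotations such as '+2AG' or '-1t' so their letters
    # are not mistaken for mismatches at this position.
    kept = []
    i = 0
    while i < len(s):
        if s[i] in "+-":
            j = i + 1
            while j < len(s) and s[j].isdigit():
                j += 1
            i = j + int(s[i + 1:j])
        else:
            kept.append(s[i])
            i += 1
    bases = "".join(kept)

    # Uppercase = forward strand, lowercase = reverse strand.
    return bases.count(alt.upper()), bases.count(alt.lower())


def one_strand_only(read_bases, alt):
    """True if `alt` is supported by reads on exactly one strand."""
    fwd, rev = strand_counts(read_bases, alt)
    return (fwd > 0) != (rev > 0)


print(strand_counts("..,A$a^Ia+2AAa,", "A"))
print(one_strand_only("..AAA.", "A"))
```

Which column holds the read bases depends on how pileup was run (with or without consensus calling), so the sketch takes the string directly instead of guessing a column index.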

You raise a good point about the reads in repetitive regions; I guess if the mapping quality is good, I should be trusting the alignment. In response to your question, it was aligned against the whole genome... so I have no reason to think that it would align better somewhere else that the aligner just didn't see, because the mapping quality takes into account the whole genome. That having been said, the indicated SNP confidence is low (lower, at any rate, than for the other SNPs)... I guess I don't know what to make of these regions either.

Thank you for the quick response!

Carl

SAMtools Pileup Strand Weirdness
