Seqanswers Leaderboard Ad

**wwmm933** · 03-14-2011, 06:44 PM

I have the same problem. Hope someone can help us!

**n00c** · 03-14-2011, 07:54 PM

The reason you are seeing reads with more mismatches than specified is because you have paired-end reads, and with paired-end read resolution usually you have a situation where one end maps to some location within the mismatch threshold specified, and the other doesn't, so the other end is then aligned using Smith-Waterman algorithm to the region where one would expect to find it, sometimes producing a quite a few mismatches, indels, or even clipping.

Some mappers allow a user to specify an option that only "independently-mapped" reads should be paired, which would prevent this. Perhaps there is some work-around with BWA, but I would just filter out reads with more mismatches than normal (note that if pair concordance is important to you, the correct approach is to just accept the fact that some reads will contain more mismatches/indels then specified).

**jstjohn** · 03-14-2011, 09:12 PM

If you really want to disable the sensitive mate mapping feature you can do that in bwa sampe with the -s option. Then you could go back and filter your reads for only those where both mates are still mapped using the `-F 12` feature in samtools view (-F means ignore reads containing a flag and 12 = 0x4+0x8 which are the flags for read unmapped and mate-unmapped). I think that would result in only mapped mated reads with 2 mismatches if you set -n 2. I don't think you need to mess with seeding or anything else.

**nilshomer** · 03-14-2011, 09:14 PM

BWA searches for mappings up to N+1 differences, where N is the # of differences you specified (or calculated by read length given no option). The (N+1) is to guarantee all N differences. You could map each end of the read independently ("bwa aln" then "bwa samse") to see if you still see more than N+1 differences, and report them here.

**bpetersen** · 03-14-2011, 11:04 PM

Thank you so much for all your answers! I guess given the reasons for more mismatches than specified, I will keep those alignments. It doesn't seem to affect too many reads (from what I see in IGV), so I think I can live with that.
Does anyone also have an example of the parameters they use for mapping of 75 bp mate-pair Illumina reads? That would be very helpful!

Topics	Statistics	Last Post
Cancer Metastasis: A Deep Dive into Cellular Plasticity by seqadmin Started by seqadmin, 04-11-2024, 12:08 PM	0 responses 30 views 0 likes	Last Post by seqadmin 04-11-2024, 12:08 PM
Proteogenomic Profiles Offer New Clues in Prostate Cancer by seqadmin Started by seqadmin, 04-10-2024, 10:19 PM	0 responses 32 views 0 likes	Last Post by seqadmin 04-10-2024, 10:19 PM
Novel Diagnostic Assay Enhances Ovarian Cancer Detection by seqadmin Started by seqadmin, 04-10-2024, 09:21 AM	0 responses 28 views 0 likes	Last Post by seqadmin 04-10-2024, 09:21 AM
Evolutionary Dynamics of Centromeres: A Comparative Genomic Analysis by seqadmin Started by seqadmin, 04-04-2024, 09:00 AM	0 responses 52 views 0 likes	Last Post by seqadmin 04-04-2024, 09:00 AM

Seqanswers Leaderboard Ad

Announcement

Illumina mapping with bwa

Comment

Comment

Comment

Comment

Comment

Latest Articles

ad_right_rmr

News