SEQanswers

Old 11-05-2012, 05:56 AM   #1
rndouglas
Member
 
Location: USA

Join Date: Jun 2012
Posts: 23
Bowtie mapping with more than 3 mismatches?

I'm trying to map some sRNA reads to specific loci of interest while allowing up to 3 mismatches using -v 3.

For some of my target loci, this seems to work well. All of the mapped reads have no more than 3 mismatches.

However, when I use the exact same code, changing only the bowtie files for a new locus, I'll get reads with up to 20 mismatches (and the sRNA read is only 24nt long!).

I've attached two screenshots from IGV showing the same sRNA sample mapped with Bowtie against two different loci. All of the nucleotides shown are mismatches.

How I expect it to always look:
[IGV screenshot not preserved]

How it usually turns out:
[IGV screenshot not preserved]

The code I'm using is as follows:

Code:
bowtie -f -v 3 -S ~/CRM2 ~/sRNA.fa > ~/output.sam
Oddly, I also have the exact same issue (too many mismatches) when I use this code:
Code:
bowtie -f -n 0 -l 25 -S ~/CRM2 ~/sRNA.fa > ~/output.sam
Am I doing something wrong? Has anyone else seen this?
Old 11-09-2012, 07:59 AM   #2
rndouglas
Member
 
Location: USA

Join Date: Jun 2012
Posts: 23

Cranking up the frustration a notch....

This week I discovered that the same situation (up to 20 mismatches on a 24 nt "mapped" read) occurs when I set -v to 0, which should not allow any mismatches.

This happens with ~4-5 loci I'm interested in mapping to, but several other loci of similar length do not show this problem at all.
Old 07-28-2015, 06:28 AM   #3
Nepher
Junior Member
 
Location: france

Join Date: Jan 2013
Posts: 4
Have the same problem

Hi all,

Since this thread has no answer and I came across exactly the same issue, I wanted to ask whether anyone knows how this type of alignment is possible. Where is the mistake?
I also have 24 mismatches on sequences that were mapped with the no-mismatch option...

If you could help, I would be very grateful.

Thanks a lot
Old 07-29-2015, 04:41 AM   #4
rndouglas
Member
 
Location: USA

Join Date: Jun 2012
Posts: 23

For me, it was an issue with IGV. When I looked more closely, IGV was coloring nucleotides as mismatched when they actually matched the reference. I never found a workaround or an explanation, so I just had to ignore all of the claimed mismatches.

Bowtie was mapping properly with zero mismatches allowed.
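
One way to check this without relying on IGV's coloring is to read the mismatch counts straight from the SAM file. The following is a minimal sketch, not something from the original posts: it assumes the SAM records carry the optional `NM:i:` (edit distance) tag, which Bowtie's `-S` output normally includes, and the function name `nm_histogram` is made up for illustration.

```shell
# Hypothetical helper: tabulate the NM:i: values found in a SAM file.
# Assumes alignments carry an NM:i: tag; records without one are ignored.
nm_histogram() {
    awk -F'\t' '
        /^@/ { next }                      # skip SAM header lines
        {
            # optional tags start at column 12
            for (i = 12; i <= NF; i++)
                if ($i ~ /^NM:i:/)
                    count[substr($i, 6)]++ # text after "NM:i:"
        }
        END { for (n in count) print n "\t" count[n] }
    ' "$1" | sort -n
}
```

Running `nm_histogram ~/output.sam` prints one line per distinct NM value followed by the number of alignments with that value. If every alignment produced with `-v 0` reports 0 here, the aligner did what was asked and the extra "mismatches" come from how the viewer compares reads to its reference.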
Old 07-29-2015, 05:50 AM   #5
Nepher
Junior Member
 
Location: france

Join Date: Jan 2013
Posts: 4

Thanks for your answer.

I came across a solution this morning. After checking lots of things, I looked again at my reference sequence and saw that there were special characters at the beginning of the sequence:

>name
??????ATCGTGCTA

I simply erased them, and most of the mismatches disappeared in IGV (only those that really come from the mapping remained). I think the mapping coordinates were simply shifted relative to the reference sequence.
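
A quick scan of the reference FASTA can surface stray characters like these. This is a rough sketch under my own assumptions, not a tested recipe from the thread: the function name `find_bad_fasta_lines` is hypothetical, and "valid" here means the IUPAC nucleotide letters in upper or lower case.

```shell
# Hypothetical helper: list sequence lines of a FASTA file that contain
# anything other than IUPAC nucleotide codes (stray bytes, '?', CR, spaces).
find_bad_fasta_lines() {
    # LC_ALL=C makes awk compare bytes, so odd encodings are still caught
    LC_ALL=C awk '
        !/^>/ && /[^ACGTUNRYKMSWBDHVacgtunrykmswbdhv]/ {
            print NR ": " $0               # line number, then the offending line
        }
    ' "$1"
}
```

For the example above, `find_bad_fasta_lines reference.fa` would report line 2. After cleaning the file, it is probably safest to rebuild the Bowtie index from the cleaned FASTA and load that same file in IGV so both use identical coordinates; that step is my assumption, not something confirmed in this thread.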

Hope this will help other users with IGV problems
All times are GMT -8.

