SEQanswers

Old 02-10-2014, 02:50 AM   #1
nk
Member
 
Location: UK

Join Date: Apr 2012
Posts: 11
bwa reporting of multi-mapped reads

I am mapping some RNA-seq data with bwa and would like to do some analysis on where multi-mapped reads fall.

I know that I can extract multi-mapped reads by looking for MAPQ < 23 and/or the XA tag on the reads. However, I am wondering how bwa decides which location to report for a read that maps equally well to two different locations. Does it choose one at random? Does it always report the first one? Something else?

Does anybody know what exactly bwa does here?
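
For readers who want to try the extraction heuristic mentioned in this post, here is a minimal sketch in plain Python that reads SAM text lines. The cutoff of 23 is simply the value quoted above, not a bwa constant, and the function names and example records are made up for illustration. It assumes the aligner writes alternative hits to an `XA:Z:` optional field.

```python
def is_multimapped(sam_line, mapq_cutoff=23):
    """Return True if a SAM alignment line looks multi-mapped.

    Heuristic from the post: MAPQ below a cutoff and/or an XA tag present.
    mapq_cutoff=23 is the poster's value, used here as an assumption.
    """
    fields = sam_line.rstrip("\n").split("\t")
    flag = int(fields[1])
    if flag & 0x4:  # 0x4 = read unmapped, so it cannot be multi-mapped
        return False
    mapq = int(fields[4])
    # Optional fields start at column 12 (index 11) in SAM.
    has_xa = any(tag.startswith("XA:Z:") for tag in fields[11:])
    return mapq < mapq_cutoff or has_xa


def multimapped_reads(sam_lines, mapq_cutoff=23):
    """Yield (read name, reference, position) for multi-mapped alignments."""
    for line in sam_lines:
        if line.startswith("@"):  # skip SAM header lines
            continue
        if is_multimapped(line, mapq_cutoff):
            name, _, rname, pos = line.split("\t")[:4]
            yield name, rname, int(pos)


# Made-up records: r1 is unique, r2 has an XA tag, r3 is unmapped.
example = [
    "@SQ\tSN:chr1\tLN:1000",
    "r1\t0\tchr1\t100\t37\t4M\t*\t0\t0\tACGT\tIIII\tNM:i:0",
    "r2\t0\tchr1\t200\t0\t4M\t*\t0\t0\tACGT\tIIII\tXA:Z:chr2,+300,4M,0;",
    "r3\t4\t*\t0\t0\t*\t*\t0\t0\tACGT\tIIII",
]
print(list(multimapped_reads(example)))
```

On a real file the same generator can be fed an open file handle, for example `multimapped_reads(open("aln.sam"))`, where `aln.sam` is a placeholder name.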
Old 02-10-2014, 04:10 AM   #2
TiborNagy
Senior Member
 
Location: Budapest

Join Date: Mar 2010
Posts: 329

There is a command-line argument, -R. Once BWA reaches this limit, it stops searching for further locations.
Old 02-10-2014, 04:20 AM   #3
nk
Member
 
Location: UK

Join Date: Apr 2012
Posts: 11

I don't think this answers my question. I want to know how bwa decides which of the locations it reports as the primary alignment.
Old 02-14-2014, 10:37 PM   #4
sdriscoll
I like code
 
Location: San Diego, CA, USA

Join Date: Sep 2009
Posts: 438

It's a random choice. Also, when there is more than one equally "best" hit for a read, the alignment gets a MAPQ of 0. When the author benchmarks the BWA tools, he usually throws out alignments with MAPQ = 0, since those are random assignments.
__________________
/* Shawn Driscoll, Gene Expression Laboratory, Pfaff
Salk Institute for Biological Studies, La Jolla, CA, USA */

Last edited by sdriscoll; 02-14-2014 at 10:39 PM.
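
The behaviour described in post #4 (a random pick among equally good hits, reported with MAPQ 0) can be illustrated with a small sketch. This is not bwa's source code: the hit tuples, the function name `pick_primary`, and the choice to return `None` for the unique case are all assumptions made for illustration.

```python
import random


def pick_primary(hits, rng=random):
    """Pick a primary alignment from candidate hits.

    hits: list of (reference, position, score) tuples (hypothetical format).
    Returns (primary_hit, mapq). When several hits tie for the best score,
    one is chosen at random and MAPQ is 0, as described in the post.
    For a unique best hit MAPQ is returned as None, because how a real
    aligner scores that case is not modelled here.
    """
    best_score = max(score for _, _, score in hits)
    best = [hit for hit in hits if hit[2] == best_score]
    primary = rng.choice(best)
    mapq = 0 if len(best) > 1 else None
    return primary, mapq


# Two equally good placements and one worse one (made-up values).
tied_hits = [("chr1", 100, 50), ("chr7", 2500, 50), ("chr9", 42, 31)]
primary, mapq = pick_primary(tied_hits, rng=random.Random(1))
print(primary, mapq)
```

Passing a seeded `random.Random` makes the pick reproducible, which is useful when comparing runs.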
Old 03-05-2014, 09:40 PM   #5
nk
Member
 
Location: UK

Join Date: Apr 2012
Posts: 11

Great, thank you!
Old 03-06-2014, 12:16 AM   #6
Brian Bushnell
Super Moderator
 
Location: Walnut Creek, CA

Join Date: Jan 2014
Posts: 2,707

Quote:
Originally Posted by sdriscoll
It's a random choice. Also, when there is more than one equally "best" hit for a read, the alignment gets a MAPQ of 0. When the author benchmarks the BWA tools, he usually throws out alignments with MAPQ = 0, since those are random assignments.
This is not correct - if there are two equal-scoring locations, bwa gives both a mapping score of 3 (equivalent to 50% probability), and so forth.

If you want to map RNA-seq data for organisms with splicing, such as eukaryotes, bwa is not the right tool. You should use a splice-aware aligner like Tophat or BBMap.
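
The "3 is equivalent to 50% probability" remark in post #6 follows from the Phred-scaled definition of mapping quality, MAPQ = -10 * log10(P(position is wrong)), rounded to the nearest integer. The sketch below only shows that arithmetic; it makes no claim about which values bwa actually emits, and the cap of 60 and the function names are assumptions.

```python
import math


def mapq_from_error_prob(p_wrong, cap=60):
    """Phred-scale a mapping error probability, rounded to an integer.

    cap is an arbitrary ceiling for p_wrong == 0 (an assumption here).
    """
    if p_wrong <= 0:
        return cap
    return min(cap, round(-10 * math.log10(p_wrong)))


def mapq_for_equal_hits(n):
    """MAPQ if a read has n equally good locations and one is reported.

    With a uniform pick, the chance the reported one is wrong is 1 - 1/n.
    """
    if n < 1:
        raise ValueError("n must be at least 1")
    return mapq_from_error_prob(1 - 1 / n)


for n in (2, 3, 4, 10):
    print(n, mapq_for_equal_hits(n))
```

Two equal locations give -10 * log10(0.5), which is about 3.01 and rounds to 3; more locations push the value towards 0.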
Old 07-19-2017, 04:59 AM   #7
Medhat
Member
 
Location: Poland

Join Date: Jun 2013
Posts: 37

What about duplicates: how does BWA deal with them? And could you give me the source of this information?
Old 07-19-2017, 05:07 AM   #8
Medhat
Member
 
Location: Poland

Join Date: Jun 2013
Posts: 37

Quote:
Originally Posted by Brian Bushnell
This is not correct - if there are two equal-scoring locations, bwa gives both a mapping score of 3 (equivalent to 50% probability), and so forth.

What about duplicates: how does BWA deal with them? And could you give me the source of this information?