SEQanswers

Go Back   SEQanswers > Bioinformatics > Bioinformatics



Similar Threads
Thread Thread Starter Forum Replies Last Post
IUPAC coded reference sequences data Bioinformatics 9 11-29-2011 08:50 PM
BWA & Reverse Sequences dp05yk Bioinformatics 1 06-14-2011 08:15 PM
BWA: reference sequences longer than 4GB ElMichael Bioinformatics 1 06-06-2011 01:22 PM
cuffcompare not matching reference genome genec Bioinformatics 11 06-08-2010 10:35 AM
BWA and masking sequences... spindrift Bioinformatics 4 02-17-2010 11:20 AM

Reply
 
Thread Tools
Old 11-18-2010, 07:38 PM   #1
MBekritsky
Member
 
Location: CSHL

Join Date: Nov 2009
Posts: 15
Default BWA reporting of sequences matching to reverse complement of reference

Hi,

I'm doing this as a sanity check.

I'm using BWA to align simulated reads to a reference genome, and I'm noticing that in the SAM file, when a read maps to the reverse complement, the reported start position is for the 3' end of the original reverse complemented query sequence (the 5' position of the read in the forward strand in the genome). Also, BWA appears to reverse complement the original query sequence and reverse the quality string. Did I get that right? I'm just trying to make sure I thoroughly understand BWA's output.

Thanks!
MBekritsky is offline   Reply With Quote
Old 11-19-2010, 02:34 AM   #2
dawe
Senior Member
 
Location: 4530'25.22"N / 915'53.00"E

Join Date: Apr 2009
Posts: 258
Default

AFAIK, this is not only BWA, it is a SAM format specification.
See here, page 5 note 7.

d
dawe is offline   Reply With Quote
Old 11-19-2010, 03:46 AM   #3
MBekritsky
Member
 
Location: CSHL

Join Date: Nov 2009
Posts: 15
Default

Thanks dawe!

I've looked at the SAM manual a number of times, but I seem to have always missed that note....
MBekritsky is offline   Reply With Quote
Reply

Tags
bwa, sequence alignment

Thread Tools

Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

BB code is On
Smilies are On
[IMG] code is On
HTML code is Off




All times are GMT -8. The time now is 02:58 PM.


Powered by vBulletin® Version 3.8.9
Copyright ©2000 - 2021, vBulletin Solutions, Inc.
Single Sign On provided by vBSSO