**Jon_Keats** · 06-23-2011, 01:25 PM

What was your experiment design? Standard Illumina mate-pair? Read length? Also, how did you reverse complement the reads, and what were the bwa command-line arguments?

I've done standard Illumina MP preps before, reverse complemented with fastx-toolkit, and aligned with bwa using standard parameters, with good success. See below:

133724796 in total
0 QC failure
45170770 duplicates
121800898 mapped (91.08%)
133724796 paired in sequencing
66862398 read1
66862398 read2
102897270 properly paired (76.95%)
115119322 with itself and mate mapped
6681576 singletons (5.00%)
3387642 with mate mapped to a different chr
2495203 with mate mapped to a different chr (mapQ>=5)

**apratap** · 06-23-2011, 01:37 PM

Hi Jon

Thanks for your reply. The protocol is not standard: we are trying to sequence the ends of transcripts using the mate-pair technique.

The data I get after linker removal has a variable read length of 60 +/- 20 bp. I reverse complement the reads following the basic definition: reverse the read, then complement it, and also reverse the quality string.

One thing that could trick BWA is the variable fragment size, as it depends on the length of the transcripts we are trying to capture.

As for BWA options, I have pretty much used the standard ones. At this point I am less concerned about the mapping percentage than about whether the reads need to be reverse complemented before mapping with BWA, and how it handles multi-mapping reads.
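For anyone following along, the operation described above can be sketched in Python as below. This is only an illustration of the definition, not the script actually used; the function name `reverse_complement_record` is made up for the example, and it assumes plain A/C/G/T/N bases with one quality character per base.

```python
# Complement table for DNA bases, including N and lowercase letters.
_COMPLEMENT = str.maketrans("ACGTNacgtn", "TGCANtgcan")


def reverse_complement_record(seq, qual):
    """Return (sequence, quality) of one FASTQ record for the opposite strand."""
    if len(seq) != len(qual):
        raise ValueError("sequence and quality strings differ in length")
    # Bases are complemented and reversed; qualities are only reversed,
    # so each quality value stays attached to its own base.
    return seq.translate(_COMPLEMENT)[::-1], qual[::-1]


if __name__ == "__main__":
    print(reverse_complement_record("ACGTTN", "IIHGF#"))
```

Reversing the quality string together with the sequence is what keeps the two lines in register; forgetting it leaves the low-quality 3' tail scores on the wrong end of the read.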

Thanks!
-Abhi

**Jon_Keats** · 06-23-2011, 01:48 PM

I'm assuming you are trying to get the 5' and 3' ends of each RNA species by circularizing the cDNA? Neat idea, but the weird insert-size distribution when mapped to the genome will definitely give bwa some problems. You might want to try Tophat instead for the alignment.

For reads that map to multiple locations, bwa will randomly select one placement unless the mate dictates the location, and it will report the other potential sites. Even when the mate fixes the location, it should still report the alternative options.
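If it helps, the alternative sites end up in the `XA` tag of the SAM record, formatted as `chr,pos,CIGAR,NM;` entries with a signed position giving the strand. A rough Python sketch for pulling them out is below; the function name `alternative_hits` is just for illustration, and it assumes a tab-separated SAM line as written by bwa.

```python
def alternative_hits(sam_line):
    """Return the XA alternative hits of one SAM line as a list of tuples.

    Each tuple is (chrom, strand, pos, cigar, edit_distance).
    An empty list means the record carries no XA tag.
    """
    fields = sam_line.rstrip("\n").split("\t")
    for tag in fields[11:]:  # optional tags start at column 12
        if tag.startswith("XA:Z:"):
            hits = []
            for entry in tag[5:].split(";"):
                if not entry:  # the tag value ends with a trailing ';'
                    continue
                chrom, pos, cigar, nm = entry.split(",")
                strand = "-" if pos.startswith("-") else "+"
                hits.append((chrom, strand, abs(int(pos)), cigar, int(nm)))
            return hits
    return []
```

A record with no `XA` tag is not automatically unique, since bwa only lists alternatives up to a cap, so it is worth checking mapping quality as well.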

**apratap** · 06-23-2011, 01:51 PM

Any idea why BWA needs reads to be inward facing (---> <---) for it to map them?

I guess Tophat will not work because the read lengths are variable and, as far as I understand, the version I used requires reads 1 and 2 to be of equal length. In our case, the read length varies depending on where the linker is identified.

-Abhi

**Jon_Keats** · 06-23-2011, 04:38 PM

Maybe try BWA in single-end mode, filter out the reads aligning to multiple locations, then manually pair the reads using perl or something similar to find the mates that mark the ends of your RNA species.
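Something along these lines might do for the manual pairing step, sketched in Python rather than perl. It is only a sketch under some assumptions: read 1 and read 2 were aligned separately in single-end mode into two SAM files, mate names match once any trailing `/1` or `/2` is stripped, "unique" means mapped with MAPQ at or above a cutoff and not flagged `XT:A:R` (repeat) by bwa, and both ends of a transcript should land on the same reference sequence. The function names and the default cutoff of 20 are made up for the example.

```python
def unique_hits(sam_lines, min_mapq=20):
    """Map read name -> (chrom, pos, strand) for confidently placed reads."""
    hits = {}
    for line in sam_lines:
        if line.startswith("@"):  # skip SAM header lines
            continue
        f = line.rstrip("\n").split("\t")
        name, flag, chrom = f[0], int(f[1]), f[2]
        pos, mapq = int(f[3]), int(f[4])
        # Drop unmapped (0x4) and secondary (0x100) records and low MAPQ.
        if flag & 0x4 or flag & 0x100 or mapq < min_mapq:
            continue
        # Drop reads that bwa itself marks as repeats.
        if "XT:A:R" in f[11:]:
            continue
        if name.endswith(("/1", "/2")):  # assumed mate-suffix convention
            name = name[:-2]
        hits[name] = (chrom, pos, "-" if flag & 0x10 else "+")
    return hits


def pair_ends(read1_sam, read2_sam, min_mapq=20):
    """Pair uniquely mapped mates that fall on the same reference sequence."""
    first = unique_hits(read1_sam, min_mapq)
    second = unique_hits(read2_sam, min_mapq)
    pairs = {}
    for name in first.keys() & second.keys():
        if first[name][0] == second[name][0]:
            pairs[name] = (first[name], second[name])
    return pairs
```

Both arguments can be open file handles, e.g. `pair_ends(open("read1.sam"), open("read2.sam"))` with whatever file names you used. The two positions in each pair then bracket the candidate transcript, and the strands tell you which end is which.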

BWA behaviour with Mate Pair data + Multi read mapping
