![]() |
|
![]() |
||||
Thread | Thread Starter | Forum | Replies | Last Post |
Paired End Mapping using Bowtie | adarshjose | Bioinformatics | 8 | 01-26-2012 07:41 AM |
bowtie- paired end - no alignments | madsaan | Bioinformatics | 1 | 06-27-2011 01:24 PM |
bowtie paired end mismatches | madsaan | Bioinformatics | 0 | 06-22-2011 08:26 AM |
Tophat options to report unaligned reads and controlling Bowtie options | Siva | Bioinformatics | 0 | 10-15-2010 07:38 PM |
bowtie paired end mode | boetsie | Bioinformatics | 1 | 08-18-2010 04:12 AM |
![]() |
|
Thread Tools |
![]() |
#1 |
Junior Member
Location: US Join Date: Aug 2009
Posts: 8
|
![]()
Hi everyone,
I just began to use bowtie to map my solexa paired-end reads to a genome. I found that bowtie has three options to specify the orientation of the paired-end reads, --fr, --rf, and --ff. I tried all of them, and found: --fr, as the default one, can map most of my data onto the genome. For the left ones, --rf and --ff options can map some, but still leave some reads, which can be separately mapped in non-paired-end mode(should I call it single-end mode), but cannot be recognized as Paired-end reads with any of the three options. Carefully examining those remaining reads, I found most of them are in an orientation of rev-rev. In another word, both ends of each pair are mapped to the reverse strand. Why does bowtie have no --rr option? How can I process those reads in paired-end mode? THanks a lot! Dezhi |
![]() |
![]() |
![]() |
#2 |
Senior Member
Location: USA, Midwest Join Date: May 2008
Posts: 1,178
|
![]()
Dezhi,
There is no --rr because that is the same as --ff, molecularly speaking. The 'f' and 'r' do not refer to the strand which the read aligns to, but the relative orientation of read1 and read2. The relative orientation of the reads 1 and 2 is determined by the method used to generate the paired reads; it should be identical for all read pairs in your data set and you should now what the expected orientation is based on the method used to construct the paired end library. (Let's ignore for now that unexpected pair orientations can signal structural variations.) The --fr orientation means that the two reads are aligned to opposite strands and are pointed toward each other (i.e. the 3' ends of the reads are closer together than the 5' ends). Code:
--------------> =============================================== <-------------- Code:
--------------> =============================================== <-------------- Code:
read1 read2 --------------> --------------> =============================================== OR =============================================== <-------------- <-------------- read2 read1 |
![]() |
![]() |
![]() |
#3 |
Junior Member
Location: US Join Date: Aug 2009
Posts: 8
|
![]()
kmcarr, thanks so much. your reply is really helpful.
I guess that my reads should be produced using the Illumina paired end protocol, so most of them are in --fr orientation. But a small portion of the paired-end reads are in --rf/--ff orientation, because they can be mapped with those two options. Some are even in an orientation as illustrated below. I guess they are different from the -ff orientation, because read2 is 5' to read1. Do you have any idea why the same illumina paired-end protocol can generate some reads that is not in -fr orientation? or Do you meet this problem before? read2 read1 --------------> --------------> =============================================== OR =============================================== <-------------- <-------------- read1 read2 |
![]() |
![]() |
![]() |
#4 |
Member
Location: Retirement - Not working with bioinformatics anymore. Join Date: Apr 2010
Posts: 63
|
![]()
This post is really helpful, addressing Illumina mate-pair and paired-end as well as the SOLiD mate-pair. The newest SOLiD machines also have a paired-end, which as near as I can tell is of the configuration:
Code:
read1 read2 --------------> <-------------- =============================================== |
![]() |
![]() |
![]() |
#5 |
Senior Member
Location: Mexico Join Date: Mar 2011
Posts: 137
|
![]()
I'd like to bump this up as I'm trying to map some SOLID 4 reads. Thanks!
|
![]() |
![]() |
![]() |
#6 |
Senior Member
Location: Mexico Join Date: Mar 2011
Posts: 137
|
![]()
Hi!
I'm analyzing a "second-hand" dataset generated using SOLiD 4. It is a transcriptome mate pair library that is 52 x 37 nt, and I cannot for the sake of me find the protocol that was used to generate those specific read lengths. I have F3 and R3 reads, so I am assuming it is a circularization protocol, but I do not know what the size selection parameters were, or how the circles were cut to produce the final fragments. This info would be very valuable for a more accurate mapping. Any knowledge would be greatly appreciated! Thanks a lot, Carmen |
![]() |
![]() |
![]() |
Thread Tools | |
|
|