Hi All,
I have RNA seq data (human, paired end, 72 bases, Solexa) from a bunch of samples. In one of my analyzes I have realized that a big problem is the fact that reads get mis-aligned at exon exon junctions, see attached illustration. So far I have mainly been using Bowtie, and the "errors" I am seeing makes sense given how Bowtie works. So reads will map with a small "overhang" at the exon border, especially if there is some seq. similarity at that border and on the other side of the junction (the other exon). This leads to that I will get what looks like sequence variation, in the attached illustration for example the red AA on the left hand side will look a SNP or an editing event.
One easy approach to "correct" for this would be to only analyze data that falls within exon borders, but this will cause a loss of information. So my question is: Are there other aligners that are better at aligning around exon junctions?
Another option is to align reads to the transcriptome instead, which I have been doing, but genome annotation then gets lost which is a major problem.
Thankful for any thoughts about aligners that would work well in this situation!
Boel
I have RNA seq data (human, paired end, 72 bases, Solexa) from a bunch of samples. In one of my analyzes I have realized that a big problem is the fact that reads get mis-aligned at exon exon junctions, see attached illustration. So far I have mainly been using Bowtie, and the "errors" I am seeing makes sense given how Bowtie works. So reads will map with a small "overhang" at the exon border, especially if there is some seq. similarity at that border and on the other side of the junction (the other exon). This leads to that I will get what looks like sequence variation, in the attached illustration for example the red AA on the left hand side will look a SNP or an editing event.
One easy approach to "correct" for this would be to only analyze data that falls within exon borders, but this will cause a loss of information. So my question is: Are there other aligners that are better at aligning around exon junctions?
Another option is to align reads to the transcriptome instead, which I have been doing, but genome annotation then gets lost which is a major problem.
Thankful for any thoughts about aligners that would work well in this situation!
Boel
Comment