SEQanswers

04-02-2013, 01:43 PM   #1
Amative
Member
Location: USA
Join Date: Dec 2011
Posts: 45

RNA-Seq alignment issue

Hello all,

I have been given paired-end RNA-Seq files to align against a couple of references. I used Bowtie2 to do the job. The alignment rates were very low in most cases (less than 5% overall alignment rate). We now think this might be caused by either contamination or a sample mix-up.

Any suggestions on what to do in such a case?
Thank you in advance.
04-02-2013, 02:47 PM   #2
swbarnes2
Senior Member
Location: San Diego
Join Date: May 2008
Posts: 912

Bioinformatically, there's nothing you can do other than help the people involved figure out what went wrong.

For starters, spot-check some random high-quality reads: BLAST them against nr and see if you can determine what they are.
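As a rough illustration of that spot-check, here is a minimal Python sketch that picks a random handful of high-quality reads from a FASTQ file and formats them as FASTA for pasting into BLAST. It assumes plain 4-line FASTQ records and Phred+33 quality encoding; the function names, the mean-quality cutoff of 30 and the sample size are made up for the example, not taken from any particular tool.

```python
import io
import random


def read_fastq(handle):
    """Yield (name, sequence, quality) from a 4-line-per-record FASTQ handle."""
    while True:
        header = handle.readline().rstrip()
        if not header:
            return
        seq = handle.readline().rstrip()
        handle.readline()  # '+' separator line
        qual = handle.readline().rstrip()
        yield header[1:], seq, qual


def mean_phred(qual, offset=33):
    """Mean Phred score of a quality string (Phred+33 is an assumption)."""
    return sum(ord(c) - offset for c in qual) / len(qual)


def sample_high_quality(handle, n=10, min_mean_q=30, seed=0):
    """Reservoir-sample up to n reads whose mean quality is >= min_mean_q."""
    rng = random.Random(seed)
    reservoir = []
    seen = 0  # number of reads that passed the quality filter so far
    for name, seq, qual in read_fastq(handle):
        if not qual or mean_phred(qual) < min_mean_q:
            continue
        seen += 1
        if len(reservoir) < n:
            reservoir.append((name, seq))
        else:
            # Replace an existing pick with probability n / seen.
            j = rng.randrange(seen)
            if j < n:
                reservoir[j] = (name, seq)
    return reservoir


def to_fasta(records):
    """Format (name, sequence) pairs as FASTA text."""
    return "".join(f">{name}\n{seq}\n" for name, seq in records)


if __name__ == "__main__":
    demo = "@r1\nACGT\n+\nIIII\n@r2\nACGT\n+\n!!!!\n@r3\nGGCC\n+\nIIII\n"
    print(to_fasta(sample_high_quality(io.StringIO(demo), n=2)))
```

For a real file you would pass an open file handle instead of the `io.StringIO` demo (for gzipped files, something like `gzip.open(path, "rt")`). Reservoir sampling is used so the whole file never has to be held in memory.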

Try aligning to the whole genome to see how much of the library was genomic.

See if certain highly repetitive sequences (like Illumina adapters) are taking up a lot of the reads.
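One simple way to look for such sequences is to count exact duplicate reads and list the most common ones. The sketch below assumes plain 4-line FASTQ records; `overrepresented` is a hypothetical helper name, and exact-match counting is only a crude stand-in for what a dedicated QC tool reports.

```python
import io
from collections import Counter


def sequence_lines(handle):
    """Yield the sequence line of every 4-line FASTQ record."""
    for i, line in enumerate(handle):
        if i % 4 == 1:
            yield line.strip()


def overrepresented(handle, top=5):
    """Return (sequence, count, fraction_of_all_reads) for the most common reads."""
    counts = Counter(sequence_lines(handle))
    total = sum(counts.values())
    return [(seq, n, n / total) for seq, n in counts.most_common(top)]


if __name__ == "__main__":
    demo = (
        "@a\nAAAA\n+\nIIII\n"
        "@b\nAAAA\n+\nIIII\n"
        "@c\nCCCC\n+\nIIII\n"
        "@d\nAAAA\n+\nIIII\n"
    )
    for seq, n, frac in overrepresented(io.StringIO(demo)):
        print(f"{seq}\t{n}\t{frac:.1%}")
```

Counting every distinct read keeps all of them in memory, so on a full lane it is probably wiser to run this on the first few hundred thousand reads or on a subsample. A sequence that accounts for a large share of reads is a good candidate to BLAST or to compare against known adapter sequences.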

And of course see if the run overall was of good enough quality for you to believe that your reads are accurate.
04-03-2013, 02:57 AM   #3
rboettcher
Member
Location: Berlin
Join Date: Oct 2010
Posts: 71

Hi Amative,

What kind of reference did you provide? Bowtie2 is not splice-aware, so it cannot deal with reads spanning splice junctions. For RNA-Seq, it can therefore only be used to align against the transcriptome. This is why TopHat was created: to align RNA-Seq reads against the whole genome.

Regards
04-03-2013, 08:11 AM   #4
Amative
Member
Location: USA
Join Date: Dec 2011
Posts: 45

Thanks swbarnes2 & rboettcher,

@swbarnes2
  • I BLASTed the first ten reads from one of my samples; the results were not that good. I then tried aligning against the available sequences for two of the top BLAST hits and got the same low alignment rate.
  • I checked for adapters; the sequences are already trimmed.

@rboettcher
Yes, I am aligning against the transcriptome sequences.
04-03-2013, 08:39 AM   #5
GenoMax
Senior Member
Location: East Coast USA
Join Date: Feb 2008
Posts: 7,082

Quote:
Originally Posted by Amative
I tried to blast the first ten reads from one of the samples I have, blast results were not that good.

You probably want to go some way into the file. With Illumina at least, the first hundred (or more) sequences may not represent the best of the lot, since they generally come from the edge of the flowcell/start of the lane.
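A minimal Python sketch of going some way into the file: skip a block of records at the start, then take the next few. It assumes plain 4-line FASTQ records; `records_from_middle` and the default of skipping 100,000 records are arbitrary choices for illustration.

```python
import io
from itertools import islice


def records_from_middle(handle, skip=100_000, take=10):
    """Return `take` FASTQ records (4 lines each) after skipping `skip` records."""
    lines = islice(handle, skip * 4, (skip + take) * 4)
    lines = [line.rstrip("\n") for line in lines]
    return [tuple(lines[i:i + 4]) for i in range(0, len(lines), 4)]


if __name__ == "__main__":
    demo = "".join(f"@r{i}\nACGT\n+\nIIII\n" for i in range(1, 6))
    for header, seq, _, qual in records_from_middle(io.StringIO(demo), skip=2, take=2):
        print(header, seq, qual)
```

If the file has fewer records than `skip`, the function simply returns an empty list, so it is worth checking the result before BLASTing.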

You may also want to use this tool to do some screening: http://www.bioinformatics.babraham.a.../fastq_screen/
04-08-2013, 12:21 PM   #6
Amative
Member
Location: USA
Join Date: Dec 2011
Posts: 45

Thanks GenoMax for the suggestion; I am working on it.

I like fastq_screen. It saves some time!