SEQanswers

Go Back   SEQanswers > Bioinformatics > Bioinformatics



Similar Threads
Thread Thread Starter Forum Replies Last Post
Tophat2 Error running 'long_spanning_reads': dvanic Bioinformatics 16 04-26-2013 01:56 PM
tophat2/bowtie2 inconsistency in number of unmapped reads manianslab Bioinformatics 2 07-13-2012 01:56 PM
Bowtie2 running infinitely jaZt Bioinformatics 0 01-16-2012 02:47 AM
GATK CountCovariates running very slow indapa Bioinformatics 1 06-30-2011 06:46 AM
samtools sort running extremely slow tsucheta Bioinformatics 2 06-11-2010 07:30 AM

Reply
 
Thread Tools
Old 10-26-2012, 10:28 AM   #1
jdenvir
Junior Member
 
Location: Huntington, WV USA

Join Date: Dec 2011
Posts: 6
Default Tophat2 very slow when running over Bowtie2

We recently completed our first RNA-seq runs on an Illumina Hi-Seq 1000. These were paired-end 100bp reads, with ~30-50 million paired reads per sample. Some are human cell line, some human patient sample, and some are rat.

(This means I'm a newbie for tophat and bowtie.)

I've been trying to run alignments with Tophat2 for these reads. I was successful with the rat samples running Tophat 2.0.5 over Bowtie 0.12.8; however for some of the human samples the run crashed on long_spanning_reads.

I could fix this by switching to Bowtie 2.0.0.7, but the runs were taking prohibitively long using this version. Under Bowtie 0.12.8 the successful runs are completing in ~5-7 hours; under Bowtie 2.0.0.7 the only one I have let run to completion has taken 40 hours. Since I have about 40 samples to process, this is not going to work. (I'm using Red Hat Linux with 40 processors committed to the job @ 2.00GHz. The machine has 128GB of RAM and is not running out of physical memory.)

According to this thread: http://seqanswers.com/forums/showthread.php?t=22438 the "long_spanning_reads" error is fixed in the latest version of tophat (2.0.6), so for now my solution is to run Tophat 2.0.6 over Bowtie 0.12.8. Hopefully this will work but I would really like to be running the most recent version of Bowtie.

The only non-default options I'm providing to tophat are -p 40 for the multithreading and --bowtie1 if I'm running against bowtie 0.12.8. I am not providing annotations, though I actually tried that and it made no difference either to the speed or to the long_spanning_reads error.

Has anyone experienced this drop in performance between bowtie 0.12.8 and bowtie 2.0.0.7 when using tophat2? Any suggestions?
jdenvir is offline   Reply With Quote
Old 02-18-2013, 12:39 AM   #2
pettervikman
Member
 
Location: Sweden

Join Date: Nov 2009
Posts: 23
Default

I've also noticed that tophat (v.2.0.7) is extremly slow when run in conjunction with bowtie2. I was going to have a look at my options ( -r 100 -m 1 -p 1 --coverage-search --microexon-search --library-type fr-unstranded) to see if there were anything there. According to the program the coverage search can be really slow but I've also found that the alignments per se is really slow. I multiplex several samples rather then having many cores on one samples so it takes more then one week per sample (human RNA, around 40-50 million reads).

We have started looking into star which is way (more then factor 10) faster. But since we haven't evaluated that just yet I'd really like to keep on using tophat.

/Petter
pettervikman is offline   Reply With Quote
Old 02-18-2013, 06:28 AM   #3
jdenvir
Junior Member
 
Location: Huntington, WV USA

Join Date: Dec 2011
Posts: 6
Default

I ended up running Tophat2 over Bowtie 1; the latest version of each resolved the problems I was having with crashing. It would be nice to understand why Bowtie2 is prohibitively slow, though.
jdenvir is offline   Reply With Quote
Reply

Tags
bowtie, bowtie2, performance, tophat2

Thread Tools

Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

BB code is On
Smilies are On
[IMG] code is On
HTML code is Off




All times are GMT -8. The time now is 09:22 AM.


Powered by vBulletin® Version 3.8.9
Copyright ©2000 - 2020, vBulletin Solutions, Inc.
Single Sign On provided by vBSSO