![]() |
|
![]() |
||||
Thread | Thread Starter | Forum | Replies | Last Post |
Tophat2 Error running 'long_spanning_reads': | dvanic | Bioinformatics | 16 | 04-26-2013 01:56 PM |
tophat2/bowtie2 inconsistency in number of unmapped reads | manianslab | Bioinformatics | 2 | 07-13-2012 01:56 PM |
Bowtie2 running infinitely | jaZt | Bioinformatics | 0 | 01-16-2012 02:47 AM |
GATK CountCovariates running very slow | indapa | Bioinformatics | 1 | 06-30-2011 06:46 AM |
samtools sort running extremely slow | tsucheta | Bioinformatics | 2 | 06-11-2010 07:30 AM |
![]() |
|
Thread Tools |
![]() |
#1 |
Junior Member
Location: Huntington, WV USA Join Date: Dec 2011
Posts: 6
|
![]()
We recently completed our first RNA-seq runs on an Illumina Hi-Seq 1000. These were paired-end 100bp reads, with ~30-50 million paired reads per sample. Some are human cell line, some human patient sample, and some are rat.
(This means I'm a newbie for tophat and bowtie.) I've been trying to run alignments with Tophat2 for these reads. I was successful with the rat samples running Tophat 2.0.5 over Bowtie 0.12.8; however for some of the human samples the run crashed on long_spanning_reads. I could fix this by switching to Bowtie 2.0.0.7, but the runs were taking prohibitively long using this version. Under Bowtie 0.12.8 the successful runs are completing in ~5-7 hours; under Bowtie 2.0.0.7 the only one I have let run to completion has taken 40 hours. Since I have about 40 samples to process, this is not going to work. (I'm using Red Hat Linux with 40 processors committed to the job @ 2.00GHz. The machine has 128GB of RAM and is not running out of physical memory.) According to this thread: http://seqanswers.com/forums/showthread.php?t=22438 the "long_spanning_reads" error is fixed in the latest version of tophat (2.0.6), so for now my solution is to run Tophat 2.0.6 over Bowtie 0.12.8. Hopefully this will work but I would really like to be running the most recent version of Bowtie. The only non-default options I'm providing to tophat are -p 40 for the multithreading and --bowtie1 if I'm running against bowtie 0.12.8. I am not providing annotations, though I actually tried that and it made no difference either to the speed or to the long_spanning_reads error. Has anyone experienced this drop in performance between bowtie 0.12.8 and bowtie 2.0.0.7 when using tophat2? Any suggestions? |
![]() |
![]() |
![]() |
#2 |
Member
Location: Sweden Join Date: Nov 2009
Posts: 23
|
![]()
I've also noticed that tophat (v.2.0.7) is extremly slow when run in conjunction with bowtie2. I was going to have a look at my options ( -r 100 -m 1 -p 1 --coverage-search --microexon-search --library-type fr-unstranded) to see if there were anything there. According to the program the coverage search can be really slow but I've also found that the alignments per se is really slow. I multiplex several samples rather then having many cores on one samples so it takes more then one week per sample (human RNA, around 40-50 million reads).
We have started looking into star which is way (more then factor 10) faster. But since we haven't evaluated that just yet I'd really like to keep on using tophat. /Petter |
![]() |
![]() |
![]() |
#3 |
Junior Member
Location: Huntington, WV USA Join Date: Dec 2011
Posts: 6
|
![]()
I ended up running Tophat2 over Bowtie 1; the latest version of each resolved the problems I was having with crashing. It would be nice to understand why Bowtie2 is prohibitively slow, though.
|
![]() |
![]() |
![]() |
Tags |
bowtie, bowtie2, performance, tophat2 |
Thread Tools | |
|
|