**NateP** · 04-11-2012, 01:01 PM

I had an error pop up at about the same point when I gave it a test run the other day.
It was single-end mapping.
It went through the left_kept_reads step, then:
[timestamp] Searching for junctions via segment mapping
[timestamp] Retrieving sequences for splices
[FAILED]
Error: Splice sequence retrieval failed with err =-6

**Jon_Keats** · 04-11-2012, 10:26 PM

Have you tried the fusion version with the --bowtie1 option, as suggested for fusion detection?

**ahmetz** · 04-12-2012, 10:25 AM

Yeah, and I also removed fusion detection completely, but all the runs end up giving the same error.

**Nicolas** · 04-12-2012, 11:45 AM

Same here. Tophat2 hangs forever at the same step "Searching for junctions via segment mapping", with paired-end HiSeq fastq reads (~140M reads), with or without --fusion-search option.
I ran it successfully on a small subset of the same files (1M reads), so I suspect it has to do with data size / memory...
Hope for a quick fix!
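
A small test subset like the one described above can be cut from the head of each FASTQ file. This is a minimal sketch, assuming uncompressed FASTQ with exactly four lines per read and mates stored in the same order in both files; the function name and file names are hypothetical.

```shell
# Print the first N reads of a FASTQ file (4 lines per read).
# Assumes plain-text, strictly four-line FASTQ records.
fastq_head() {
    head -n "$(( $1 * 4 ))" "$2"
}

# Hypothetical usage: take the same number of reads from both mate files
# so that the pairs stay in sync.
# fastq_head 1000000 reads_1.fastq > subset_1.fastq
# fastq_head 1000000 reads_2.fastq > subset_2.fastq
```

Taking the first reads is quick but not a random sample, so it is only meant for checking whether a run completes.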

**plabaj** · 04-12-2012, 11:59 AM

I have a similar problem at the same stage:
[FAILED]
Error: segment-based junction search failed with err =-6
Error: invalid 'B' array subtype (B)!

and TopHat was run with the following parameters:
--bowtie1 --report-secondary-alignments --bowtie-n

**gpertea** · 04-12-2012, 12:42 PM

There are quite a few different problems reported in this thread - I apologize for them. We are aware of a couple of these problems and we are trying to fix them, though we are encountering some difficulties reproducing some of these errors.
We just updated the packages on the website with a couple of last minute fixes so please re-download the tophat-2.0.0 package of your choice (yes, it's the same version, same file names) from http://tophat.cbcb.umd.edu and try running the new version again on your test data. If tophat 2 still fails at any stage in the process please report the failure at [email protected] - it would also help if you attach a tarball or zip file with the content of the "logs" directory when you post a bug report there.

Thank you,
-Geo
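
For the bug report described above, the "logs" directory can be bundled with `tar`. This is a minimal sketch, assuming the run wrote to an output directory such as `tophat_out`; the helper name and tarball name are hypothetical.

```shell
# Bundle the "logs" directory of a TopHat output folder into a gzipped tarball.
# $1 = TopHat output directory, $2 = tarball to create.
pack_tophat_logs() {
    if [ ! -d "$1/logs" ]; then
        echo "no logs directory found in $1" >&2
        return 1
    fi
    # -C keeps the paths inside the archive relative ("logs/...").
    tar -czf "$2" -C "$1" logs
}

# Hypothetical usage:
# pack_tophat_logs tophat_out tophat_logs.tar.gz
```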

**NateP** · 04-13-2012, 05:41 AM

The updated version fixed the error I posted earlier in the thread. Thank you for the quick work!

**westerman** · 04-13-2012, 10:34 AM

Yes, the updated version also solved the problems I was having with crashes and hangs in the 'Joining segment' part of the program. My samples are being mapped to a non-standard organism that has over 80 scaffolds.

**vyellapa** · 04-13-2012, 01:06 PM

I didn't include the fusion flags and it worked OK. Is anybody seeing any significant speedup compared to 1.4.1? I don't.

Code:

time tophat2 -p 8 -r 50 --mate-std-dev 50 --transcriptome-index=./tophat2_transcript/GRCh37_E66 -o ./t2 \
-n 2 -N 4 --b2-sensitive --library-type fr-unstranded \
--rg-id ID000 --rg-sample M123 --rg-library lib0000 --rg-platform-unit IlluminaLn7 --rg-center TGen \
$RESOURCES/bowtie2/GRCh37.62 ./1mr1.fastq ./1mr2.fastq

**NicoBxl** · 04-16-2012, 12:47 AM

Tophat 2 also hangs forever in

[2012-04-13 03:41:33] Searching for junctions via segment mapping

and in the segment_juncs.log

>> Performing segment-search:
Loading left segment hits...

**DineshCyanam** · 04-16-2012, 06:28 AM

Yes... Tophat 2 is taking extremely long (~ 30 hrs)... particularly the Searching for junctions via segment mapping step as NicoBxl above observed. Here is my log.

$ tophat -r 150 --solexa1.3-quals -p 24 -g 20 --library-type fr-unstranded -o tophat_out $BOWTIE2_INDEXES/genome 61T4D/s_1_1_sequence.txt.gz,6286U/s_1_1_sequence.txt.gz 61T4D/s_1_2_sequence.txt.gz,6286U/s_1_2_sequence.txt.gz

[2012-04-13 18:14:51] Beginning TopHat run (v2.0.0)
-----------------------------------------------
[2012-04-13 18:14:51] Checking for Bowtie
Bowtie version: 2.0.0.5
[2012-04-13 18:14:51] Checking for Samtools
Samtools version: 0.1.18.0
[2012-04-13 18:14:51] Checking for Bowtie index files
[2012-04-13 18:14:51] Checking for reference FASTA file
[2012-04-13 18:14:51] Generating SAM header for /Homo_sapiens/Ensembl/GRCh37/Sequence/BowtieIndex/genome
format: fastq
quality scale: phred64 (reads generated with GA pipeline version >= 1.3)
[2012-04-13 18:14:53] Preparing reads
left reads: min. length=60, count=44338793
right reads: min. length=60, count=44098248
[2012-04-13 18:29:52] Mapping left_kept_reads against genome with Bowtie2
[2012-04-13 19:17:01] Mapping left_kept_reads_seg1 against genome with Bowtie2 (1/2)
[2012-04-13 19:28:48] Mapping left_kept_reads_seg2 against genome with Bowtie2 (2/2)
[2012-04-13 19:47:50] Mapping right_kept_reads against genome with Bowtie2
[2012-04-13 20:33:37] Mapping right_kept_reads_seg1 against genome with Bowtie2 (1/2)
[2012-04-13 20:47:07] Mapping right_kept_reads_seg2 against genome with Bowtie2 (2/2)
[2012-04-13 21:08:17] Searching for junctions via segment mapping
[2012-04-14 22:36:08] Retrieving sequences for splices
[2012-04-14 22:38:05] Indexing splices
[2012-04-14 22:51:34] Mapping left_kept_reads_seg1 against segment_juncs with Bowtie2 (1/2)
[2012-04-14 23:01:42] Mapping left_kept_reads_seg2 against segment_juncs with Bowtie2 (2/2)
[2012-04-14 23:15:09] Joining segment hits
[2012-04-14 23:37:53] Mapping right_kept_reads_seg1 against segment_juncs with Bowtie2 (1/2)
[2012-04-14 23:49:11] Mapping right_kept_reads_seg2 against segment_juncs with Bowtie2 (2/2)
[2012-04-15 00:04:38] Joining segment hits
[2012-04-15 00:28:13] Reporting output tracks
-----------------------------------------------
[2012-04-15 00:43:47] Run complete: 1 days 06:28:56 elapsed

Previous versions of TopHat took ~8 hours to run on the same data.
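
To see where the time goes, the timestamps in the log above can be subtracted directly. This is a minimal sketch, assuming GNU `date` (for the `-d` option); the helper names are hypothetical and the timestamps are copied from the log.

```shell
# Seconds elapsed between two "YYYY-MM-DD HH:MM:SS" timestamps.
# Requires GNU date; both timestamps are read as UTC to avoid DST effects.
elapsed_seconds() {
    local start end
    start=$(date -u -d "$1" +%s) || return 1
    end=$(date -u -d "$2" +%s) || return 1
    echo $(( end - start ))
}

# Format a number of seconds as HH:MM:SS (hours may exceed 24).
fmt_hms() {
    printf '%02d:%02d:%02d\n' $(( $1 / 3600 )) $(( $1 % 3600 / 60 )) $(( $1 % 60 ))
}

# Start of the junction search step and start of the next step.
juncs=$(elapsed_seconds "2012-04-13 21:08:17" "2012-04-14 22:36:08")
# Start and end of the whole run.
total=$(elapsed_seconds "2012-04-13 18:14:51" "2012-04-15 00:43:47")

echo "segment junction search: $(fmt_hms "$juncs")"
echo "whole run:               $(fmt_hms "$total")"
echo "share of run time:       $(( 100 * juncs / total ))%"
```

By this arithmetic the "Searching for junctions via segment mapping" step accounts for 25:27:51 of the 30:28:56 total, a little over four fifths of the run.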

**turnersd** · 04-16-2012, 10:51 AM

I'm also getting an error when using the --transcriptome-index option: "GFF transcripts file not found or invalid". The file exists, and it appears valid.

**DineshCyanam** · 04-17-2012, 12:00 PM

So I was able to reduce my runtime to ~4 hrs from ~30 hrs by using the --no-coverage-search and --no-novel-indels flags.
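
The exact command for the faster run was not posted. The sketch below only shows where the two flags would go, assuming the other options stay as in the earlier invocation in this thread; the function name and the argument values in the usage line are hypothetical placeholders.

```shell
# Print a TopHat command line that adds the two flags mentioned above.
# $1 = Bowtie 2 index prefix, $2 = left reads, $3 = right reads.
# The remaining options are copied from the earlier command in this thread.
tophat_fast_cmd() {
    echo tophat --no-coverage-search --no-novel-indels \
        -r 150 --solexa1.3-quals -p 24 -g 20 \
        --library-type fr-unstranded -o tophat_out \
        "$1" "$2" "$3"
}

# Review the command before running it (placeholder paths):
tophat_fast_cmd /path/to/indexes/genome reads_1.txt.gz reads_2.txt.gz
```

Both flags switch off parts of the search, so the junction and indel output of such a run is not directly comparable with that of a default run.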

**Dario1984** · 04-17-2012, 10:01 PM

Index Locations

The location of the indexes doesn't work like the documentation says.

Code:

darstr@clark-lab:~/bowtie2$ echo $BOWTIE_INDEXES
/home/darstr/bowtie2/indexes/
darstr@clark-lab:~/bowtie2$ ls $BOWTIE_INDEXES
hg18.1.bt2  hg18.2.bt2  hg18.3.bt2  hg18.4.bt2  hg18.rev.1.bt2  hg18.rev.2.bt2

[2012-04-18 15:43:14] Beginning TopHat run (v2.0.0)
-----------------------------------------------
[2012-04-18 15:43:14] Checking for Bowtie
                  Bowtie version:        2.0.0.5
[2012-04-18 15:43:14] Checking for Samtools
                Samtools version:        0.1.18.0
[2012-04-18 15:43:14] Checking for Bowtie index files
Error: Could not find Bowtie 2 index files (hg18.*.bt2)

Not only is the environment variable ignored, but it also doesn't look in the indexes directory in the directory of the bowtie instance.

The only way it works is if I change directories to /home/darstr/bowtie2/indexes/ and run it from there.
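
One possible way to avoid changing directories is to pass the full path of the index prefix, as the earlier command in this thread does with `$BOWTIE2_INDEXES/genome`. This is a minimal sketch, assuming TopHat accepts a full path prefix for the index; the helper name is hypothetical, and it only checks that the six `.bt2` files are present before printing the prefix.

```shell
# Print the full index prefix if all six Bowtie 2 index files exist.
# $1 = directory holding the index, $2 = index base name (e.g. hg18).
bt2_index_prefix() {
    local dir="${1%/}" name="$2" suffix
    for suffix in 1 2 3 4 rev.1 rev.2; do
        if [ ! -f "$dir/$name.$suffix.bt2" ]; then
            echo "missing index file: $dir/$name.$suffix.bt2" >&2
            return 1
        fi
    done
    echo "$dir/$name"
}

# Hypothetical usage (other TopHat options omitted):
# tophat -o tophat_out "$(bt2_index_prefix "$BOWTIE_INDEXES" hg18)" reads.fastq
```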

tophat2 errors
