**dpryan** · 06-24-2013, 01:43 AM

Your life would be easier if you used the Ensembl annotation with the Ensembl sequence or the UCSC annotation with the UCSC sequence and didn't try to mix them.
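
One quick way to spot a mixed annotation/sequence pair is to compare the sequence names used in the two files. The following is a minimal sketch, assuming uncompressed GTF and FASTA files; the function names are illustrative and not part of TopHat or any library.

```python
def fasta_names(path):
    """Return the set of sequence names (first word after '>') in a FASTA file."""
    names = set()
    with open(path) as handle:
        for line in handle:
            if line.startswith(">"):
                fields = line[1:].split()
                if fields:
                    names.add(fields[0])
    return names


def gtf_names(path):
    """Return the set of sequence names (column 1) used in a GTF file."""
    names = set()
    with open(path) as handle:
        for line in handle:
            # Skip comment lines and blank lines.
            if line.startswith("#") or not line.strip():
                continue
            names.add(line.split("\t", 1)[0])
    return names


def names_missing_from_fasta(gtf_path, fasta_path):
    """Sequence names the annotation uses that the FASTA does not define."""
    return sorted(gtf_names(gtf_path) - fasta_names(fasta_path))
```

A non-empty result, for example `chr1` in the GTF against `1` in the FASTA, would point to a UCSC/Ensembl mix.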

**dpryan** · 06-24-2013, 01:45 AM

Also, you can download pre-made indices that will work from iGenomes.

**rubbertjes** · 06-24-2013, 01:49 AM

Originally posted by dpryan

Also, you can download pre-made indices that will work from iGenomes.

This is indeed where I obtained my indices from.

**rubbertjes** · 06-24-2013, 01:50 AM

Originally posted by dpryan

Your life would be easier if you used the Ensembl annotation with the Ensembl sequence or the UCSC annotation with the UCSC sequence and didn't try to mix them.

I am downloading the Ensembl data from http://cufflinks.cbcb.umd.edu/igenomes.html as we speak, so we will see how that goes (ETA 15h :S), but it is still strange it doesn't function with my current implementation...
Thanks!

**hathiram2** · 08-28-2013, 12:55 AM

Hi rubbertjes,

I am also facing the same problem.

I am aligning my Arabidopsis RNA-Seq data with TopHat.

I gave the following command:

tophat -G genes.gtf -p 5 -o controlone genome C2AFAACXX_Hr-02_13s004770-1-1_ram_lane313s004770_sequence.fastq,C2AFAACXX_Hr-02_13s004770-2-1_ram_lane513s004770_sequence.fastq

genes.gtf is the gene annotation file, and the bowtie2 index files all start with the name 'genome'.

Both files were downloaded from this link:

http://tophat.cbcb.umd.edu/igenomes.shtml

and the output looks like this:

[2013-08-27 15:23:42] Beginning TopHat run (v2.0.9)
-----------------------------------------------
[2013-08-27 15:23:42] Checking for Bowtie
Bowtie version: 2.1.0.0
[2013-08-27 15:23:42] Checking for Samtools
Samtools version: 0.1.19.0
[2013-08-27 15:23:42] Checking for Bowtie index files (genome)..
[2013-08-27 15:23:42] Checking for reference FASTA file
[2013-08-27 15:23:42] Generating SAM header for genome
format: fastq
quality scale: phred33 (default)
[2013-08-27 15:23:42] Reading known junctions from GTF file
[2013-08-27 15:23:45] Preparing reads
left reads: min. length=52, max. length=52, 58033296 kept reads
(3282 discarded)
[2013-08-27 15:36:32] Building transcriptome data files..
[FAILED]
Error: gtf_to_fasta returned an error.

Were you able to solve this problem, and what was the mistake?

thank you so much..

**dpryan** · 08-28-2013, 01:15 AM

You might have a look in the run log to see if you get a more informative error. Aside from that, ensure that gtf_to_fasta is in your PATH and executable. You can also just directly execute it yourself using the command in the run log.
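
The PATH check can be scripted. This is a minimal sketch using Python's standard `shutil.which`; the helper name `find_executable` is illustrative, and the tool name is simply the one from the error message.

```python
import os
import shutil


def find_executable(name, path=None):
    """Return the full path of `name` if it is found and executable, else None.

    `path` is an optional search path string; by default the PATH
    environment variable is used.
    """
    return shutil.which(name, mode=os.X_OK, path=path)


if __name__ == "__main__":
    location = find_executable("gtf_to_fasta")
    print(location or "gtf_to_fasta not found on PATH (or not executable)")
```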

**hathiram2** · 08-28-2013, 01:40 AM

Hi dpryan,

In the run log I see the following details:

/g/software/linux/pack/tophat-2.0.9/bin/gtf_to_fasta --min-anchor 8 --splice-mismatches 0 --min-report-intron 50 --max-report-intron 500000 --min-isoform-fraction 0.15 --output-dir controlon/ --max-multihits 20 --max-seg-multihits 40 --segment-length 25 --segment-mismatches 2 --min-closure-exon 100 --min-closure-intron 50 --max-closure-intron 5000 --min-coverage-intron 50 --max-coverage-intron 20000 --min-segment-intron 50 --max-segment-intron 500000 --read-mismatches 2 --read-gap-length 2 --read-edit-dist 2 --read-realign-edit-dist 3 --max-insertion-length 3 --max-deletion-length 3 -z gzip -p5 --gtf-annotations Arabidopsis_thaliana_NCBI_TAIR10/Arabidopsis_thaliana/NCBI/TAIR10/Annotation/Archives/archive-2013-03-06-09-50-01/Genes/genome.gtf --gtf-juncs controlon/tmp/genome.juncs --no-closure-search --no-coverage-search --no-microexon-search Arabidopsis_thaliana_NCBI_TAIR10/Arabidopsis_thaliana/NCBI/TAIR10/Annotation/Archives/archive-2013-03-06-09-50-01/Genes/genome.gtf Arabidopsis_thaliana_NCBI_TAIR10/Arabidopsis_thaliana/NCBI/TAIR10/Sequence/Bowtie2Index/genome.fa controlon/tmp/genome.fa > controlon/logs/g2f.out

but I can't find any error message in it.

**hathiram2** · 08-28-2013, 02:07 AM

I copy-pasted the last lines of the run log into the command line, and the output was this:

/g/software/linux/pack/tophat-2.0.9/bin/gtf_to_fasta: /lib64/libz.so.1: no version information available (required by /g/software/linux/pack/tophat-2.0.9/bin/gtf_to_fasta)
Error (GFaSeqGet): not a fasta header?

**dpryan** · 08-28-2013, 02:15 AM

The libz warning can be ignored. I've not seen that particular error, but the source code suggests that the fasta file doesn't start with a ">Something" line. Is this the case?
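
A minimal sketch of that check, assuming the FASTA is an uncompressed text file; the function name is illustrative.

```python
def starts_with_fasta_header(path):
    """Return True if the first line of the file begins with '>'."""
    # errors="replace" keeps the check from failing on unexpected bytes.
    with open(path, "r", errors="replace") as handle:
        first_line = handle.readline()
    return first_line.startswith(">")
```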

**hathiram2** · 08-28-2013, 02:22 AM

I opened the fasta file in the terminal and it looks like this:

cat genome.fa
XSym
0029
7ec8fcfa270da919afc1f0afd221e788
../WholeGenomeFasta/genome.fa

I am not sure this is right, although I got this fasta file together with the other index files.
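
The four lines shown above look like an XSym placeholder: a small text file that some systems write in place of a symbolic link when the filesystem cannot store one (a marker, the target length, a checksum, then the target path). That reading is an assumption and is not confirmed anywhere in this thread. Under that assumption, a minimal sketch for detecting such a placeholder and working out where it points could look like this; both function names are illustrative.

```python
import os


def read_xsym_target(path):
    """Return the link target if `path` looks like an XSym placeholder, else None."""
    with open(path, "r", errors="replace") as handle:
        if handle.read(5) != "XSym\n":
            return None
        length = handle.readline().strip()
        handle.readline()  # checksum line, not verified in this sketch
        target = handle.readline().rstrip("\n")
    if not length.isdigit():
        return None
    # The second line gives the length of the target path.
    return target[: int(length)]


def resolve_placeholder(path):
    """Return the path the placeholder points to, relative to its own directory."""
    target = read_xsym_target(path)
    if target is None:
        return None
    return os.path.normpath(os.path.join(os.path.dirname(path), target))
```

For a `genome.fa` inside `Sequence/Bowtie2Index`, the target `../WholeGenomeFasta/genome.fa` would resolve to `Sequence/WholeGenomeFasta/genome.fa`; whether that file is present in the download is not shown in the thread.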

thanks for your posts..

**dpryan** · 08-28-2013, 02:30 AM

Oh my, something definitely went amiss there. You're going to want to either redownload that or recreate it from your bowtie indexes.
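
Recreating the FASTA from an existing index can be done with `bowtie2-inspect`, which ships with Bowtie 2 and by default writes the indexed sequences as FASTA to standard output. A minimal sketch, assuming `bowtie2-inspect` is on the PATH and the index base name is `genome`; the helper names are illustrative.

```python
import subprocess


def inspect_command(index_base):
    """Build the bowtie2-inspect call that prints the indexed sequences as FASTA."""
    return ["bowtie2-inspect", index_base]


def recreate_fasta(index_base, fasta_path):
    """Write the sequences stored in a Bowtie 2 index to `fasta_path`.

    Raises subprocess.CalledProcessError if bowtie2-inspect exits with an error,
    in which case the output file may be empty or incomplete.
    """
    with open(fasta_path, "w") as out:
        subprocess.run(inspect_command(index_base), stdout=out, check=True)
```

Calling `recreate_fasta("genome", "genome.fa")` in the index directory would overwrite any existing `genome.fa` there.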

**hathiram2** · 08-28-2013, 02:35 AM

I have already downloaded the index files twice, and both .fa files look the same. I also created my own index files, but I didn't find any .fa file among them, only the .bt2 files.

Now I am going to recreate the index files again..

**dpryan** · 08-28-2013, 02:42 AM

Where are you getting the fasta files and indices? Are these from iGenomes?

**hathiram2** · 08-28-2013, 02:42 AM

This time, too, I don't see any .fa file after indexing; I just see 6 .bt2 files.
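
Six `.bt2` files and no `.fa` is what `bowtie2-build` normally produces: it writes only the index, not a copy of the input FASTA. The sketch below reports which of the six standard (small-index) files and which FASTA sit next to an index base name. The function name is illustrative, and the `<base>.fa` naming is an assumption taken from the `genome.fa` path in the run log above.

```python
import os

# File suffixes written by bowtie2-build for a standard (small) index.
BT2_SUFFIXES = (".1.bt2", ".2.bt2", ".3.bt2", ".4.bt2", ".rev.1.bt2", ".rev.2.bt2")


def index_report(index_base):
    """Report which index files are missing and whether `<index_base>.fa` exists."""
    missing = [s for s in BT2_SUFFIXES if not os.path.isfile(index_base + s)]
    return {
        "missing_index_files": missing,
        "has_fasta": os.path.isfile(index_base + ".fa"),
    }
```

If `has_fasta` is false, placing the FASTA that was used for indexing next to the index as `genome.fa` would give gtf_to_fasta a real sequence file to read.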


GTF file error: "gtf_to_fasta returned an error"
