**dpryan** · 09-18-2014, 01:33 AM

Have a look at the --transcriptome-index option, which is what you're looking for.

**LeonDK** · 09-18-2014, 02:41 AM

Originally posted by dpryan:

Have a look at the --transcriptome-index option, which is what you're looking for.

Hi dpryan,

Thanks for the pointer to the --transcriptome-index option for tophat2. I looked it up in the TopHat2 manual. For other users who may run into the same problem, the trick is to build the transcriptome index once by running this command first:

Code:

tophat2 -G iGenomes/Homo_sapiens/UCSC/hg19/Annotation/Genes/genes.gtf --transcriptome-index=transcriptome_data/known iGenomes/Homo_sapiens/UCSC/hg19/Sequence/Bowtie2Index/genome

and then call tophat2 for each sample with this command:

Code:

tophat2 --num-threads 12 --transcriptome-index=transcriptome_data/known iGenomes/Homo_sapiens/UCSC/hg19/Sequence/Bowtie2Index/genome myfastq_R1.fastq.gz myfastq_R2.fastq.gz

When you run the second command, the log will show:

Code:

[2014-09-18 12:12:04] Using pre-built transcriptome data..

This is significantly faster when running multiple samples, because the transcriptome index is not rebuilt for every run.
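For several samples, the second command can be wrapped in a loop. The following is only a sketch under assumptions: the sample names (`sampleA`, `sampleB`), the `<sample>_R1.fastq.gz` / `<sample>_R2.fastq.gz` file naming, and the per-sample output directory names are hypothetical, and the script only prints the commands unless `DRY_RUN=0` is set.

```shell
#!/bin/sh
# Paths taken from the commands above.
GENOME=iGenomes/Homo_sapiens/UCSC/hg19/Sequence/Bowtie2Index/genome
TINDEX=transcriptome_data/known

# Print the command by default; execute it only when DRY_RUN=0.
run() {
    if [ "${DRY_RUN:-1}" = "1" ]; then
        echo "$@"
    else
        "$@"
    fi
}

# $1 = sample name (hypothetical naming: <sample>_R1.fastq.gz, <sample>_R2.fastq.gz)
align_sample() {
    run tophat2 --num-threads 12 \
        --output-dir "tophat_out_${1}" \
        --transcriptome-index="$TINDEX" \
        "$GENOME" "${1}_R1.fastq.gz" "${1}_R2.fastq.gz"
}

# Hypothetical sample names; every run reuses the same pre-built index.
for sample in sampleA sampleB; do
    align_sample "$sample"
done
```

Giving each sample its own `--output-dir` keeps the runs from overwriting each other's results.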

The UCSC/hg19 data can be retrieved like so:

Code:

wget ftp://igenome:[email protected]/Homo_sapiens/UCSC/hg19/Homo_sapiens_UCSC_hg19.tar.gz

Cheers,
Leon

**konika** · 08-11-2015, 05:48 AM

tophat not creating transcriptome indexes

Hi,
In my case the following command doesn't start tophat2. tophat2 just shows me the available options, as if I had used a wrong option somewhere. Does anyone have an idea what's wrong here?
The command I use:
tophat2 -G /home/chawla/rna_seq_pipeline/gff/mouse_ensembl.gff --transcriptome-index=tdata /home/chawla/rna_seq_pipeline/gff/mouse_ensembl

**GenoMax** · 08-11-2015, 06:14 AM

Originally posted by konika:

Hi,
In my case the following command doesn't start tophat2. tophat2 just shows me the available options, as if I had used a wrong option somewhere. Does anyone have an idea what's wrong here?
The command I use:
tophat2 -G /home/chawla/rna_seq_pipeline/gff/mouse_ensembl.gff --transcriptome-index=tdata /home/chawla/rna_seq_pipeline/gff/mouse_ensembl

You have to point tophat2 to the Bowtie2 indexes for the full genome. It appears that the last argument of your command refers to a GFF file rather than to the Bowtie2 index prefix. Refer to LeonDK's example in the posts above.
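A quick way to check the last argument before calling tophat2 is to test whether Bowtie2 index files exist for that prefix. This is a minimal sketch; the function names `has_bt2_index` and `check_index_prefix` are made up for illustration. It relies on Bowtie2 writing `<prefix>.1.bt2` (or `<prefix>.1.bt2l` for large indexes) when it builds an index.

```shell
#!/bin/sh
# Return 0 if <prefix> looks like a Bowtie2 index prefix, non-zero otherwise.
has_bt2_index() {
    [ -e "${1}.1.bt2" ] || [ -e "${1}.1.bt2l" ]
}

# Report whether the given prefix can be used as the genome index argument.
check_index_prefix() {
    if has_bt2_index "$1"; then
        echo "OK: $1"
    else
        echo "No Bowtie2 index found for prefix: $1"
        return 1
    fi
}
```

For example, `check_index_prefix /home/chawla/rna_seq_pipeline/gff/mouse_ensembl` would report a missing index if that directory only contains the GFF file.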

**konika** · 08-11-2015, 06:37 AM

Thanks. It was actually an old version of tophat, which also needs input reads in order to create the transcriptome index.

TopHat2 on multiple samples, avoid building Bowtie index from genes.fa each time?