Seqanswers Leaderboard Ad

**dpryan** · 08-16-2013, 04:54 AM

1. You might tell tophat to use more threads. It'll take a lot longer to get results when using only a single thread.

2. It doesn't much matter. I recall someone saying that the iGenomes indexes are missing somethings (perhaps it was mitochondria). When in doubt, always make your own (it doesn't take that long).

3. Abundant species would be things like rRNAs, that are often present in VERY high amounts but not of interest. These are useful to give to cufflinks or similar to tell it to ignore those areas of the genome.

4. For most common uses, the defaults are fine (just give it more threads).

**sindrle** · 08-16-2013, 05:02 AM

Thank you very much for fast answers!

Do you have a link describing thread settings for TopHat2?

Also, one final question, when downloaded pre-built hg19 all annotations are included, how may I use this with Tophat2?

**GenoMax** · 08-16-2013, 05:39 AM

Originally posted by sindrle View Post

Do you have a link describing thread settings for TopHat2?

Also, one final question, when downloaded pre-built hg19 all annotations are included, how may I use this with Tophat2?

From the TopHat manual:

-p/--num-threads <int> Use this many threads to align reads. The default is 1.

Start with 2 or 4 threads. It does not mean that even if your computer has 8 "cores" you will get an equivalent speedup since disk read speeds start being the limiting factor.

If you happen to get pre-built indexes from the iGenomes site then you can specify the location of the Bowtie2Index's while running TopHat by specifying the path to the "genome.fa" file in the index folder.

**sindrle** · 08-17-2013, 12:25 PM

Hi again!
Thank you for input. I aborted the first run, but before Im running the second test, can you control that this code is correct?

I want to use 8 threads: -p 8
I want use the genes.gtf included in the iGenomes hg19: -G path/to/genes.gtf
Since I use the -G, I also need: transcriptome-index=transcriptome_data
I dont want coverage-search: --no-coverage-search
My hg19 genome and Bowtie2 indexes (genome.fa and .bt2 files) is in my PATH (usr/bin/indexes) as aliases, does this work?
Finally I have my fastq I want to analyse.

So is this correct?

tophat2 -p 8 -G
/path/to/genes.gtf --transcriptome-index=transcriptome_data --no-coverage-search *
genome /path/to/x.fastq

"path/to" is just for simplicity.

Im also curious about the "*" after all the option codes. Also I wonder where the "transcriptome_data" folder will be created.

**dpryan** · 08-17-2013, 01:09 PM

The "transcriptome_data" folder will be created wherever you specify after "--transcriptome-index=". I have no clue what you're trying to achieve with the random asterisk, but I suspect it won't do whatever it is that you want. Aside from that, it should work.

**sindrle** · 08-17-2013, 01:30 PM

I cant get the -G and --transcriptome-index=
transcriptome_data/know option to work.

Also I have to cd to where my bowtie2 indexes are to type the tophat2 command even though I have the indexes in my PATH.

**GenoMax** · 08-18-2013, 05:11 AM

Having the indexes in your path won't work since they are not executable files. You will be better off providing full paths for them. There is no harm in providing full file paths (e.g. for tophat2 executable, indexes, output directories) in your command lines.

Check the detailed tutorial at the end of this article for examples of various command lines for the TopHat/Cufflinks suite of programs: http://www.nature.com/nprot/journal/....2012.016.html

**sindrle** · 08-18-2013, 07:45 AM

Thanks! I fiddled around for some hours yesterday and it all works like a charm now!

Next step is to run Cufflinks2.

Thank you everyone!

Topics	Statistics	Last Post
TIGR Systems Offer a Compact Alternative to CRISPR for Gene Editing by seqadmin Started by seqadmin, 03-03-2025, 01:15 PM	0 responses 154 views 0 likes	Last Post by seqadmin 03-03-2025, 01:15 PM
Highlights from AGBT 2025 – Part II by seqadmin Started by seqadmin, 02-28-2025, 12:58 PM	0 responses 238 views 0 likes	Last Post by seqadmin 02-28-2025, 12:58 PM
Highlights from AGBT 2025 – Part I by seqadmin Started by seqadmin, 02-24-2025, 02:48 PM	0 responses 607 views 0 likes	Last Post by seqadmin 02-24-2025, 02:48 PM
Selecting the Right AI Model for Bioinformatics Research by seqadmin Started by seqadmin, 02-21-2025, 02:46 PM	0 responses 263 views 0 likes	Last Post by seqadmin 02-21-2025, 02:46 PM

Seqanswers Leaderboard Ad

Announcement

Optimise TopHat2 speed and results

Comment

Comment

Comment

Comment

Comment

Comment

Comment

Comment

Latest Articles

ad_right_rmr

News