Seqanswers Leaderboard Ad

**dpryan** · 11-20-2012, 08:23 AM

It looks like you're mapping against some subset of the transcriptome. Try mapping against the actual genome with both (I would recommend using a reference GTF file with tophat) and I expect tophat will perform more favourably.

**DunderChief** · 11-20-2012, 11:07 AM

I would be cautious about switching to solexa scores just because you get a higher alignment rate. It seems like you have a problem unrelated to your quality scores and you'll probably end up confusing things further. If you run fastQC, it will automatically determine the version of your quality scores.

I doubt this would make that big of a difference anyway, but are you sure you're using tophat v2. It depends on how you set it up on your system, but typically the command is tophat2, not tophat.

Also, how are you determining the alignment rate? When I first started using tophat, I got very confused by their log files. Calculate the % mapped the same way for both tophat2 and bowtie2 results.

**SHeaph** · 11-20-2012, 01:13 PM

Thanks for yer help

**SHeaph** · 11-20-2012, 01:25 PM

I was using a cDNA and non-coding RNA library to map the reads against. Can I use this as a GTF? if so would any unmapped reads here then be mapped against the actual genome?

Much appreciated.

**dpryan** · 11-21-2012, 02:05 AM

You can't use the multifasta file as an annotation (since it doesn't actually annotate anything), but since it's name suggests that it's from Ensembl, you might just use the normal Ensembl genome sequence and annotation.

You can probably save some time by just downloading the premade indices (and I think GTF files, though I don't recall exactly) from here.

BTW, don't be surprised if mapping things this way leads to slightly lower alignment rates, as the results are going to be both more reliable and easier to analyse downstream (at least for common analyses).

Topics	Statistics	Last Post
Cancer Metastasis: A Deep Dive into Cellular Plasticity by seqadmin Started by seqadmin, 04-11-2024, 12:08 PM	0 responses 18 views 0 likes	Last Post by seqadmin 04-11-2024, 12:08 PM
Proteogenomic Profiles Offer New Clues in Prostate Cancer by seqadmin Started by seqadmin, 04-10-2024, 10:19 PM	0 responses 22 views 0 likes	Last Post by seqadmin 04-10-2024, 10:19 PM
Novel Diagnostic Assay Enhances Ovarian Cancer Detection by seqadmin Started by seqadmin, 04-10-2024, 09:21 AM	0 responses 16 views 0 likes	Last Post by seqadmin 04-10-2024, 09:21 AM
Evolutionary Dynamics of Centromeres: A Comparative Genomic Analysis by seqadmin Started by seqadmin, 04-04-2024, 09:00 AM	0 responses 47 views 0 likes	Last Post by seqadmin 04-04-2024, 09:00 AM

Seqanswers Leaderboard Ad

Announcement

Tophat2 Alignment Rate

Comment

Comment

Comment

Comment

Comment

Latest Articles

ad_right_rmr

News