TopHat --prefilter-multihits parameter

id0

Senior Member

Join Date: Sep 2012

Posts: 130
- Share
- Tweet
#1

TopHat --prefilter-multihits parameter

06-30-2014, 12:39 PM

I am trying to understand TopHat's --prefilter-multihits parameter. According to the documentation:

When mapping reads on the transcriptome, some repetitive or low complexity reads that would be discarded in the context of the genome may appear to align to the transcript sequences and thus may end up reported as mapped to those genes only. This option directs TopHat to first align the reads to the whole genome in order to determine and exclude such multi-mapped reads (according to the value of the -g/--max-multihits option).

I ran TopHat 1.4.1 (last version before 2) and 2.0.9 with just --GTF parameter on the same sequences. TopHat 2.0.9 mapped more reads, but both versions ended up with about 20% of bases as intronic or intergenic. When I add --prefilter-multihits, TopHat 1.4.1 produces very similar results (~1% less mapped reads), which seems very reasonable to me. However, with TopHat 2.0.9, I lose over half the reads. Seems like a lot, but maybe it's possible they are all multi-mapped. More importantly, less than 1% of aligned reads are now intergenic or intronic.

Two questions:
1) Why such a huge difference in behavior between the two versions? As far as I can tell, this option was not altered for version 2.
2) Why does this parameter eliminate essentially all reads outside the transcriptome for TopHat 2.0.9?
Tags: None

Previous template Next

Topics	Statistics	Last Post
Genetic Variants and Diabetes Risk in Childhood Cancer Survivors by seqadmin Started by seqadmin, Yesterday, 08:47 AM	0 responses 13 views 0 likes	Last Post by seqadmin Yesterday, 08:47 AM
Cancer Metastasis: A Deep Dive into Cellular Plasticity by seqadmin Started by seqadmin, 04-11-2024, 12:08 PM	0 responses 60 views 0 likes	Last Post by seqadmin 04-11-2024, 12:08 PM
Proteogenomic Profiles Offer New Clues in Prostate Cancer by seqadmin Started by seqadmin, 04-10-2024, 10:19 PM	0 responses 60 views 0 likes	Last Post by seqadmin 04-10-2024, 10:19 PM
Novel Diagnostic Assay Enhances Ovarian Cancer Detection by seqadmin Started by seqadmin, 04-10-2024, 09:21 AM	0 responses 54 views 0 likes	Last Post by seqadmin 04-10-2024, 09:21 AM

Seqanswers Leaderboard Ad