Tophat and Repeated Sequences

Airwalker810

Junior Member

Join Date: Oct 2010

Posts: 6
- Share
- Tweet
#1

Tophat and Repeated Sequences

03-11-2011, 02:42 PM

My PI and I have been wondering what exactly Tophat does with repeated sequences from a fastq file. Does it merely throw them away, or does it take the read and 'place' it at each location that it appears in the genome? We were going to use the CASAVA pipeline, and I do know that generates a file of repeated sequences that can later be indexed, but Tophat better serves our purposes.

Thanks for the Help.
Tags: repeated sequences, tophat
Jon_Keats

Senior Member

Join Date: Mar 2010

Posts: 279
- Share
- Tweet
#2

03-11-2011, 03:33 PM

Tophat prints the read at each location if it maps to multiple locations so you can end up with 120% mapped. Seems to be a big issue with short (50bp) single end reads, minor when using short paired-end reads so I suspect longer reads in a paired-end format would work better.
Comment

Previous template Next

Topics	Statistics	Last Post
Cancer Metastasis: A Deep Dive into Cellular Plasticity by seqadmin Started by seqadmin, 04-11-2024, 12:08 PM	0 responses 58 views 0 likes	Last Post by seqadmin 04-11-2024, 12:08 PM
Proteogenomic Profiles Offer New Clues in Prostate Cancer by seqadmin Started by seqadmin, 04-10-2024, 10:19 PM	0 responses 53 views 0 likes	Last Post by seqadmin 04-10-2024, 10:19 PM
Novel Diagnostic Assay Enhances Ovarian Cancer Detection by seqadmin Started by seqadmin, 04-10-2024, 09:21 AM	0 responses 45 views 0 likes	Last Post by seqadmin 04-10-2024, 09:21 AM
Evolutionary Dynamics of Centromeres: A Comparative Genomic Analysis by seqadmin Started by seqadmin, 04-04-2024, 09:00 AM	0 responses 55 views 0 likes	Last Post by seqadmin 04-04-2024, 09:00 AM

Seqanswers Leaderboard Ad