SEQanswers

Old 07-11-2013, 06:01 AM   #1
rozitaa
Member
 
Location: Sweden

Join Date: Jun 2013
Posts: 51
differences between gtf file and indexing file (hg19)

Hi,

Sorry for asking such simple questions, but I am new to bioinformatics. I have some mouse RNA-seq data that I would like to align with TopHat. I know a little about the GTF file (which represents the positions of genes) and the index or annotation file (e.g. hg19 or mm10) (which represents the functions of genes). Is my understanding right, and is there more information about them?
I was also wondering about the library type of the reads (stranded or unstranded), and whether both work for single-end and paired-end sequences.

Thanks
Old 07-11-2013, 06:31 AM   #2
dpryan
Devon Ryan
 
Location: Freiburg, Germany

Join Date: Jul 2011
Posts: 3,466

Well, a GTF file is an annotation of the genome: it describes where features are and how they are structured. Functions could be added to an annotation file, but they are often kept in a separate file, which wouldn't really be an annotation, since such files often lack genomic locations (using gene or transcript names instead, for example). Indexing is a generic term that can apply to a lot of things. For example, genome annotations are indexed by genomic coordinate. A file describing gene ontology might be indexed by gene or ontology category. Alternatively, genomes themselves can be indexed to make alignment faster. The term doesn't have any particular definition beyond its ordinary English meaning.
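To make the "where features are and their structure" part concrete, here is a minimal Python sketch that splits one GTF line into its nine tab-separated columns. The example line, the gene and transcript names, and the helper name `parse_gtf_line` are made up for illustration, and the attribute parsing is simplified (it assumes no semicolons inside quoted values).

```python
from typing import Dict, NamedTuple


class GtfRecord(NamedTuple):
    """One feature line of a GTF file (nine tab-separated columns)."""
    seqname: str      # chromosome or contig name
    source: str       # program or database that produced the feature
    feature: str      # feature type, e.g. "exon" or "CDS"
    start: int        # 1-based start coordinate
    end: int          # 1-based end coordinate (inclusive)
    score: str        # "." when no score is given
    strand: str       # "+", "-" or "."
    frame: str        # "0", "1", "2" or "."
    attributes: Dict[str, str]


def parse_gtf_line(line: str) -> GtfRecord:
    """Parse a single GTF feature line (hypothetical helper, simplified)."""
    fields = line.rstrip("\n").split("\t")
    if len(fields) != 9:
        raise ValueError(f"expected 9 columns, got {len(fields)}")

    # Column 9 holds 'key "value";' pairs such as gene_id and transcript_id.
    attributes: Dict[str, str] = {}
    for item in fields[8].split(";"):
        item = item.strip()
        if not item:
            continue
        key, _, value = item.partition(" ")
        attributes[key] = value.strip().strip('"')

    return GtfRecord(
        fields[0], fields[1], fields[2],
        int(fields[3]), int(fields[4]),
        fields[5], fields[6], fields[7],
        attributes,
    )


# Made-up example line, not taken from any real annotation.
example = (
    'chr1\texample\texon\t1000\t1200\t.\t+\t.\t'
    'gene_id "GeneA"; transcript_id "GeneA.1";'
)
record = parse_gtf_line(example)
print(record.seqname, record.start, record.end, record.attributes["gene_id"])
```

Note that the line carries coordinates and structure (which exon belongs to which transcript and gene) but nothing about what the gene does, which matches the distinction above.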

Strandedness of the library preparation is separate from whether the library is subsequently sequenced as single-end or paired-end reads.
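As a rough illustration of that independence, the sketch below assembles a TopHat command line in Python: the `--library-type` value is chosen on its own, and single-end versus paired-end only changes how many read files are passed. The file names, the index prefix, and the helper `build_tophat_command` are assumptions for the example; which library type matches a given prep kit depends on the protocol and should be checked against the kit and TopHat documentation.

```python
from typing import List, Optional

# Library types accepted by TopHat's --library-type option.
LIBRARY_TYPES = {"fr-unstranded", "fr-firststrand", "fr-secondstrand"}


def build_tophat_command(
    index_prefix: str,
    reads1: str,
    reads2: Optional[str] = None,
    library_type: str = "fr-unstranded",
    gtf: Optional[str] = None,
    output_dir: str = "tophat_out",
) -> List[str]:
    """Build (but do not run) a TopHat command as an argument list."""
    if library_type not in LIBRARY_TYPES:
        raise ValueError(f"unknown library type: {library_type}")

    cmd = ["tophat", "--library-type", library_type, "-o", output_dir]
    if gtf is not None:
        cmd += ["-G", gtf]       # optional annotation of known transcripts
    cmd.append(index_prefix)     # prefix of the indexed genome (e.g. for mm10)
    cmd.append(reads1)           # single-end reads, or mate 1 of a pair
    if reads2 is not None:
        cmd.append(reads2)       # mate 2, only for paired-end data
    return cmd


# Hypothetical file names: a stranded single-end run and an unstranded paired-end run.
single_stranded = build_tophat_command(
    "mm10", "sample.fastq", library_type="fr-firststrand", gtf="genes.gtf"
)
paired_unstranded = build_tophat_command(
    "mm10", "sample_R1.fastq", "sample_R2.fastq", gtf="genes.gtf"
)
print(" ".join(single_stranded))
print(" ".join(paired_unstranded))
```

The resulting list could be passed to `subprocess.run` once TopHat and a genome index are actually available.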
Tags
annotation, gtf, top hat
