SEQanswers

Old 10-12-2010, 02:45 PM   #1
fabrice
Member
 
Location: paris

Join Date: Oct 2009
Posts: 86
TopHat with a list of gene model annotations

I downloaded the GTF (gene model annotations) file from:

ftp://ftp.ensembl.org/pub/current/gt...Ch37.59.gtf.gz

I am running TopHat with the -G option. The Bowtie index is H. sapiens, NCBI v37, downloaded from ftp://ftp.cbcb.umd.edu/pub/data/bowt...7_asm.ebwt.zip

The chromosome names in the gene model annotations must match the names in the Bowtie index, so I use the sed script below to rewrite the chromosome names in the GTF. Is this correct? Thanks.


Code:
s/^1\t/gi|224589800|ref|NC_000001.10|\t/g;
s/^10\t/gi|224589801|ref|NC_000010.10|\t/g;
s/^11\t/gi|224589802|ref|NC_000011.9|\t/g;
s/^12\t/gi|224589803|ref|NC_000012.11|\t/g;
s/^13\t/gi|224589804|ref|NC_000013.10|\t/g;
s/^14\t/gi|224589805|ref|NC_000014.8|\t/g;
s/^15\t/gi|224589806|ref|NC_000015.9|\t/g;
s/^16\t/gi|224589807|ref|NC_000016.9|\t/g;
s/^17\t/gi|224589808|ref|NC_000017.10|\t/g;
s/^18\t/gi|224589809|ref|NC_000018.9|\t/g;
s/^19\t/gi|224589810|ref|NC_000019.9|\t/g;
s/^20\t/gi|224589812|ref|NC_000020.10|\t/g;
s/^21\t/gi|224589813|ref|NC_000021.8|\t/g;
s/^22\t/gi|224589814|ref|NC_000022.10|\t/g;
s/^2\t/gi|224589811|ref|NC_000002.11|\t/g;
s/^3\t/gi|224589815|ref|NC_000003.11|\t/g;
s/^4\t/gi|224589816|ref|NC_000004.11|\t/g;
s/^5\t/gi|224589817|ref|NC_000005.9|\t/g;
s/^6\t/gi|224589818|ref|NC_000006.11|\t/g;
s/^7\t/gi|224589819|ref|NC_000007.13|\t/g;
s/^8\t/gi|224589820|ref|NC_000008.10|\t/g;
s/^9\t/gi|224589821|ref|NC_000009.11|\t/g;
s/^X\t/gi|224589822|ref|NC_000023.10|\t/g;
s/^Y\t/gi|224589823|ref|NC_000024.9|\t/g;
s/^MT\t/gi|17981852|ref|NC_001807.4|\t/g;
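
One way to check whether the renaming worked is to list any sequence names in the converted GTF that are not in the index. The sketch below assumes the index names were saved to a file with bowtie-inspect --names, and that the name to match is the first whitespace-delimited token of each line, as in the sed script above. The function name and the file names are hypothetical.

```shell
# List GTF sequence names (column 1) that are absent from the Bowtie index.
#   $1: file with one index name per line (saved `bowtie-inspect --names` output)
#   $2: GTF file to check
# Assumption: only the first whitespace-delimited token of each index name matters.
unmatched_seqnames() {
    awk -F '\t' '
        # First file: keep only the first token of each index name.
        NR == FNR { sub(/ .*/, "", $0); known[$0] = 1; next }
        # Second file (GTF): skip comment lines, report each unknown name once.
        /^#/ { next }
        !($1 in known) && !seen[$1]++ { print $1 }
    ' "$1" "$2"
}
```

For example, after `sed -f rename.sed input.gtf > renamed.gtf` and `bowtie-inspect --names h_sapiens_37_asm > names.txt`, running `unmatched_seqnames names.txt renamed.gtf` prints nothing if every GTF sequence name is present in the index. Note that `\t` in a sed pattern is a GNU sed extension, so the script above may need literal tab characters with other sed implementations.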
Old 10-12-2010, 02:46 PM   #2
fabrice
Member
 
Location: paris

Join Date: Oct 2009
Posts: 86

I do not use --no-novel-juncs in my analysis.
Old 10-13-2010, 07:44 AM   #3
fabrice
Member
 
Location: paris

Join Date: Oct 2009
Posts: 86

For reference, these are the sequence names in the Bowtie index, as listed by:

bowtie-inspect --names h_sapiens_37_asm

gi|224589800|ref|NC_000001.10| Homo sapiens chromosome 1, GRCh37 primary reference assembly
gi|224589811|ref|NC_000002.11| Homo sapiens chromosome 2, GRCh37 primary reference assembly
gi|224589815|ref|NC_000003.11| Homo sapiens chromosome 3, GRCh37 primary reference assembly
gi|224589816|ref|NC_000004.11| Homo sapiens chromosome 4, GRCh37 primary reference assembly
gi|224589817|ref|NC_000005.9| Homo sapiens chromosome 5, GRCh37 primary reference assembly
gi|224589818|ref|NC_000006.11| Homo sapiens chromosome 6, GRCh37 primary reference assembly
gi|224589819|ref|NC_000007.13| Homo sapiens chromosome 7, GRCh37 primary reference assembly
gi|224589820|ref|NC_000008.10| Homo sapiens chromosome 8, GRCh37 primary reference assembly
gi|224589821|ref|NC_000009.11| Homo sapiens chromosome 9, GRCh37 primary reference assembly
gi|224589801|ref|NC_000010.10| Homo sapiens chromosome 10, GRCh37 primary reference assembly
gi|224589802|ref|NC_000011.9| Homo sapiens chromosome 11, GRCh37 primary reference assembly
gi|224589803|ref|NC_000012.11| Homo sapiens chromosome 12, GRCh37 primary reference assembly
gi|224589804|ref|NC_000013.10| Homo sapiens chromosome 13, GRCh37 primary reference assembly
gi|224589805|ref|NC_000014.8| Homo sapiens chromosome 14, GRCh37 primary reference assembly
gi|224589806|ref|NC_000015.9| Homo sapiens chromosome 15, GRCh37 primary reference assembly
gi|224589807|ref|NC_000016.9| Homo sapiens chromosome 16, GRCh37 primary reference assembly
gi|224589808|ref|NC_000017.10| Homo sapiens chromosome 17, GRCh37 primary reference assembly
gi|224589809|ref|NC_000018.9| Homo sapiens chromosome 18, GRCh37 primary reference assembly
gi|224589810|ref|NC_000019.9| Homo sapiens chromosome 19, GRCh37 primary reference assembly
gi|224589812|ref|NC_000020.10| Homo sapiens chromosome 20, GRCh37 primary reference assembly
gi|224589813|ref|NC_000021.8| Homo sapiens chromosome 21, GRCh37 primary reference assembly
gi|224589814|ref|NC_000022.10| Homo sapiens chromosome 22, GRCh37 primary reference assembly
gi|224589822|ref|NC_000023.10| Homo sapiens chromosome X, GRCh37 primary reference assembly
gi|224589823|ref|NC_000024.9| Homo sapiens chromosome Y, GRCh37 primary reference assembly
gi|17981852|ref|NC_001807.4| Homo sapiens mitochondrion, complete genome
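
Since the index names follow a regular pattern, the sed script could also be generated from this listing rather than typed by hand. A minimal sketch, assuming each description contains either "chromosome N," or the word "mitochondrion" as in the output above, and that the GTF uses the names 1-22, X, Y and MT. The function name names_to_sed is a hypothetical helper, not part of Bowtie or TopHat.

```shell
# Read `bowtie-inspect --names` style lines on stdin and print one sed
# substitution per sequence, in the same form as the hand-written script.
# Assumption: index names contain no "/" or "&", which are special in sed.
names_to_sed() {
    awk '
        {
            id = $1        # first token, e.g. gi|224589800|ref|NC_000001.10|
            chrom = ""
            if (match($0, /chromosome [0-9XY]+,/)) {
                # Drop the leading "chromosome " (11 chars) and the trailing comma.
                chrom = substr($0, RSTART + 11, RLENGTH - 12)
            } else if ($0 ~ /mitochondrion/) {
                chrom = "MT"   # name used for the mitochondrial genome in the GTF
            }
            if (chrom != "") printf "s/^%s\\t/%s\\t/;\n", chrom, id
        }
    '
}
```

A possible use is `bowtie-inspect --names h_sapiens_37_asm | names_to_sed > rename.sed`. The generated lines omit the trailing g flag, which has no effect on a pattern anchored with ^.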