![]() |
|
![]() |
||||
Thread | Thread Starter | Forum | Replies | Last Post |
Creating a mRNA GTF file from fasta for HTSeq ? | CEPHSeq | Bioinformatics | 11 | 06-18-2015 10:44 AM |
creating GFF file from refernce fasta | dena.dinesh | RNA Sequencing | 1 | 10-30-2014 05:29 PM |
Creating a gtf file for the database I mapped against | lanner | Bioinformatics | 1 | 05-06-2014 01:18 AM |
How do I go from a fasta and a chromosome to gtf/gff file? | Brown_lineage | Bioinformatics | 8 | 12-07-2012 07:21 AM |
BWA not creating rBWT file | ramiro2k | Bioinformatics | 2 | 03-23-2012 03:27 AM |
![]() |
|
Thread Tools |
![]() |
#1 |
Junior Member
Location: United Kingdom Join Date: Mar 2015
Posts: 5
|
![]()
I downloaded the Mus musculus mm10 pre-built tophat indices from the Tophat website (http://ccb.jhu.edu/software/tophat/igenomes.shtml). This package comes with a .gtf file that specifies junction boundaries and start/stop codon positions. It also contains a .fa file for the mouse genome.
I would like to produce a .fa file that contains all of the mouse RNA transcript sequences from the above two files. Is there any software that does this? I have read of de novo transcriptome assembly software like Trinity, but I do not think it would be ideal for what I would like to do here. I merely need some kind of script/tool to generate the annotated mRNA (and ideally, rRNA, mtRNA, and ncRNA) sequences that will match the annotation in the aforementioned two files (.gtf and .fa from the tophat pre-built index). Thank you kindly for your time and help. Please let me know if there is any further information you would like to help evaluate my query. If I do find an appropriate tool for this problem, I plan on writing a script to extract and splice necessary sequences from the genomic .fa file, and will upload this script for others to use (but would rather avoid re-inventing the wheel if possible!). |
![]() |
![]() |
![]() |
#2 |
Senior Member
Location: NikoNarita.jp Join Date: Jul 2013
Posts: 142
|
![]()
Hi
Any solution of your problem ? I am confused with too |
![]() |
![]() |
![]() |
#3 |
Rick Westerman
Location: Purdue University, Indiana, USA Join Date: Jun 2008
Posts: 1,104
|
![]()
Whenever I heard the word "gtf" or "vcf" then my mind goes to BEDtools. And indeed one of the programs is "fastaFromBed" which "Creates FASTA sequences based on intervals in a BED/GFF/VCF file." GTF is just a GFF in disguise.
|
![]() |
![]() |
![]() |
#4 |
Senior Member
Location: NikoNarita.jp Join Date: Jul 2013
Posts: 142
|
![]()
Thanks westerman, however, I like to ask you something different but relative.
After running Denovo Trinity reports Trinity.fasta and then I did blast got blastx.outfmt6. I also have TransDecoder output i.e. Trinity.fasta.transdecoder.cds , Trinity.fasta.transdecoder.gff3 , Trinity.fasta.transdecoder.mRNA , Trinity.fasta.transdecoder.pep I am confused how to add real gene names in to Trinity.fasta So that Trinity numbered genes (TR1...n) will be replaced by real gene names which are available in Trinity.fasta.transdecoder.gff3 or blastx.outfmt6 ? Any idea ? |
![]() |
![]() |
![]() |
#5 | |
Rick Westerman
Location: Purdue University, Indiana, USA Join Date: Jun 2008
Posts: 1,104
|
![]() Quote:
|
|
![]() |
![]() |
![]() |
Thread Tools | |
|
|