Old 11-13-2015, 03:16 PM   #2
blancha
Senior Member
 
Location: Montreal

Join Date: May 2013
Posts: 367

Your reference genome files are FASTA files.

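The FASTA file format is simple.
Each sequence record begins with a single header line, starting with the symbol ">", which describes the sequence.
The first word after ">" is the name of the sequence.
The lines following the header contain the sequence itself, which may span several lines and runs until the next ">" line or the end of the file.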
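
To make the layout concrete, here is a minimal sketch of a FASTA reader in Python. It is only an illustration of the format described above, not how Bowtie or any other tool actually parses its input. The function name parse_fasta and the example records (chr1, chr2 and their sequences) are made up for this post.

```python
def parse_fasta(lines):
    """Yield (name, description, sequence) for each FASTA record.

    `lines` is any iterable of text lines, e.g. an open file handle.
    """
    name, description, chunks = None, "", []
    for raw in lines:
        line = raw.strip()
        if not line:
            continue  # ignore blank lines
        if line.startswith(">"):
            # A new header line closes the previous record, if there was one.
            if name is not None:
                yield name, description, "".join(chunks)
            header = line[1:].strip()
            parts = header.split(None, 1)
            name = parts[0] if parts else ""  # first word after ">"
            description = parts[1] if len(parts) > 1 else ""
            chunks = []
        elif name is not None:
            # Sequence lines are concatenated until the next header.
            chunks.append(line)
    if name is not None:
        yield name, description, "".join(chunks)


# Hypothetical two-record example, not real genomic data.
example = """>chr1 hypothetical example chromosome
ACGTACGT
TTGA
>chr2
GGCC
""".splitlines()

records = list(parse_fasta(example))
for name, description, sequence in records:
    print(name, len(sequence))
```

With a real file you would pass an open handle instead, for example `with open(path) as handle: records = list(parse_fasta(handle))`, where `path` points to your own reference genome file. For real work, an established library parser is a safer choice than a hand-rolled one.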

Bowtie, like virtually every program that works with raw genomic sequence data, is written to parse FASTA files.

The ">" symbol in a FASTA header has nothing to do with the Unix redirection operator. The FASTA format dates back to 1985.

Last edited by blancha; 11-13-2015 at 06:50 PM.