Seqanswers Leaderboard Ad

**GenoMax** · 11-06-2012, 10:27 AM

You can "cat" the fasta formatted nucleotide sequence files to create a common "reference genome" file. This can be used for making the indexes.

**archgen** · 11-06-2012, 01:53 PM

Thanks for the reply.

Just to be clear on your response, "cat" only the .fna files for each chromosome, not any of the other fasta formatted sequence files, i.e. the .ffn with coding region info?

Again, much appreciated. Relieved that it seems like a simple solution.

**GenoMax** · 11-07-2012, 05:19 AM

Originally posted by archgen View Post

Thanks for the reply.

Just to be clear on your response, "cat" only the .fna files for each chromosome, not any of the other fasta formatted sequence files, i.e. the .ffn with coding region info?

Again, much appreciated. Relieved that it seems like a simple solution.

A simple "multi-fasta" formatted file that only has the ">fasta header" followed by the sequence starting on the subsequent line for all sequences.

**westerman** · 11-07-2012, 07:35 AM

Also it is often the case that the repository has a whole genome file already available thus alleviating the need to cat the individual chromosome files.

**TonyBrooks** · 11-07-2012, 08:18 AM

Illumina have helpfully supplied iGenomes archives for some common species.
These contain BWA and Bowtie indices making alignment a walk in the park (even I can do it!) There's no need to deal with FASTA (although that data is also in the archive you download from the Illumina website.

Illumina | ECommerce

https://my.illumina.com/Message/iGenome/

I think some of these files are also available on the Cufflinks page (http://cufflinks.cbcb.umd.edu/igenomes.html) if you don't have an Illumina login. They also contain RNA-Seq annotation, but you can just ignore that for genome assembly - the references are still there.

**archgen** · 11-07-2012, 03:32 PM

Sadly, I'm not working with any model organisms with well-known reference genomes. But it's good to know those sites exist for future projects.

Thanks again for the feedback.

Topics	Statistics	Last Post
TIGR Systems Offer a Compact Alternative to CRISPR for Gene Editing by seqadmin Started by seqadmin, 03-03-2025, 01:15 PM	0 responses 151 views 0 likes	Last Post by seqadmin 03-03-2025, 01:15 PM
Highlights from AGBT 2025 – Part II by seqadmin Started by seqadmin, 02-28-2025, 12:58 PM	0 responses 229 views 0 likes	Last Post by seqadmin 02-28-2025, 12:58 PM
Highlights from AGBT 2025 – Part I by seqadmin Started by seqadmin, 02-24-2025, 02:48 PM	0 responses 599 views 0 likes	Last Post by seqadmin 02-24-2025, 02:48 PM
Selecting the Right AI Model for Bioinformatics Research by seqadmin Started by seqadmin, 02-21-2025, 02:46 PM	0 responses 262 views 0 likes	Last Post by seqadmin 02-21-2025, 02:46 PM

Seqanswers Leaderboard Ad

Announcement

The one file to rule them all - ref genome

Comment

Comment

Comment

Comment

Comment

Comment

Latest Articles

ad_right_rmr

News