
**GenoMax** · 11-06-2012, 10:27 AM

You can "cat" the fasta formatted nucleotide sequence files to create a common "reference genome" file. This can be used for making the indexes.
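For anyone who would rather script the concatenation than run "cat" by hand, a minimal Python sketch follows. The function name `concat_fasta` and the file names in the usage note are illustrative assumptions, not something any particular aligner requires. The only thing it does beyond a plain "cat" is add a newline when an input file lacks a trailing one, so the next header cannot end up glued to the previous sequence.

```python
from pathlib import Path
from typing import Iterable


def concat_fasta(inputs: Iterable[Path], output: Path) -> int:
    """Concatenate FASTA files into one multi-FASTA file.

    Returns the number of sequence records (header lines) written.
    """
    records = 0
    with output.open("w") as out:
        for path in inputs:
            last = "\n"  # treat an empty input as already newline-terminated
            with path.open() as handle:
                for line in handle:
                    if line.startswith(">"):
                        records += 1
                    out.write(line)
                    last = line
            # Guard against a file with no trailing newline, which would
            # join the next file's header onto the end of this sequence.
            if not last.endswith("\n"):
                out.write("\n")
    return records
```

Assuming one .fna file per chromosome in a hypothetical `genome/` directory, a call such as `concat_fasta(sorted(Path("genome").glob("*.fna")), Path("reference.fna"))` would produce the combined file to build the indexes from.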

**archgen** · 11-06-2012, 01:53 PM

Thanks for the reply.

Just to be clear on your response, "cat" only the .fna files for each chromosome, not any of the other fasta formatted sequence files, i.e. the .ffn with coding region info?

Again, much appreciated. Relieved that it seems like a simple solution.

**GenoMax** · 11-07-2012, 05:19 AM

What you want is a simple "multi-fasta" formatted file: for every sequence, a ">fasta header" line followed by the sequence itself, starting on the next line.
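A quick way to confirm that a combined file really has that shape is to walk it and collect the headers. The sketch below is only an illustration: `fasta_headers` is a hypothetical helper name, and taking the first whitespace-delimited token of each header as the sequence name is an assumption about how the downstream tool will read it.

```python
from pathlib import Path
from typing import List


def fasta_headers(path: Path) -> List[str]:
    """Return the sequence names in a multi-FASTA file.

    Raises ValueError if sequence data appears before the first header
    or if a header is not followed by any sequence lines.
    """
    names: List[str] = []
    has_sequence = True  # no record is pending before the first header
    with path.open() as handle:
        for lineno, raw in enumerate(handle, start=1):
            line = raw.strip()
            if not line:
                continue  # tolerate blank lines
            if line.startswith(">"):
                if not has_sequence:
                    raise ValueError(f"record {names[-1]!r} has no sequence")
                fields = line[1:].split()
                # Assumption: the name is the first token after ">"
                names.append(fields[0] if fields else "")
                has_sequence = False
            else:
                if not names:
                    raise ValueError(f"line {lineno}: sequence before any header")
                has_sequence = True
    if names and not has_sequence:
        raise ValueError(f"record {names[-1]!r} has no sequence")
    return names
```

The returned list should contain one name per chromosome (or plasmid, contig, and so on); comparing `len(names)` with `len(set(names))` is an easy way to spot duplicated names before building the indexes.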

**westerman** · 11-07-2012, 07:35 AM

Also, the repository often already provides a whole-genome file, which removes the need to cat the individual chromosome files.

**TonyBrooks** · 11-07-2012, 08:18 AM

Illumina have helpfully supplied iGenomes archives for some common species.
These contain BWA and Bowtie indices, making alignment a walk in the park (even I can do it!). There's no need to deal with FASTA (although that data is also in the archive you download from the Illumina website).

https://my.illumina.com/Message/iGenome/

I think some of these files are also available on the Cufflinks page (http://cufflinks.cbcb.umd.edu/igenomes.html) if you don't have an Illumina login. They also contain RNA-Seq annotation, but you can just ignore that for genome assembly - the references are still there.

**archgen** · 11-07-2012, 03:32 PM

Sadly, I'm not working with any model organisms with well-known reference genomes. But it's good to know those sites exist for future projects.

Thanks again for the feedback.

The one file to rule them all - ref genome