Unconfigured Ad

**tonybolger** · 05-05-2011, 01:12 AM

Originally posted by ndeshpan View Post

I have 3 paired-end read data-sets from Illumina Solexa (102 bp ;30917380 reads per file)
With a RAM memory of around 200+ GB can anyone suggest if MIRA has the capacity to assemble these reads? Does it use multiple processors?

Our experience trying to assemble 50GB of 454 data with MIRA suggests you won't want to wait that long.

If it's just solexa data, i'd suggest keeping to the various de-bruijn assemblers - not sure i can really 'recommend' one though, i've had bad experiences with all of them that i tried.

BTW, SOAP doesn't appear use paired data until the scaffolding stage, so it's not necessarily as bad as it looks.

**sphil** · 05-05-2011, 04:04 AM

Hey,

first of all you are right. Due to massive amount of flags mira is using its hard to come up with. Here, http://mira-assembler.sourceforge.ne...e_library_size, you can find a very nice description on how to assemble paired end reads. Nevertheless i would recommend to subscribe at mira mailing list. The are really fast and even Bastien tries to help as much as he can (if the manual doesn't work out)...
Only havin' a slight look i would say you need to concate your files into one.
Then just start assembly with:
mira
--project=XXXXX
--job=denovo,genome,accurate,solexa
SOLEXA_SETTINGS -GE:tismin=250:tismax=750
>&log_assembly.txt

accurate means that you will have to wait until sun burns out so i would change it to draft for a first shot.

SOLEXA_SETTINGS -GE:tismin=250:tismax=750 , means that your insert size is 500bp.... everything else might be quite ok with defualt solexa settings

best,

philip

**ndeshpan** · 05-23-2011, 05:59 PM

thanks guys

Topics	Statistics	Last Post
Whole-Genome Sequencing Traces Faroe Islands Ancestry to a North Atlantic Founder Population by SEQadmin2 Started by SEQadmin2, 06-17-2026, 06:09 AM	0 responses 17 views 0 reactions	Last Post by SEQadmin2 06-17-2026, 06:09 AM
Sequencing the Two-Toed Sloth Genome Reveals Jumping Genes Tied to Its Extreme Metabolism by SEQadmin2 Started by SEQadmin2, 06-09-2026, 11:58 AM	0 responses 38 views 0 reactions	Last Post by SEQadmin2 06-09-2026, 11:58 AM
A New Method Makes Hantavirus Genome Analysis Faster and More Accessible by SEQadmin2 Started by SEQadmin2, 06-05-2026, 10:09 AM	0 responses 43 views 0 reactions	Last Post by SEQadmin2 06-05-2026, 10:09 AM
A New Single-Cell Method Maps DNA-Protein Interactions by SEQadmin2 Started by SEQadmin2, 06-04-2026, 08:59 AM	0 responses 49 views 0 reactions	Last Post by SEQadmin2 06-04-2026, 08:59 AM

Unconfigured Ad

Mira assembler: Medium sized genomes;How to use 2 separate files for paired-end reads

Comment

Comment

Comment

Latest Articles

ad_right_rmr

News