Seqanswers Leaderboard Ad

**JackieBadger** · 08-08-2012, 04:52 AM

Try using MIRA (http://www.chevreux.org/projects_mira.html). I got much better de novo assemblies using it than Velvet.

**SES** · 08-08-2012, 06:30 AM

Is there a reason you are not using Roche's gsAssembler (a.k.a. Newbler) for this project? I've tried a number of other de novo assemblers and found Roche's own software produces better results with 454 data. Depending on the size of your data set, you may also find MIRA useful.

**JackieBadger** · 08-08-2012, 09:59 AM

Originally posted by SES View Post

Is there a reason you are not using Roche's gsAssembler (a.k.a. Newbler) for this project? I've tried a number of other de novo assemblers and found Roche's own software produces better results with 454 data. Depending on the size of your data set, you may also find MIRA useful.

The latest release of MIRA handles very large data sets much better than previous releases.

**SES** · 08-08-2012, 10:05 AM

Originally posted by JackieBadger View Post

The latest release of MIRA handles very large data sets much better than previous releases.

That is good to know. What is "very large" in this case (I haven't consulted the MIRA docs in a while)?

**DFJ111** · 08-08-2012, 01:09 PM

Thanks.. yes, I'll try MIRA. The main reason I used velvet is because 1) I'm familiar with it and 2) I'm familiar with how to preprocess fastq files based on quality/homopolymer runs, but not so much with .sff files. Which, having read the MIRA manual, I realise is not a problem since it takes fasta and fasta.qual files fine, doh!. There was a Newbler assembly run already, I think it was rubbish mainly because it treated all 4 runs as equally good, which they were not. But I may also throw that into the mix.

Generally speaking, I was wondering if any assemblers handled very heterogeneous coverage better? Anyway thanks for responses, I've got plenty to work on!

**colindaven** · 08-09-2012, 02:30 AM

There are assemblers designed for very heterogeneous coverage such as MetaVelvet but I don't think they'd be useful in your case.

**DFJ111** · 08-09-2012, 01:58 PM

FYI In case anyone else encounters this, I have used MIRA3, which seems to have produced some good results. It has specific switches for heterogeneous coverage: using "est" in the -job switch (i.e. telling MIRA3 to assemble as if it's an EST sequencing project) or alternatively setting the [uniform_read_distribution(urd)=on|yes|1, off|no|0] within the -AS (-ASSEMBLY) parameter section as "no". By my reading of the manual, either switch will tell MIRA3 to stop assuming that the coverage is homogeneous. I simply used the -est switch, even though sequences from EST sequencing are likely to be even more heterogeneous in coverage than what I have (which is exome sequencing). Seemed to work OK.

**JackieBadger** · 08-10-2012, 05:16 AM

There are multiple papers on the subject. Here is just one:http://www.biomedcentral.com/1471-2164/11/571

**DFJ111** · 08-12-2012, 12:57 PM

Thanks. That's a paper on transcriptome data though. Exome sequencing will be far less heterogeneous, although still heterogeneous enough that just throwing it into a normal genome assembly pipeline would be inadvisable. I agree there are obvious similarities. This special edition: http://genomebiology.com/content/12/9 is probably more relevant.

Topics	Statistics	Last Post
Cancer Metastasis: A Deep Dive into Cellular Plasticity by seqadmin Started by seqadmin, 04-11-2024, 12:08 PM	0 responses 37 views 0 likes	Last Post by seqadmin 04-11-2024, 12:08 PM
Proteogenomic Profiles Offer New Clues in Prostate Cancer by seqadmin Started by seqadmin, 04-10-2024, 10:19 PM	0 responses 41 views 0 likes	Last Post by seqadmin 04-10-2024, 10:19 PM
Novel Diagnostic Assay Enhances Ovarian Cancer Detection by seqadmin Started by seqadmin, 04-10-2024, 09:21 AM	0 responses 35 views 0 likes	Last Post by seqadmin 04-10-2024, 09:21 AM
Evolutionary Dynamics of Centromeres: A Comparative Genomic Analysis by seqadmin Started by seqadmin, 04-04-2024, 09:00 AM	0 responses 54 views 0 likes	Last Post by seqadmin 04-04-2024, 09:00 AM

Seqanswers Leaderboard Ad

Announcement

De novo assembly of 454 exome sequencing

Comment

Comment

Comment

Comment

Comment

Comment

Comment

Comment

Comment

Latest Articles

ad_right_rmr

News