Hi
I am new to NGS and need some advice. Let me start by thanking all of you who post replies to the forums; it takes time to share, but believe me, we newbies really appreciate it! The learning curve on this stuff is steep!
I am trying to assemble a 5 Mb bacterial genome from Illumina 40 bp single-end reads. FastQC reports about 30 million reads, and the base quality across all of them is above 30, so I do not believe I need to trim or filter.
I would like to try de novo assembly, since the genes I am interested in are novel and likely to reside on a plasmid, which would make it difficult to build contigs against a reference genome. In addition, they are likely flanked by non-unique DNA. I am not computer savvy, but I do have access to Velvet and have run it a few times before. I have been using Tablet to view the .afg output.
Here are my questions:
1. Apart from changing the k-mer length, what other parameters should I manipulate to optimize the assembly?
2. On a related note, until recommendations come back from projects like GAGE, are there any tips anyone can pass along, or a post somewhere, that would help with this kind of analysis of single-read data?
3. Is there free software that can take any of Velvet's output files and calculate the N50 value, so that as we run our iterations we can figure out what works better? As I said, I am not a programmer, so I am looking for something plug and play (see the sketch after this list for what I mean by the calculation).
4. Does anyone have advice on the contig sizes that are 'normally expected' for this kind of assembly? In other words, if I get about 1,700 contigs longer than 100 bp, with a few in the 50-70 kb range, is that considered good, or do I have a long way to go with optimization?
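To be concrete about question 3, here is a rough Python sketch of the N50 calculation as I understand it, run over Velvet's contigs.fa (the output directory name here is just from my own runs and may differ). Something packaged that does the equivalent of this would be ideal:

def contig_lengths(fasta_path):
    # Yield the length of each sequence in a FASTA file.
    length = 0
    with open(fasta_path) as handle:
        for line in handle:
            if line.startswith(">"):
                if length:
                    yield length
                length = 0
            else:
                length += len(line.strip())
    if length:
        yield length

def n50(lengths):
    # N50 = length of the shortest contig such that contigs of that
    # length or longer cover at least half of the total assembly.
    lengths = sorted(lengths, reverse=True)
    total = sum(lengths)
    running = 0
    for contig_len in lengths:
        running += contig_len
        if running * 2 >= total:
            return contig_len
    return 0

print(n50(contig_lengths("velvet_out/contigs.fa")))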
Many thanks