Seqanswers Leaderboard Ad

**diptarka** · 08-03-2013, 10:41 PM

Hi, all
I am new to denovo genome assembly. I have a fastq sequence data which i have to assemble using velvet. I used the velvet optimiser script with different hash length from 27 to 41 and it predicted best to be 37. The output file contigs.fa contains 260 contigs whereas log file predicts 283 nodes, where are the rest gone? Length given in contigs.fa is in k mers? how do i calculate it's actual nucleotide length in bp?. How do i understand whether the assembly is good or bad. FInal stat given after script running:
Final graph has 283 nodes and n50 of 347, max 2336, total 68614, using 19064/50000 reads
Why are the number of used reads so low?

**mastal** · 08-04-2013, 02:05 AM

Contig length, k-mer coverage, and differential expression

I'm pretty sure velvet has a cutoff value for the length of the contigs
listed in the contigs.fa file, although I don't remember off the top of my head what that is. So the missing contigs are probably the very short ones.

The formula for calculating kmer coverage from base coverage is
given in the velvet manual. See

Error: 404 | EMBL-EBI

http://www.ebi.ac.uk/~zerbino/velvet/

As to whether the assembly is good, have a look at this Nature Methods article entitled ''De novo genome assembly: what every biologist should know"

http://www.nature.com/nmeth/journal/v9/n4/full/nmeth.1935.html

**diptarka** · 09-10-2013, 11:30 AM

What is the twin node as specified in velvet? It says reverse of reverse complement k merss. How are contigs actually generated using paired end assembly with velvet? can someone show using an example?

Topics	Statistics	Last Post
Cancer Metastasis: A Deep Dive into Cellular Plasticity by seqadmin Started by seqadmin, 04-11-2024, 12:08 PM	0 responses 25 views 0 likes	Last Post by seqadmin 04-11-2024, 12:08 PM
Proteogenomic Profiles Offer New Clues in Prostate Cancer by seqadmin Started by seqadmin, 04-10-2024, 10:19 PM	0 responses 29 views 0 likes	Last Post by seqadmin 04-10-2024, 10:19 PM
Novel Diagnostic Assay Enhances Ovarian Cancer Detection by seqadmin Started by seqadmin, 04-10-2024, 09:21 AM	0 responses 25 views 0 likes	Last Post by seqadmin 04-10-2024, 09:21 AM
Evolutionary Dynamics of Centromeres: A Comparative Genomic Analysis by seqadmin Started by seqadmin, 04-04-2024, 09:00 AM	0 responses 52 views 0 likes	Last Post by seqadmin 04-04-2024, 09:00 AM

Seqanswers Leaderboard Ad

Announcement

Contig length, k-mer coverage, and differential expression

Comment

Comment

Comment

Latest Articles

ad_right_rmr

News