
**ragowthaman** · 12-08-2009, 01:15 PM

Hi all, I found this in the Velvet manual:
"The N’s in the sequence correspond to gaps between scaffolded contigs. The
number of N’s corresponds to the estimated length of the gap. For reasons of
compatibility with the archives, any gap shorter than 10bp is represented by a
sequence of 10 N’s."

That answers my question, only to create more!
How does Velvet estimate a gap in the assembly? Unless it uses a reference genome, how does it calculate the gap between two reads? Or, for that matter, how does it find that two reads/scaffolds are close together? Doesn't Velvet assemble the scaffolds de novo?
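To see what the quoted manual passage means for a particular assembly, one can list every run of Ns in the contigs file together with its length. The following is only a minimal sketch: the function names `read_fasta` and `n_runs` are made up for illustration and are not part of Velvet, and the parser assumes a plain FASTA file.

```python
import re


def n_runs(sequence):
    """Return (start, length) for every run of N/n in a sequence string."""
    return [(m.start(), m.end() - m.start())
            for m in re.finditer(r"[Nn]+", sequence)]


def read_fasta(path):
    """Yield (name, sequence) pairs from a plain FASTA file."""
    name, chunks = None, []
    with open(path) as handle:
        for line in handle:
            line = line.strip()
            if line.startswith(">"):
                if name is not None:
                    yield name, "".join(chunks)
                name, chunks = line[1:], []
            elif line:
                chunks.append(line)
    if name is not None:
        yield name, "".join(chunks)


def report_gaps(path):
    """Return (contig name, start, length) for each N run in a FASTA file."""
    return [(name, start, length)
            for name, seq in read_fasta(path)
            for start, length in n_runs(seq)]
```

Calling `report_gaps` on the contigs file gives one tuple per gap. Per the manual text quoted above, a reported length of exactly 10 may stand for any estimated gap shorter than 10 bp.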

**Zigster** · 12-08-2009, 02:27 PM

paired-end reads

**nickloman** · 12-09-2009, 01:19 AM

First question, why are you assembling reads from 454 with Velvet? This is not the preferred choice except if you had complementary short-read data. I'd start with an assembly using Roche's Newbler software.

**kwebb** · 06-14-2010, 07:14 AM

I have just encountered Ns in my Velvet assembly as well. My reads are 36 bp, generated on an Illumina GA. All are single-end reads, and no reference sequence was used.

Can anyone answer ragowthaman's question regarding how velvet estimates a gap in the assembly in my case?

**nickloman** · 06-14-2010, 07:20 AM

Illumina sequencing can generate Ns. Just grep for them in your FASTQ files. And check you haven't accidentally used paired-end mode for your assembly.
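A plain grep for "N" over a whole FASTQ file can also match header and quality lines, since "N" is a valid quality character. As a rough sketch of a stricter check, the snippet below looks only at the sequence line of each record. The function name `count_reads_with_n` is hypothetical, and the code assumes standard four-line FASTQ records with no line wrapping.

```python
def count_reads_with_n(fastq_path):
    """Return (total reads, reads whose sequence contains at least one N).

    Assumes four lines per record: header, sequence, '+', quality.
    """
    total = 0
    with_n = 0
    with open(fastq_path) as handle:
        for index, line in enumerate(handle):
            if index % 4 == 1:  # second line of each record is the sequence
                total += 1
                if "N" in line.upper():
                    with_n += 1
    return total, with_n
```

If the second number is zero, any Ns in the assembled contigs were not carried over from the reads.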

**nickloman** · 06-14-2010, 07:21 AM

In answer to your question - it can't estimate gap lengths if you don't have paired-end data.

**kwebb** · 06-14-2010, 09:49 AM

Thanks for the quick response. I agree with you, but still can't figure out why I'm getting Ns in my assembled contig.

Prior to velvet assembly, I removed all short reads which contained one or more Ns. So the Ns are being inserted into my contigs during the assembly process. (just ran grep to double-check - no Ns)

And as far as accidentally assembling as paired-end reads goes - I used the -short option which, according to the manual, is also the default.

any other thoughts?

Thanks!

**wgarzon** · 03-13-2012, 04:02 PM

Hi Friends,

How can I compute the hash_length? I don't understand the formula kmC = C*(L-K+1)/L, and I don't know these values.

This is my problem: I have two paired FASTQ files containing short-read DNA data from high-throughput sequencing (HTS). I need to assemble these files into a single sequence, and I would like to compute the appropriate hash_length.

Thanks

**cascoamarillo** · 04-25-2012, 08:28 AM

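Hi,
You can combine different paired-end read sets in velveth; that is not an issue. As for which k-mer length is best, you have to determine it empirically: try different k-mer lengths and check which one gives the best output (e.g. the highest N50).
You can probably find this info elsewhere too:
kmC = C*(L-K+1)/L
kmC: k-mer coverage
C: expected (nucleotide) coverage
L: length of reads
K: k-mer value used (the hash_length)
The genome size (in bp) is not part of the formula itself; it is only needed to estimate C, as total sequenced bases divided by genome size.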
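As a worked illustration of the formula kmC = C*(L-K+1)/L, here is a small sketch that takes L as the read length and estimates C from the number of reads and an assumed genome size. The function names and the example numbers are illustrative only, not values from this thread.

```python
def nucleotide_coverage(num_reads, read_length, genome_size):
    """Estimate C as total sequenced bases divided by genome size (in bp)."""
    return num_reads * read_length / genome_size


def kmer_coverage(coverage, read_length, k):
    """Apply kmC = C * (L - K + 1) / L."""
    if not 0 < k <= read_length:
        raise ValueError("k must be between 1 and the read length")
    return coverage * (read_length - k + 1) / read_length


if __name__ == "__main__":
    # Hypothetical data set: 5 million 36 bp reads, 4 Mbp genome.
    c = nucleotide_coverage(5_000_000, 36, 4_000_000)
    for k in range(21, 33, 2):
        print(f"k={k}: C={c:.1f}, kmC={kmer_coverage(c, 36, k):.1f}")
```

The loop shows the trade-off behind trying several k values: for a fixed read length, k-mer coverage falls as k grows, so a larger k needs deeper sequencing to stay well covered.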

One option is to ask velveth to hash several k-mer lengths in the same run:
$ velveth file_name 27,45,2 -fastq
where it takes k-mer values from 27 to 45 (in steps of 2); but I think this needs a cluster with a lot of memory.

Runs of Ns in Velvet assembly?
