To start the assembly process we have to set ther kmer size.I'm not clear about the relationship between kmer size,read length and total no.of reads so that i can decide proper kmer size.
Unconfigured Ad
Collapse
X
-
Hi vaibhavvsk,
What is your read size? Is this paired end or single end? What kind of data is this (Illumina, 454, pacbio etc.)?
You may find this blog post helpful:
-
-
A lot of this must be worked out empirically, though you can use others' experience as a guide. kmer is always going to be shorter than your read length, and probably quite a bit shorter. For a lot of Illumina data and many assemblers, small k (21 or 31, for example) work well as a first cut, and then it is worth trying larger values
Also note that most assemblers require odd k values (so that no kmer is palindromic), so if you are scanning a range don't bother with evens.
Also note that some assemblers (e.g. Ray) have a default maximum on the kmer size which isn't very large; you need to set a parameter during the build to increase this. SOAPdenovo comes in various flavors compiled for different maxima; the memory requirements for all assemblies are greater if you use one of the binaries that can run a higher k.
Comment
-
Latest Articles
Collapse
-
by SEQadmin2
Data variability is still an issue in sequencing technologies despite the advances in reproducibility and accuracy of these platforms. But the problem does not originate in the sequencing itself, but in the previous steps, before the sample reaches the sequencer.
The first step is collection, followed by preservation and sample preparation for analysis. Most scientists overlook those steps, but not being careful might just be skewing the experiment’s results.
...-
Channel: Articles
06-02-2026, 10:05 AM -
-
by SEQadmin2
With the launch of new single-cell sequencing platforms in 2026, the field stands at an exciting inflection point. This article surveys the most impactful advances in the field and discusses how they’re reshaping research in cancer, immunology, and beyond.
Introduction
Single-cell sequencing technologies have undergone remarkable advances over the past decade, transitioning from low-throughput experimental approaches to highly scalable platforms capable of...-
Channel: Articles
05-22-2026, 06:42 AM -
ad_right_rmr
Collapse
News
Collapse
| Topics | Statistics | Last Post | ||
|---|---|---|---|---|
|
Sequencing the Two-Toed Sloth Genome Reveals Jumping Genes Tied to Its Extreme Metabolism
by SEQadmin2
Started by SEQadmin2, 06-09-2026, 11:58 AM
|
0 responses
15 views
0 reactions
|
Last Post
by SEQadmin2
06-09-2026, 11:58 AM
|
||
|
Started by SEQadmin2, 06-05-2026, 10:09 AM
|
0 responses
26 views
0 reactions
|
Last Post
by SEQadmin2
06-05-2026, 10:09 AM
|
||
|
Started by SEQadmin2, 06-04-2026, 08:59 AM
|
0 responses
37 views
0 reactions
|
Last Post
by SEQadmin2
06-04-2026, 08:59 AM
|
||
|
Started by SEQadmin2, 06-02-2026, 12:03 PM
|
0 responses
61 views
0 reactions
|
Last Post
by SEQadmin2
06-02-2026, 12:03 PM
|
Comment