Greetings. I'd like to do a de novo assembly using Velvet 1.2.10 from reads of two different Illumina , one with 50 basepair reads and a 115bp insert, one with 250bp reads and 550bp insert. For velveth, how what insert size do I give it? An average? Pick the larger one? Thanks!
Seqanswers Leaderboard Ad
Collapse
Announcement
Collapse
No announcement yet.
X
-
you can use both.
the 2 sets of reads will be flagged as -shortPaired and -shortPaired2, respectively, when you run velveth, and the insert lengths can be
specified using the parameters -ins_length, -ins_length_sd, -ins_length2 and
-ins_length2_sd when you run velvetg.
-
From the velvet manual:
5.6 What’s long and what’s short?
Velvet was pretty much designed with micro-reads (e.g. Illumina) as short and
short to long reads (e.g. 454 and capillary) as long. Reference sequences can
also be thrown in as long.
That being said, there is no necessary distinction between the types of reads.
The only constraint is that a short read be shorter than 32kb. The real difference
is the amount of data Velvet keeps on each read. Short reads are presumably
too short to resolve many repeats, so only a minimal amount of information is
kept. On the contrary, long reads are tracked in detail through the graph.
This means that whatever you call your reads, you should be able to obtain
the same initial assembly. The differences will appear as you are trying to resolve
repeats, as long reads can be followed through the graph. On the other hand,
long reads cost more memory. It is therefore perfectly fine to store Sanger reads
as “short” if necessary.
Comment
Latest Articles
Collapse
-
by seqadmin
Proteins are often described as the workhorses of the cell, and identifying their sequences is key to understanding their role in biological processes and disease. Currently, the most common technique used to determine protein sequences is mass spectrometry. While still a valuable tool, mass spectrometry faces several limitations and requires a highly experienced scientist familiar with the equipment to operate it. Additionally, other proteomic methods, like affinity assays, are constrained...-
Channel: Articles
04-04-2024, 04:25 PM -
-
by seqadmin
Despite advancements in sequencing platforms and related sample preparation technologies, certain sample types continue to present significant challenges that can compromise sequencing results. Pedro Echave, Senior Manager of the Global Business Segment at Revvity, explained that the success of a sequencing experiment ultimately depends on the amount and integrity of the nucleic acid template (RNA or DNA) obtained from a sample. “The better the quality of the nucleic acid isolated...-
Channel: Articles
03-22-2024, 06:39 AM -
ad_right_rmr
Collapse
News
Collapse
Topics | Statistics | Last Post | ||
---|---|---|---|---|
Started by seqadmin, 04-11-2024, 12:08 PM
|
0 responses
22 views
0 likes
|
Last Post
by seqadmin
04-11-2024, 12:08 PM
|
||
Started by seqadmin, 04-10-2024, 10:19 PM
|
0 responses
24 views
0 likes
|
Last Post
by seqadmin
04-10-2024, 10:19 PM
|
||
Started by seqadmin, 04-10-2024, 09:21 AM
|
0 responses
20 views
0 likes
|
Last Post
by seqadmin
04-10-2024, 09:21 AM
|
||
Started by seqadmin, 04-04-2024, 09:00 AM
|
0 responses
52 views
0 likes
|
Last Post
by seqadmin
04-04-2024, 09:00 AM
|
Comment