Originally posted by Markiyan
View Post
Unconfigured Ad
Collapse
X
-
Hey, I did try metaSPAdes, less than 1% of total reads assembled. A lot of people tried alternative methods, joined paired-ends and get long reads, but don't assembled reads. Then, use the long merged reads to do BLAST or other annotations.Originally posted by fanli View PostOut of curiosity, why are you joining the read pairs? A lot of the metagenomics software out there now supports paired end reads as input. The metaSPAdes assembler @GenoMax mentioned requires paired end data IIRC.
Comment
-
-
Would something like kraken or CLARK not be helpful? Are you trying to assemble and annotate de novo genomes? Or trying to figure out the microbial composition and functional content? I guess my point is you would discard ~40% of your data in the joining process, which may not be necessary depending on your task of interest.
Comment
-
-
I am not interested in a specific genome in the soil community. Basically, I am just interested in the microbial composition and functional content. There is another way that people usually do. They don't assemble or merge pairs and they just blast using raw data. However, I think blast using ~150bp reads is worse. Some publication shows the intermediate length (merged pair) is better than assembled longer reads or unasembled short reads for the question that I am asking for.Originally posted by fanli View PostWould something like kraken or CLARK not be helpful? Are you trying to assemble and annotate de novo genomes? Or trying to figure out the microbial composition and functional content? I guess my point is you would discard ~40% of your data in the joining process, which may not be necessary depending on your task of interest.
Comment
-
-
You might find this benchmark to be helpful:
My understanding is that you are better off using newer k-mer based approaches as opposed to BLAST. I've had reasonable success with kraken, although the memory requirements are somewhat onerous. You also have to be extremely careful about removing contaminant (aka human) sequences as these tend to get misclassified.
Another option is kallisto (https://github.com/pachterlab/metakallisto) but I have yet to be able to even build a database due to memory constraints.
Comment
-
-
I will try. Can you gimme the Kraken link? Thanks.Originally posted by fanli View PostYou might find this benchmark to be helpful:
My understanding is that you are better off using newer k-mer based approaches as opposed to BLAST. I've had reasonable success with kraken, although the memory requirements are somewhat onerous. You also have to be extremely careful about removing contaminant (aka human) sequences as these tend to get misclassified.
Another option is kallisto (https://github.com/pachterlab/metakallisto) but I have yet to be able to even build a database due to memory constraints.
Comment
-
Latest Articles
Collapse
-
by GATTACATLove this - good data definitely starts from good input, and poor input can only give relatively poor data. I particularly like the mention of Nanodrop/absorbance based methods for quantification. It's such a toss up if you'll get an accurate reading or what amounts to a randomly generated number, and a lot of library/sequencing related issues can be traced back to poor quant.
-
Channel: Articles
07-01-2026, 11:43 AM -
-
by SEQadmin2
I’m not a sequencing expert. I’m a purification scientist who uses NGS to evaluate workflows my group develops. With this perspective, we think about the sample first and the NGS workflow second. The sequencer is an exceptionally honest reporter, but it can only report on what you give it, so whether you get clean, interpretable data from an NGS workflow is largely determined before you begin.
Here are nine questions we think about, in roughly the order they matter, before...-
Channel: Articles
-
ad_right_rmr
Collapse
News
Collapse
| Topics | Statistics | Last Post | ||
|---|---|---|---|---|
|
Started by SEQadmin2, Yesterday, 11:08 AM
|
0 responses
7 views
0 reactions
|
Last Post
by SEQadmin2
Yesterday, 11:08 AM
|
||
|
Started by SEQadmin2, 06-30-2026, 05:37 AM
|
0 responses
11 views
0 reactions
|
Last Post
by SEQadmin2
06-30-2026, 05:37 AM
|
||
|
Started by SEQadmin2, 06-26-2026, 11:10 AM
|
0 responses
20 views
0 reactions
|
Last Post
by SEQadmin2
06-26-2026, 11:10 AM
|
||
|
Whole-Genome Sequencing Traces Faroe Islands Ancestry to a North Atlantic Founder Population
by SEQadmin2
Started by SEQadmin2, 06-17-2026, 06:09 AM
|
0 responses
54 views
0 reactions
|
Last Post
by SEQadmin2
06-17-2026, 06:09 AM
|
Comment