Are there any metagenomic studies that used an assembler to generate contigs/longer reads from reads produced by next-generation sequencers?
-
Sorry to be late in answering, but there is a recent study that did assembly on the gut metagenome with Illumina:
Qin et al. (2010) A human gut microbial gene catalogue established by metagenomic sequencing. Nature 464, 59-65.
The scale of the sequencing they did is pretty enormous (576.7 gigabases), so I'm not sure whether the methodology is applicable to smaller scales.
-
PANGEA for next gen data assembly
Hi- you might want to check out this paper too:
PANGEA: pipeline for analysis of next generation amplicons
The ISME Journal 4, 852-861 (July 2010) | doi:10.1038/ismej.2010.16
Adriana Giongo, David B Crabb, Austin G Davis-Richardson, Diane Chauliac, Jennifer M Mobberley, Kelsey A Gano, Nabanita Mukherjee, George Casella, Luiz FW Roesch, Brandon Walts, Alberto Riva, Gary King and Eric W Triplett
-
PANGEA is for amplicon sequencing only, though, which is a somewhat different problem that doesn't require assembly. There are several of these amplicon analysis pipelines, and they all have variants of the following steps: quality filtering, read trimming, clustering, BLAST, taxonomic assignment, and comparison of relative abundances across data sets. They don't involve assembly, though I guess if you were doing Illumina PE amplicon sequencing you'd have to join the two overlapping ends to get a single sequence.
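To illustrate the paired-end joining step mentioned above, here is a minimal Python sketch. It is not how PANGEA or any particular pipeline does it; the function names, the `min_overlap` default and the exact-match rule are assumptions made for illustration, and real tools also use base qualities.

```python
# Sketch only: join an overlapping read pair into one amplicon sequence.
# All names and defaults here are hypothetical, not from any real pipeline.
_COMPLEMENT = str.maketrans("ACGTN", "TGCAN")


def reverse_complement(seq):
    """Return the reverse complement of an uppercase DNA string."""
    return seq.translate(_COMPLEMENT)[::-1]


def merge_pair(forward, reverse, min_overlap=10, max_mismatch_rate=0.0):
    """Merge a read pair whose 3' ends overlap.

    The reverse read is reverse-complemented, then the longest overlap
    between the end of `forward` and the start of that sequence is used.
    Returns the merged sequence, or None if no acceptable overlap exists.
    """
    rc = reverse_complement(reverse)
    longest = min(len(forward), len(rc))
    for overlap in range(longest, min_overlap - 1, -1):
        tail = forward[-overlap:]
        head = rc[:overlap]
        mismatches = sum(a != b for a, b in zip(tail, head))
        if mismatches <= max_mismatch_rate * overlap:
            return forward + rc[overlap:]
    return None


if __name__ == "__main__":
    # Made-up 30 bp amplicon, read from both ends with a 12 bp overlap.
    amplicon = "ACGTTGCAAGGCTTAACCGGTTAGCATCGA"
    read1 = amplicon[:20]
    read2 = reverse_complement(amplicon[8:])
    print(merge_pair(read1, read2))
```

Pairs that fail to merge come back as `None`, so they can be counted or set aside rather than silently dropped.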
Originally posted by Suzanne:
Hi- you might want to check out this paper too:
PANGEA: pipeline for analysis of next generation amplicons
The ISME Journal 4, 852-861 (July 2010) | doi:10.1038/ismej.2010.16
Adriana Giongo, David B Crabb, Austin G Davis-Richardson, Diane Chauliac, Jennifer M Mobberley, Kelsey A Gano, Nabanita Mukherjee, George Casella, Luiz FW Roesch, Brandon Walts, Alberto Riva, Gary King and Eric W Triplett
http://www.nature.com/ismej/journal/...ej201016a.html
-
Check these:
- Increasing the coverage of a metapopulation consensus genome by iterative read mapping and assembly. Dutilh BE, Huynen MA, Strous M., Bioinformatics, 2009 vol. 25 (21) pp. 2878-81.
- The metagenome of a biogas-producing microbial community of a production-scale biogas plant fermenter analysed by the 454-pyrosequencing technology. Schlüter et al., Journal of Biotechnology, 2008 vol. 136 (1-2) pp. 77-90.
Metagenomes can be hard to assemble, but how hard really depends on the diversity of the ecosystem you are studying. For simple ecosystems where only a few species are found, assembly is possible. For more complex environments it depends on how deep your sequencing is. Most metagenomes of complex environments don't achieve a coverage of more than 1x, and at that depth assembly is not easy.
And when you do an assembly, the question still remains: how can you test that what you have assembled is really valid? For that you need to be able to amplify the fragment using PCR and resequence it.
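To put rough numbers on the coverage point above, here is a back-of-the-envelope Python sketch. It assumes reads are sampled uniformly at random (the Lander-Waterman model) and that each genome's share of the sequenced bases is known; the function names and the example community are hypothetical, not taken from any of the cited studies.

```python
import math


def expected_coverage(total_bases, genome_size, read_fraction=1.0):
    """Mean depth for one genome: bases drawn from it divided by its size.

    `read_fraction` is the (assumed known) share of all sequenced bases
    that come from this genome.
    """
    return total_bases * read_fraction / genome_size


def fraction_covered(coverage):
    """Lander-Waterman estimate of the genome fraction hit by >= 1 read."""
    return 1.0 - math.exp(-coverage)


if __name__ == "__main__":
    # Hypothetical run: 500 Mb of sequence, 5 Mb genomes.
    total_bases = 500e6
    genome_size = 5e6
    for name, share in [("dominant species", 0.5), ("rare species", 0.001)]:
        depth = expected_coverage(total_bases, genome_size, share)
        print(f"{name}: {depth:.2f}x depth, "
              f"{fraction_covered(depth):.1%} of genome covered")
```

Under these assumptions a species with a small share of the reads ends up well below 1x, which is why deep sequencing of a complex community can still leave most genomes unassemblable.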
-
I've recently been playing with the MetaHIT data from the paper cited above, assembling it with the CLC Bio de novo assembler (see http://www.clcdenovo.com/). By gross statistics (total assembled bp in contigs > 200 bp, average size of contigs > 200 bp, N50, etc.), my assemblies were better than those reported in the paper using SOAPdenovo, and required a fraction of the time/CPU/memory of SOAPdenovo. In fact, I could run the assemblies in 8 GB of memory. Give it a try!
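For anyone who wants to compare assemblies on the same gross statistics, here is a minimal Python sketch. It is not part of CLC Bio or SOAPdenovo; the function name and the strict `> min_length` cutoff are assumptions chosen to match the "contigs > 200 bp" wording above.

```python
def assembly_stats(contig_lengths, min_length=200):
    """Summarise contigs strictly longer than `min_length` bp.

    N50 is the length L such that contigs of length >= L together hold
    at least half of the total retained bases.
    """
    kept = sorted((n for n in contig_lengths if n > min_length), reverse=True)
    if not kept:
        return {"num_contigs": 0, "total_bp": 0, "mean_length": 0.0, "n50": 0}

    total = sum(kept)
    running = 0
    n50 = 0
    for length in kept:  # walk from longest to shortest contig
        running += length
        if running * 2 >= total:
            n50 = length
            break

    return {
        "num_contigs": len(kept),
        "total_bp": total,
        "mean_length": total / len(kept),
        "n50": n50,
    }


if __name__ == "__main__":
    # Made-up contig lengths, for illustration only.
    print(assembly_stats([100, 200, 300, 400, 500, 1000]))
```

The same length cutoff has to be applied to both assemblies before comparing, since mean contig size and N50 both shift with it.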
-
Originally posted by saul:
I've recently been playing with the MetaHit data from the paper cited above, assembling the data using the CLC Bio de novo assembler -- see http://www.clcdenovo.com/. Using gross statistics (total assembled bp in contigs > 200bp, average contig size > 200bp, N50, etc), my assemblies were better than those reported in the paper using Soap de novo, and required a fraction of the time/cpu/memory of SOAP. In fact, I could run the assemblies in 8Gb of memory. Give it a try!
Can you send me more details on how you did your analysis of the MetaHIT data with CLC Bio? I would like to analyze my own metagenomic samples, and I am a beginner who would like to learn.
Thank you!
-
Price
This paper hasn't been published yet, but this is hands down the best solution for metagenomic assembly I've used:
It's still being updated every couple weeks...