Unconfigured Ad

Collapse
X
 
  • Filter
  • Time
  • Show
Clear All
new posts
  • chanderbio
    Junior Member
    • Jul 2009
    • 3

    Assembly of short reads in Metagenomic studies

    Are there any Metagenomic studies which used an assembler for generating contigs/longer reads from reads produced by Next Generation Sequencers?
  • rglover
    rg
    • Dec 2008
    • 51

    #2
    We do this routinely on our metagenomics datasets - we're using the CLCbio de novo assembler although we're sequencing with a 454 rather than a shorter read technology. Paper here if you want some more details.

    Comment

    • cliffbeall
      Senior Member
      • Jan 2010
      • 144

      #3
      Sorry to be late in answering, but there is a recent study that did assembly on the gut metagenome with Illumina:

      Qin et al, Nature 464, 59-65 A human gut microbial gene catalogue established by metagenomic sequencing.

      The scale of the sequencing they did is pretty enormous, 576.7 gigabases, not sure if the methodology is applicable to smaller scales.

      Comment

      • Suzanne
        Registered Vendor
        • Nov 2007
        • 10

        #4
        PANGEA for next gen data assembly

        Hi- you might want to check out this paper too:

        PANGEA: pipeline for analysis of next generation amplicons

        The ISME Journal 4, 852-861 (July 2010) | doi:10.1038/ismej.2010.16

        Adriana Giongo, David B Crabb, Austin G Davis-Richardson, Diane Chauliac, Jennifer M Mobberley, Kelsey A Gano, Nabanita Mukherjee, George Casella, Luiz FW Roesch, Brandon Walts, Alberto Riva, Gary King and Eric W Triplett

        MO BIO Laboratories, Inc.
        Saving You Time for Life

        Comment

        • greigite
          Senior Member
          • Mar 2009
          • 145

          #5
          PANGEA is for amplicon sequencing only though- a somewhat different problem not requiring assembly. There are various of these amplicon analysis pipelines and they all have variants of the following steps: quality filtering, read trimming, clustering, BLAST, taxonomic assignment, comparison of relative abundances across data sets. They don't involve assembly, though I guess if you were doing Illumina PE amplicon sequencing you'd have to join the two overlapping ends to get a single sequence.

          Originally posted by Suzanne View Post
          Hi- you might want to check out this paper too:

          PANGEA: pipeline for analysis of next generation amplicons

          The ISME Journal 4, 852-861 (July 2010) | doi:10.1038/ismej.2010.16

          Adriana Giongo, David B Crabb, Austin G Davis-Richardson, Diane Chauliac, Jennifer M Mobberley, Kelsey A Gano, Nabanita Mukherjee, George Casella, Luiz FW Roesch, Brandon Walts, Alberto Riva, Gary King and Eric W Triplett

          http://www.nature.com/ismej/journal/...ej201016a.html

          Comment

          • Thomieh
            Junior Member
            • May 2010
            • 2

            #6
            Check these:
            - Increasing the coverage of a metapopulation consensus genome by iterative read mapping and assembly. Dutilh BE, Huynen MA, Strous M., Bioinformatics 2009 vol. 25 (21) pp. 2878-81
            - The metagenome of a biogas-producing microbial community of a production-scale biogas plant fermenter analysed by the 454-pyrosequencing technology. Schlüter et al.,
            Journal of biotechnology, 2008 vol. 136 (1-2) pp. 77-90.

            Metagenomes can be hard to assemble, but this really depends on the diversity found in the ecosystem you are studying. In studies discussing simple ecosystems where only a few species are found it is possible to do this. In more complex environments it really depends on how deep your sequencing is. Most metagenomes of complex environments don't achieve a coverage of more than 1 and then assembly is not easy.

            And when you do an assembly, the question still remains, how can you test that what you have assembled is really valid. For that you need to be able to amplify the fragment using PCR and resequence it.

            Comment

            • saul
              Junior Member
              • Aug 2010
              • 1

              #7
              I've recently been playing with the MetaHit data from the paper cited above, assembling the data using the CLC Bio de novo assembler -- see http://www.clcdenovo.com/. Using gross statistics (total assembled bp in contigs > 200bp, average contig size > 200bp, N50, etc), my assemblies were better than those reported in the paper using Soap de novo, and required a fraction of the time/cpu/memory of SOAP. In fact, I could run the assemblies in 8Gb of memory. Give it a try!

              Comment

              • barabara
                Junior Member
                • May 2011
                • 1

                #8
                Originally posted by saul View Post
                I've recently been playing with the MetaHit data from the paper cited above, assembling the data using the CLC Bio de novo assembler -- see http://www.clcdenovo.com/. Using gross statistics (total assembled bp in contigs > 200bp, average contig size > 200bp, N50, etc), my assemblies were better than those reported in the paper using Soap de novo, and required a fraction of the time/cpu/memory of SOAP. In fact, I could run the assemblies in 8Gb of memory. Give it a try!
                Hi Saul,
                can you send me more details how you did your analysis of the MetaHit data with CLCBio? I would like to try to analyze my metagenomic samples and I am beginner who would like to learn

                Thank you!

                Comment

                • raw937
                  Member
                  • Aug 2010
                  • 18

                  #9
                  META-Velvet!!!

                  Check it out or Abyss is also good!

                  Comment

                  • ucpete
                    Member
                    • Dec 2008
                    • 35

                    #10
                    Price

                    This paper hasn't been published yet, but this is hands down the best solution for metagenomic assembly I've used:



                    It's still being updated every couple weeks...

                    Comment

                    Latest Articles

                    Collapse

                    • SEQadmin2
                      Nine Things a Sample Prep Scientist Thinks About Before Sequencing
                      by SEQadmin2


                      I’m not a sequencing expert. I’m a purification scientist who uses NGS to evaluate workflows my group develops. With this perspective, we think about the sample first and the NGS workflow second. The sequencer is an exceptionally honest reporter, but it can only report on what you give it, so whether you get clean, interpretable data from an NGS workflow is largely determined before you begin.


                      Here are nine questions we think about, in roughly the order they matter, before...
                      06-18-2026, 07:11 AM
                    • SEQadmin2
                      From Collection to Sequencing: Why Sample Preparation and Preservation Define Sequencing Data
                      by SEQadmin2


                      Data variability is still an issue in sequencing technologies despite the advances in reproducibility and accuracy of these platforms. But the problem does not originate in the sequencing itself, but in the previous steps, before the sample reaches the sequencer.


                      The first step is collection, followed by preservation and sample preparation for analysis. Most scientists overlook those steps, but not being careful might just be skewing the experiment’s results.
                      ...
                      06-02-2026, 10:05 AM
                    • SEQadmin2
                      Single-Cell Sequencing at an Inflection Point: Early Impacts of New Platforms and Emerging Trends
                      by SEQadmin2


                      With the launch of new single-cell sequencing platforms in 2026, the field stands at an exciting inflection point. This article surveys the most impactful advances in the field and discusses how they’re reshaping research in cancer, immunology, and beyond.


                      Introduction

                      Single-cell sequencing technologies have undergone remarkable advances over the past decade, transitioning from low-throughput experimental approaches to highly scalable platforms capable of...
                      05-22-2026, 06:42 AM

                    ad_right_rmr

                    Collapse

                    News

                    Collapse

                    Topics Statistics Last Post
                    Started by SEQadmin2, 06-17-2026, 06:09 AM
                    0 responses
                    24 views
                    0 reactions
                    Last Post SEQadmin2  
                    Started by SEQadmin2, 06-09-2026, 11:58 AM
                    0 responses
                    40 views
                    0 reactions
                    Last Post SEQadmin2  
                    Started by SEQadmin2, 06-05-2026, 10:09 AM
                    0 responses
                    47 views
                    0 reactions
                    Last Post SEQadmin2  
                    Started by SEQadmin2, 06-04-2026, 08:59 AM
                    0 responses
                    49 views
                    0 reactions
                    Last Post SEQadmin2  
                    Working...