Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • Hybrid assembly with low coverage PacBio libraries

    Hi everyone,

    We are working on sequencing the genomes of a few bacteria and microalgae. We recently had a PacBio run which provided pretty bad results (only 76 MB of data - ~15x coverage of our ~4.5MB genome, average read length ~5,800 bp) which could not be assembled into anything coherent using HGAP. Canu assembler provided what looked like better results (106-110 contigs, N50 ~23kb) but these still covered less than one-half of the expected genome size (sum of contig length was ~2.3 MB, and our genome is estimated at ~4.5MB).

    Given that we are having issues with getting high-quality DNA for PacBio from some of our cultures, we were thinking of making an Illumina library to use these reads for a hybrid assembly. Does anyone have any experience with using such a hybrid method with low-coverage PacBio data? Specifically, are there any recommended tools or parameters suggested for such an assembly?

    One point to keep in mind is that our microalgae cultures are not axenic so we expect a (low) level of contaminating DNA.

    Any recommendations will be very helpful.

    Thanks
    Daniel

  • #2
    Hi, Daniel,

    I have similar question. My illumina reads assembly covered about 50% of the genome. Do you have some updated ideas to share?

    Thanks.

    C

    Comment


    • #3
      Hi Capricy - no updated ideas yet... How do you know your Illumina assembly covers only 50% of the genome? What kind of libraries did you use, and what coverage did you get on the assembled parts?
      Cheers
      Daniel

      Comment


      • #4
        The data were from E coli. New isolates. Library is illumina miseq. Coverage was estimated from the assembly/reference alignment.

        Comment


        • #5
          I don't know much about the genomics of different E. coli strains but isn't that coverage relatively low? If you just mapped the reads to one or more of the known E. coli genomes, what coverage would you get? That might help you see whether you have a very weird genome (e.g. very large genomic islands which do not map to your reference sequence) or whether your assembly isn't very good. With our larger cyanobacterial genomes I think the low coverage is because the PacBio assembly isn't very good.

          BTW we recently published a comparative analysis of what you get when you close a genome compared to fragmented de-novo assemblies for a different heterotrophic bacterium (http://journal.frontiersin.org/artic...016.00248/full). This has some information on the number of sequences and coverage we got that might be worth comparing with your results.

          Comment

          Latest Articles

          Collapse

          • seqadmin
            Strategies for Sequencing Challenging Samples
            by seqadmin


            Despite advancements in sequencing platforms and related sample preparation technologies, certain sample types continue to present significant challenges that can compromise sequencing results. Pedro Echave, Senior Manager of the Global Business Segment at Revvity, explained that the success of a sequencing experiment ultimately depends on the amount and integrity of the nucleic acid template (RNA or DNA) obtained from a sample. “The better the quality of the nucleic acid isolated...
            03-22-2024, 06:39 AM
          • seqadmin
            Techniques and Challenges in Conservation Genomics
            by seqadmin



            The field of conservation genomics centers on applying genomics technologies in support of conservation efforts and the preservation of biodiversity. This article features interviews with two researchers who showcase their innovative work and highlight the current state and future of conservation genomics.

            Avian Conservation
            Matthew DeSaix, a recent doctoral graduate from Kristen Ruegg’s lab at The University of Colorado, shared that most of his research...
            03-08-2024, 10:41 AM

          ad_right_rmr

          Collapse

          News

          Collapse

          Topics Statistics Last Post
          Started by seqadmin, Yesterday, 06:37 PM
          0 responses
          10 views
          0 likes
          Last Post seqadmin  
          Started by seqadmin, Yesterday, 06:07 PM
          0 responses
          10 views
          0 likes
          Last Post seqadmin  
          Started by seqadmin, 03-22-2024, 10:03 AM
          0 responses
          51 views
          0 likes
          Last Post seqadmin  
          Started by seqadmin, 03-21-2024, 07:32 AM
          0 responses
          67 views
          0 likes
          Last Post seqadmin  
          Working...
          X