Unconfigured Ad

Collapse
X
 
  • Filter
  • Time
  • Show
Clear All
new posts
  • kende
    Junior Member
    • May 2015
    • 4

    HGAP and PBcR self-correction

    I'd like to know about difference between HGAP and PBcR self-correction.
    Both pipelines perform DeNovo assembly by using PacBio reads only.
    And these have Preassembly step and need to run Quiver after running CeleraAssembler.

    What is the difference of these?

    Many thanks,
    Last edited by kende; 05-28-2015, 02:46 AM.
  • rhall
    Senior Member
    • Aug 2012
    • 324

    #2
    The basic difference is the algorithm used to calculate the overlaps / alignments for the preassembly step. Using the MHAP algorithm in PBcR sould be significantly quicker for large genomes than HGAP. The Celera Assembler OLC (overlap layout consensus) after the preassembled reads are generated is essentially the same in both pipelines. One difference with quiver is that is is automatically ran in HGAP, but with PBcR you will have to run the resequencing and quiver step independently.
    Check out https://github.com/PacificBioscience...-Training/wiki for more info.

    Comment

    • kende
      Junior Member
      • May 2015
      • 4

      #3
      Thank you very much for your reply!
      I understand.

      Comment

      • ymc
        Senior Member
        • Mar 2010
        • 496

        #4
        I only worked with bacterial genomes <10MB using , does it make sense to switch to PBcR-MHAP?

        I read the MHAP paper and noticed the following:

        PBcR-MHAP seems to be 10x faster but lower in sensitivity. True?

        In the MHAP paper, it says it was using -minReadLength 2000 for BLASR. I found that by default, HGAP set -minReadLength to 200. Would it make sense for me to set to 2000?

        HGAP takes bax.h5 files but PBcR takes filtered fastq. Does that mean PBcR is missing some preprocessing steps?

        What are the advantages of using PBcR-BLASR over HGAP?

        Comment

        • rhall
          Senior Member
          • Aug 2012
          • 324

          #5
          I'll try to give my opinion one question at a time.
          I only worked with bacterial genomes <10MB using , does it make sense to switch to PBcR-MHAP?
          Probably not, unless you sequence enough that HGAP is a computational bottleneck, have difficulties running HGAP on your system for technical reasons, or you are struggling to get sufficient coverage, unlikely given one bacteria a cell. The latest version of PBcR-MHAP may give better low coverage assemblies (see release notes).
          PBcR-MHAP seems to be 10x faster but lower in sensitivity. True?
          I believe this was true when the paper was initially published, but the latest release of PBcR-MHAP has a sensitive setting, which I think is comparable to HGAP, but I have not compared the two.
          In the MHAP paper, it says it was using -minReadLength 2000 for BLASR. I found that by default, HGAP set -minReadLength to 200. Would it make sense for me to set to 2000?
          Given sufficient coverage, likely for <10Mb, increasing the minReadLength will probably increase the robustness of HGAP, but in most cases you will probably see very little difference.
          HGAP takes bax.h5 files but PBcR takes filtered fastq. Does that mean PBcR is missing some preprocessing steps?
          No, the filtered.fastq is a simple subset of the data in the bax.h5, HGAP also runs the same filtering process before running the assembly, but this step is largely hidden. This does get to one of the major reasons I would stick to HGAP, quiver correction requires a cmp.h5, which has to be generated from the bax.h5. in HGAP quiver correction is automatic, when running PBcR the quiver correction has to be done 'manually' at the end.
          What are the advantages of using PBcR-BLASR over HGAP?
          I would expect very similar results, I don't see any advantage other than PBcR may be easier to install and maintain. The disadvantage is that you don't get the quiver correction and the nice SMRT Portal gui for job management.

          Comment

          • ymc
            Senior Member
            • Mar 2010
            • 496

            #6
            Thanks rhall for your very detailed reply!

            Comment

            Latest Articles

            Collapse

            • SEQadmin2
              Nine Things a Sample Prep Scientist Thinks About Before Sequencing
              by SEQadmin2


              I’m not a sequencing expert. I’m a purification scientist who uses NGS to evaluate workflows my group develops. With this perspective, we think about the sample first and the NGS workflow second. The sequencer is an exceptionally honest reporter, but it can only report on what you give it, so whether you get clean, interpretable data from an NGS workflow is largely determined before you begin.

              Here are nine questions we think about, in roughly the order they matter, before...
              06-18-2026, 07:11 AM
            • SEQadmin2
              From Collection to Sequencing: Why Sample Preparation and Preservation Define Sequencing Data
              by SEQadmin2


              Data variability is still an issue in sequencing technologies despite the advances in reproducibility and accuracy of these platforms. But the problem does not originate in the sequencing itself, but in the previous steps, before the sample reaches the sequencer.


              The first step is collection, followed by preservation and sample preparation for analysis. Most scientists overlook those steps, but not being careful might just be skewing the experiment’s results.
              ...
              06-02-2026, 10:05 AM

            ad_right_rmr

            Collapse

            News

            Collapse

            Topics Statistics Last Post
            Started by SEQadmin2, 06-26-2026, 11:10 AM
            0 responses
            10 views
            0 reactions
            Last Post SEQadmin2  
            Started by SEQadmin2, 06-17-2026, 06:09 AM
            0 responses
            45 views
            0 reactions
            Last Post SEQadmin2  
            Started by SEQadmin2, 06-09-2026, 11:58 AM
            0 responses
            105 views
            0 reactions
            Last Post SEQadmin2  
            Started by SEQadmin2, 06-05-2026, 10:09 AM
            0 responses
            125 views
            0 reactions
            Last Post SEQadmin2  
            Working...