SEQanswers

Go Back   SEQanswers > Sequencing Technologies/Companies > Pacific Biosciences



Similar Threads
Thread Thread Starter Forum Replies Last Post
PBcR hybrid assembly of 6.5MB genome with PacBio and MiSeq DrSpace Bioinformatics 9 06-11-2015 01:40 PM
HGAP Parameters sagarutturkar Pacific Biosciences 2 08-18-2014 08:06 AM
How to use Celera assembler with HGAP Monika_bioinf Pacific Biosciences 10 04-25-2014 01:43 PM
HGAP graph visualisation coldturkey Pacific Biosciences 3 10-09-2013 04:14 PM
HGAP assembly coldturkey Pacific Biosciences 2 04-30-2013 08:23 AM

Reply
 
Thread Tools
Old 05-27-2015, 07:44 PM   #1
kende
Junior Member
 
Location: Japan

Join Date: May 2015
Posts: 4
Default HGAP and PBcR self-correction

I'd like to know about difference between HGAP and PBcR self-correction.
Both pipelines perform DeNovo assembly by using PacBio reads only.
And these have Preassembly step and need to run Quiver after running CeleraAssembler.

What is the difference of these?

Many thanks,

Last edited by kende; 05-28-2015 at 03:46 AM.
kende is offline   Reply With Quote
Old 05-28-2015, 09:32 AM   #2
rhall
Senior Member
 
Location: San Francisco

Join Date: Aug 2012
Posts: 322
Default

The basic difference is the algorithm used to calculate the overlaps / alignments for the preassembly step. Using the MHAP algorithm in PBcR sould be significantly quicker for large genomes than HGAP. The Celera Assembler OLC (overlap layout consensus) after the preassembled reads are generated is essentially the same in both pipelines. One difference with quiver is that is is automatically ran in HGAP, but with PBcR you will have to run the resequencing and quiver step independently.
Check out https://github.com/PacificBiosciences/Bioinformatics-Training/wiki for more info.
rhall is offline   Reply With Quote
Old 05-31-2015, 05:14 PM   #3
kende
Junior Member
 
Location: Japan

Join Date: May 2015
Posts: 4
Default

Thank you very much for your reply!
I understand.
kende is offline   Reply With Quote
Old 08-26-2015, 02:05 AM   #4
ymc
Senior Member
 
Location: Hong Kong

Join Date: Mar 2010
Posts: 498
Default

I only worked with bacterial genomes <10MB using , does it make sense to switch to PBcR-MHAP?

I read the MHAP paper and noticed the following:

PBcR-MHAP seems to be 10x faster but lower in sensitivity. True?

In the MHAP paper, it says it was using -minReadLength 2000 for BLASR. I found that by default, HGAP set -minReadLength to 200. Would it make sense for me to set to 2000?

HGAP takes bax.h5 files but PBcR takes filtered fastq. Does that mean PBcR is missing some preprocessing steps?

What are the advantages of using PBcR-BLASR over HGAP?
ymc is offline   Reply With Quote
Old 08-26-2015, 09:35 AM   #5
rhall
Senior Member
 
Location: San Francisco

Join Date: Aug 2012
Posts: 322
Default

I'll try to give my opinion one question at a time.
Quote:
I only worked with bacterial genomes <10MB using , does it make sense to switch to PBcR-MHAP?
Probably not, unless you sequence enough that HGAP is a computational bottleneck, have difficulties running HGAP on your system for technical reasons, or you are struggling to get sufficient coverage, unlikely given one bacteria a cell. The latest version of PBcR-MHAP may give better low coverage assemblies (see release notes).
Quote:
PBcR-MHAP seems to be 10x faster but lower in sensitivity. True?
I believe this was true when the paper was initially published, but the latest release of PBcR-MHAP has a sensitive setting, which I think is comparable to HGAP, but I have not compared the two.
Quote:
In the MHAP paper, it says it was using -minReadLength 2000 for BLASR. I found that by default, HGAP set -minReadLength to 200. Would it make sense for me to set to 2000?
Given sufficient coverage, likely for <10Mb, increasing the minReadLength will probably increase the robustness of HGAP, but in most cases you will probably see very little difference.
Quote:
HGAP takes bax.h5 files but PBcR takes filtered fastq. Does that mean PBcR is missing some preprocessing steps?
No, the filtered.fastq is a simple subset of the data in the bax.h5, HGAP also runs the same filtering process before running the assembly, but this step is largely hidden. This does get to one of the major reasons I would stick to HGAP, quiver correction requires a cmp.h5, which has to be generated from the bax.h5. in HGAP quiver correction is automatic, when running PBcR the quiver correction has to be done 'manually' at the end.
Quote:
What are the advantages of using PBcR-BLASR over HGAP?
I would expect very similar results, I don't see any advantage other than PBcR may be easier to install and maintain. The disadvantage is that you don't get the quiver correction and the nice SMRT Portal gui for job management.
rhall is offline   Reply With Quote
Old 08-27-2015, 10:16 PM   #6
ymc
Senior Member
 
Location: Hong Kong

Join Date: Mar 2010
Posts: 498
Default

Thanks rhall for your very detailed reply!
ymc is offline   Reply With Quote
Reply

Tags
hgap, pbcr

Thread Tools

Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

BB code is On
Smilies are On
[IMG] code is On
HTML code is Off




All times are GMT -8. The time now is 05:27 PM.


Powered by vBulletin® Version 3.8.9
Copyright ©2000 - 2020, vBulletin Solutions, Inc.
Single Sign On provided by vBSSO