Unconfigured Ad

Collapse
X
 
  • Filter
  • Time
  • Show
Clear All
new posts
  • allo
    Member
    • Jul 2009
    • 15

    PacBio AHA hybrid assembler

    Anybody out there with experience on how to run the PacBio AHA hybrid de novo assembler? I just downloaded the PacBio SMRT-pipeline software and I just cannot find any instructions on how to do it. I read all the READMEs, "index.htlm" and PDFs on the package.
    I want to do an Illumina/ PacBio assembly of a bacterial genome.

    Thanks!
    AlLo
  • allo
    Member
    • Jul 2009
    • 15

    #2
    OK, I dislike answering my own posts but this info may be important to others. The instructions for the SMRT-pipeline are in a document appropriately entitled “SMRT Pipe Reference Guide” on the Algorithms page at the PacBio DevNet website http://www.pacbiodevnet.com/Algorithms. The instructions on how to write the XML files that the pipeline requires are not for Biologists . If you have experience with XML or writing HTML you can do it. I come from the Bio world and it took me four tries to get the pipeline to run. I used the default parameters from the ‘longreads’ example but I would love to have a better explanation for some of these values . I guess I’ll have to play with them and learn from the results although this could get very time consuming. Cheers!

    Comment

    • sagarutturkar
      Member
      • Sep 2010
      • 61

      #3
      Originally posted by allo View Post
      OK, I dislike answering my own posts but this info may be important to others. http://www.pacbiodevnet.com/Algorithms.
      Indeed, keep answering your own post if you have figure out the solution. It help others. Its easy for me to do a quick search on seqanswers rather than searching bunch of pacbio pages. Thanks.

      Comment

      • sagarutturkar
        Member
        • Sep 2010
        • 61

        #4
        XML files for AHA

        HI,

        I have SMRTanalysis pipeline installed but struggling to get it running correctly. I guess some error with XML files creation.

        Can you please provide any pointers regarding this or post xml files that worked for you.

        Thanks
        Sagar

        Comment

        • GenoMax
          Senior Member
          • Feb 2008
          • 7142

          #5
          Sagar,

          Do you have the web interface for SMRTanalysis installed? If you do then try running the test data set analysis through that interface and check if it completes successfully. If it does (it should, if you have things correctly installed) then grab the XML file from that run as a working example.

          We had spent some time on trying to get the command line PacBio tools to work (albeit a year or so ago) but could never get them to work right. We finally gave up on that effort and switched to using the web interface.

          Perhaps the disconnect between the web based SMRTanalysis and the command line tools is still there in the latest version. If I have time I will try to check on that.

          Comment

          • flxlex
            Moderator
            • Nov 2008
            • 412

            #6
            Have a look at the xml files in the installation here /opt/smrtanalysis/common/protocols/ (or $SEYMOUR_HOME/common/protocols/), especially RS_AHA_Scaffolding.1.xml. See also my answer at this thread: http://seqanswers.com/forums/showpos...27&postcount=2

            Comment

            • zhoufan
              Member
              • Feb 2009
              • 14

              #7
              Hi everyone,
              I am trying AHA for scaffolding the contigs assmbled from illumina reads,the P_filter moudule runs well,but when run HybridAssembly module , an error "invalid literal for int() with base 10: ' ' ",it seems like it's a problem relates with python. By the way,i am using smrtPipe v2.0.1 on ubuntu 10.04.
              can any body help?thanks.

              Comment

              • sagarutturkar
                Member
                • Sep 2010
                • 61

                #8
                Hi zhoufan,

                It seems like formatting issues with your contig assembly. Are you transferring these files between windows and linux? That might be the issue. Another way is to format your assembly with FASTX toolkit - FASTA Formatter (http://hannonlab.cshl.edu/fastx_tool...mmandline.html).

                Also, make sure your fasta headers are kept at minimum length and does not have special characters.

                I am not sure if this will work, but worth trying.

                Thanks

                Comment

                • sagarutturkar
                  Member
                  • Sep 2010
                  • 61

                  #9
                  Also you should look at the thread
                  http://seqanswers.com/forums/showthr...7090#post97090 and reply #21 onwards.

                  Comment

                  • zhoufan
                    Member
                    • Feb 2009
                    • 14

                    #10
                    Hi sagarutturkar,
                    Thanks for your help! The problem is there have space in the fatsta header.

                    Comment

                    Latest Articles

                    Collapse

                    • SEQadmin2
                      Nine Things a Sample Prep Scientist Thinks About Before Sequencing
                      by SEQadmin2


                      I’m not a sequencing expert. I’m a purification scientist who uses NGS to evaluate workflows my group develops. With this perspective, we think about the sample first and the NGS workflow second. The sequencer is an exceptionally honest reporter, but it can only report on what you give it, so whether you get clean, interpretable data from an NGS workflow is largely determined before you begin.

                      Here are nine questions we think about, in roughly the order they matter, before...
                      06-18-2026, 07:11 AM
                    • SEQadmin2
                      From Collection to Sequencing: Why Sample Preparation and Preservation Define Sequencing Data
                      by SEQadmin2


                      Data variability is still an issue in sequencing technologies despite the advances in reproducibility and accuracy of these platforms. But the problem does not originate in the sequencing itself, but in the previous steps, before the sample reaches the sequencer.


                      The first step is collection, followed by preservation and sample preparation for analysis. Most scientists overlook those steps, but not being careful might just be skewing the experiment’s results.
                      ...
                      06-02-2026, 10:05 AM

                    ad_right_rmr

                    Collapse

                    News

                    Collapse

                    Topics Statistics Last Post
                    Started by SEQadmin2, 06-17-2026, 06:09 AM
                    0 responses
                    41 views
                    0 reactions
                    Last Post SEQadmin2  
                    Started by SEQadmin2, 06-09-2026, 11:58 AM
                    0 responses
                    102 views
                    0 reactions
                    Last Post SEQadmin2  
                    Started by SEQadmin2, 06-05-2026, 10:09 AM
                    0 responses
                    123 views
                    0 reactions
                    Last Post SEQadmin2  
                    Started by SEQadmin2, 06-04-2026, 08:59 AM
                    0 responses
                    114 views
                    0 reactions
                    Last Post SEQadmin2  
                    Working...