  • Partial Order Alignment Step

    I'm working through Jared and Nick's Nature Methods de novo assembly approach on my Lambda burn-in FAST5 data, just to get the pipeline up and running and gain some familiarity with it on a focused data set.

    I've successfully converted FAST5 to FASTA using poretools. Using the nanocorrect pipeline, I've run the DALIGNER steps and am now running the Partial Order Alignment step with poaV2.

    It is working and the corrected.fasta file is growing... slowly. I've been tailing the file, and the output, when BLASTed, shows 95% accuracy against the NCBI RefSeq Lambda phage sequence. It's been chugging along for a day. I have a 16-core 3.7 GHz setup with 64 GB of RAM and plenty of SSD space to spare. The step is only using a single thread (based on my system process utilization), and it has only used 150 MB of working RAM.

    Wondering what others have done to parallelize this step, or what can be done to speed it up?
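
    One general way to use more cores for a single-threaded correction step is to split the reads into index ranges and run one process per range. The Python sketch below shows only that pattern. The command template is a placeholder supplied by the caller (it is not nanocorrect's actual command line), and it assumes the per-range command writes its corrected reads to stdout and that ranges can be processed independently.

```python
import subprocess
from concurrent.futures import ThreadPoolExecutor


def make_ranges(n_reads, chunk_size):
    """Split read indices 0..n_reads-1 into half-open (start, end) ranges."""
    return [(start, min(start + chunk_size, n_reads))
            for start in range(0, n_reads, chunk_size)]


def run_chunk(cmd_template, start, end):
    """Run one worker process for the reads in [start, end).

    cmd_template is a list of argument strings; "{start}" and "{end}"
    are filled in per chunk. The command itself is hypothetical and
    must be replaced with whatever your correction step accepts.
    """
    cmd = [arg.format(start=start, end=end) for arg in cmd_template]
    result = subprocess.run(cmd, capture_output=True, text=True, check=True)
    return result.stdout


def correct_in_parallel(cmd_template, n_reads, chunk_size, workers):
    """Run all chunks with up to `workers` processes at a time.

    Threads are enough here because the heavy work happens in the
    child processes. Outputs are concatenated in chunk order.
    """
    ranges = make_ranges(n_reads, chunk_size)
    with ThreadPoolExecutor(max_workers=workers) as pool:
        outputs = pool.map(lambda r: run_chunk(cmd_template, *r), ranges)
        return "".join(outputs)
```

    With 16 cores, something like `correct_in_parallel(template, n_reads, 50, 16)` would keep every core busy, provided the tool really can be invoked on a sub-range of reads. The chunk size of 50 is arbitrary.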

  • #2
    I used PBcR and then nanopolish with 2D reads only and I got good results. Is there a much better pipeline than PBcR+nanopolish?



    • #3
      Nanocorrect (daligner + poa) is the step preceding the Celera assembly and nanopolish. That is to say, PBcR and nanopolish come next once the POA is done... when it gets done.



      • #4
        Thanks. Let me give it a try.



        • #5
          Ah. Nanocorrect outputs FASTA but PBcR requires FASTQ input. How do you deal with that?
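
          One common workaround when a tool insists on FASTQ is to attach a constant placeholder quality to every base. The Python sketch below does that. The quality character `5` (Phred 20 in the Phred+33 encoding) is an arbitrary assumption, the function names are made up for illustration, and the resulting qualities carry no real information, so whether this is acceptable depends on how the downstream tool uses them.

```python
def read_fasta(lines):
    """Yield (name, sequence) pairs from an iterable of FASTA lines."""
    name, chunks = None, []
    for line in lines:
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            # A new header closes the previous record, if any.
            if name is not None:
                yield name, "".join(chunks)
            name, chunks = line[1:], []
        else:
            chunks.append(line)
    if name is not None:
        yield name, "".join(chunks)


def fasta_to_fastq(lines, qual_char="5"):
    """Return FASTQ text with one placeholder quality per base.

    qual_char is an assumed constant; "5" is Phred 20 in Phred+33.
    """
    records = []
    for name, seq in read_fasta(lines):
        records.append(f"@{name}\n{seq}\n+\n{qual_char * len(seq)}\n")
    return "".join(records)
```

          For a file on disk, `fasta_to_fastq(open("corrected.fasta"))` returns the FASTQ text, which can then be written out to a new file.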



          • #6
            Interesting. I tried the combined PBcR MHAP pipeline with the oxford.spec and arrived at an assembly in 20 minutes with a 98% match to the NCBI RefSeq for Lambda.

            The DALIGNER, POA and runCA route with the oxford.spec arrived at the assembly after 1.5 days with a >99% match to the NCBI RefSeq for Lambda.

            The major difference seems to be that the latter is more accurate in the homopolymer runs.

            Still, for rapid identification and other purposes, the PBcR MHAP pipeline is more than adequate.

            -Tom



            • #7
              How did you obtain the frg file needed for runCA? I suppose you only had one FASTA file from the nanocorrect pipeline without any qual file, right?
