Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • HGAP assembly

    I recently installed SMRT portal on a virtual server. We used Ubuntu 12.04 server as we have SMRT portal all all associated programs (SMRT view, HGAP etc) running well on a desktop machine which is also running Ubuntu 12.04.

    I have completed a number of HGAP assemblies on the desktop machine and these ran without any problems. However, when the same jobs are run on the server everything proceed as normal untill the assembly stage. It appears that the reads are not being assembly or that step in the HGAP pipeline is not running.

    For example the assembly.qc file (located in the assembly dir) only contains the sections

    [Unitig Consensus]
    [Contig Consensus]
    [Read Depth Histogram]

    no sections detailing information of Scaffold numbers, contig numbers etc are produced.

    Additionally, HGAP assembly of 2 smrt cells (on the desktop machine) produces an assembly containing 200 contigs while on the server the consensus.fasta.gz file contains over 3000 fasta entries.

    I am assuming that the the assembler is not running or at least not running correctly and the output is merely the corrected long reads produced in the first step of the HGAP assembly. Is this correct?

    Have you encountered this issue before?

    Regards

    Brian

  • #2
    I'm having similar fun getting HGAP & Quiver to run correctly under Ubuntu.

    In the directory you launched from, there is an sge_log directory. If any of these have serious error messages (can't find an executable or shared library) that may help

    Good luck!

    Comment


    • #3
      A consensus.fasta.gz file would not be produced unless the Celera Assembler stage had completed and the raw reads mapped back to the draft contigs. HGAP is deterministic so you should get the same results on the two machines.
      The most common system issues causing the celera assembler step to fail are sge queue setup problems.
      Can you post the ./log/smrtpipe.log for the job?

      Comment

      Latest Articles

      Collapse

      • seqadmin
        Current Approaches to Protein Sequencing
        by seqadmin


        Proteins are often described as the workhorses of the cell, and identifying their sequences is key to understanding their role in biological processes and disease. Currently, the most common technique used to determine protein sequences is mass spectrometry. While still a valuable tool, mass spectrometry faces several limitations and requires a highly experienced scientist familiar with the equipment to operate it. Additionally, other proteomic methods, like affinity assays, are constrained...
        04-04-2024, 04:25 PM
      • seqadmin
        Strategies for Sequencing Challenging Samples
        by seqadmin


        Despite advancements in sequencing platforms and related sample preparation technologies, certain sample types continue to present significant challenges that can compromise sequencing results. Pedro Echave, Senior Manager of the Global Business Segment at Revvity, explained that the success of a sequencing experiment ultimately depends on the amount and integrity of the nucleic acid template (RNA or DNA) obtained from a sample. “The better the quality of the nucleic acid isolated...
        03-22-2024, 06:39 AM

      ad_right_rmr

      Collapse

      News

      Collapse

      Topics Statistics Last Post
      Started by seqadmin, 04-11-2024, 12:08 PM
      0 responses
      18 views
      0 likes
      Last Post seqadmin  
      Started by seqadmin, 04-10-2024, 10:19 PM
      0 responses
      22 views
      0 likes
      Last Post seqadmin  
      Started by seqadmin, 04-10-2024, 09:21 AM
      0 responses
      16 views
      0 likes
      Last Post seqadmin  
      Started by seqadmin, 04-04-2024, 09:00 AM
      0 responses
      47 views
      0 likes
      Last Post seqadmin  
      Working...
      X