
  • Successive rounds of gapfilling and scaffolding

    Hi,

    we've been experimenting with successive rounds of gapfilling and scaffolding, and our genome stats improve with each round. Basically, we run several iterations of gapfilling, which generates new contig ends with potential mapping sites for our mate-pair libraries, and then re-scaffold the genome. Apart from not being able to really assess the quality of the new scaffolds generated this way, we wondered whether re-scaffolding after gapfilling should be done at the contig level (i.e. gapfill, break the scaffolds down into contigs, which are now longer thanks to the gapfilling, and re-scaffold), or whether it would be better to re-scaffold the gapfilled scaffolds directly. In theory, breaking the scaffolds down into contigs after gapfilling would make it possible to place contigs into existing gaps that they could not be assigned to before, because the sequence at the contig ends was still missing.

    Has anyone tried this and can share their experience?

    Thanks,
    Zapp
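
For readers wondering what "breaking down the scaffolds to contigs" could look like in practice, here is a minimal sketch. It assumes gaps are represented as runs of N in the scaffold sequence, which is the usual FASTA convention. The function names and the `min_gap` threshold are hypothetical and are not taken from any particular scaffolder or gapfiller.

```python
import re


def split_scaffold(seq, min_gap=1):
    """Split one scaffold sequence into contigs at runs of N/n.

    Only runs of at least `min_gap` bases count as gaps; shorter runs
    are treated as ambiguous bases and stay inside the contig.
    """
    gap = re.compile(r"[Nn]{%d,}" % min_gap)
    # Drop empty pieces that arise from leading or trailing gaps.
    return [piece for piece in gap.split(seq) if piece]


def scaffolds_to_contigs(scaffolds, min_gap=1):
    """Map {scaffold_name: sequence} to {contig_name: sequence}."""
    contigs = {}
    for name, seq in scaffolds.items():
        for i, contig in enumerate(split_scaffold(seq, min_gap), start=1):
            contigs[f"{name}_contig{i}"] = contig
    return contigs


if __name__ == "__main__":
    example = {"scaffold1": "ACGTACGTNNNNNNNNNNGGCCGGCCNTTAA"}
    # With min_gap=10, the single N near the end is not treated as a gap.
    for name, seq in scaffolds_to_contigs(example, min_gap=10).items():
        print(name, seq)
```

Raising `min_gap` keeps isolated ambiguous bases inside contigs, so only real scaffold gaps are broken. A gap that was completely closed by gapfilling no longer contains any N, so the two flanking contigs stay merged after the split.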

  • #2
    Gapfilled regions of your genome will be of lower quality than the parts created by k-mer extension in the original assembly, so too many rounds of gapfilling will start to introduce more errors. Because of this, I would lean towards keeping your scaffolds intact for further rounds of scaffolding. That way, any newly mismapped mate pairs in gapfilled regions don’t invalidate the higher-quality scaffolding links that have already been made.



    • #3
      That sounds reasonable, thank you very much.

      Regards,
      Zapp



      • #4
        Stand-alone scaffolding will never be as good as a built-in scaffolder that can take in all the information used for the assembly...
