Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • can I remove the N's in my sequence?

    Hello,

    I'm having a problem with mine assembly, I have a bacterial genome of 3.7bp assembled into 24 contigs, which was part of an under submission research, the reviewer gave me as a feedback that I should have a gap free sequence; Unfrontutly I can't not re-do the sequencing or use Pacbio. so I used ragout for reference-assisted assembly to improve the quality of my assembly; I got 1 scaffold of 3.7 bp, but a lot of ns (40000 n), how can I fix this problem? is this software reliable since I got some small fragment left? can I remove the ns without affecting the sequence?

    thank you

  • #2
    Just so we are using the same terminology: An assembly gap is typically represented by a series of N's connecting two contigs; the product of 2+ contigs connected by gaps is referred to as a scaffold.

    Your 24 original contigs contained no gaps (Ns), correct? I would confirm this first

    I have never used Ragout, but it seems to order and connect contigs based on a reference genome; in the process of connecting contigs, gap characters (Ns) are added. There is nothing atypical about this -- Without looking over your manuscript, it is difficult to say why the review took issue with the gaps.

    Anyhow, you can use
    Code:
    seqtk cutN -n 1 ragout.fasta
    to break up the single, 3.7(M?)bp scaffold produced by ragout into a set of gap-free contigs.

    Comment


    • #3
      Originally posted by cstack View Post
      Just so we are using the same terminology: An assembly gap is typically represented by a series of N's connecting two contigs; the product of 2+ contigs connected by gaps is referred to as a scaffold.

      Your 24 original contigs contained no gaps (Ns), correct? I would confirm this first

      I have never used Ragout, but it seems to order and connect contigs based on a reference genome; in the process of connecting contigs, gap characters (Ns) are added. There is nothing atypical about this -- Without looking over your manuscript, it is difficult to say why the review took issue with the gaps.

      Anyhow, you can use
      Code:
      seqtk cutN -n 1 ragout.fasta
      to break up the single, 3.7(M?)bp scaffold produced by ragout into a set of gap-free contigs.
      Thank you for your reply,
      My 24 contigs doesn't contain any N's.
      Im' doing a comparative genomic analysis of 2 strains from an extrem envirenement, I got this msg from the reviewer " In that the strains on which this Genome Report is based can be grown in culture, it is necessary to produce gap-free sequences for both strains, since the data would be more generally useful for the community at large. Given the high coverage that is already available, it would probably be necessary to take a more direct approach (rather than increasing coverage)."

      If I Use this commande I will go back to my initial contigs, No?

      Comment

      Latest Articles

      Collapse

      • seqadmin
        Strategies for Sequencing Challenging Samples
        by seqadmin


        Despite advancements in sequencing platforms and related sample preparation technologies, certain sample types continue to present significant challenges that can compromise sequencing results. Pedro Echave, Senior Manager of the Global Business Segment at Revvity, explained that the success of a sequencing experiment ultimately depends on the amount and integrity of the nucleic acid template (RNA or DNA) obtained from a sample. “The better the quality of the nucleic acid isolated...
        03-22-2024, 06:39 AM
      • seqadmin
        Techniques and Challenges in Conservation Genomics
        by seqadmin



        The field of conservation genomics centers on applying genomics technologies in support of conservation efforts and the preservation of biodiversity. This article features interviews with two researchers who showcase their innovative work and highlight the current state and future of conservation genomics.

        Avian Conservation
        Matthew DeSaix, a recent doctoral graduate from Kristen Ruegg’s lab at The University of Colorado, shared that most of his research...
        03-08-2024, 10:41 AM

      ad_right_rmr

      Collapse

      News

      Collapse

      Topics Statistics Last Post
      Started by seqadmin, Yesterday, 06:37 PM
      0 responses
      10 views
      0 likes
      Last Post seqadmin  
      Started by seqadmin, Yesterday, 06:07 PM
      0 responses
      9 views
      0 likes
      Last Post seqadmin  
      Started by seqadmin, 03-22-2024, 10:03 AM
      0 responses
      51 views
      0 likes
      Last Post seqadmin  
      Started by seqadmin, 03-21-2024, 07:32 AM
      0 responses
      67 views
      0 likes
      Last Post seqadmin  
      Working...
      X