  • Filtering Contigs by Coverage

    Hi all,

    I have assembled a dataset into contigs and would now like to filter those contigs by coverage. My goal is to remove the low-coverage contigs so that I can work only with the high-coverage ones.

    Does anybody know whether this is already implemented in an existing sequence analysis program? I know I could write some code to do it myself, but I figured it was worth asking first, since I haven't been able to find anything that does this.

    Thanks for your help!

  • #2
    Which assembler did you use? Some will output this information in the FASTA header, or in a separate statistics file, in which case it is easy to write a script to extract the ones you want. Otherwise you will need to align the reads back to the contigs to get this info.


    • #3
      Thanks nickloman,

      I used IDBA-UD. I'll look into whether it gives me the output you mentioned. Otherwise I will go ahead and re-align the reads and work from there. I was just curious whether there were already programs to do this, and it looks like the answer is that some assemblers do it automatically. Thanks for the answer!
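
For the re-alignment route, here is a rough sketch of how per-contig mean depth could be computed once the reads are mapped back. It assumes the per-position depths are available as three tab-separated columns (contig, position, depth), which is the layout `samtools depth` writes for a single BAM file, and that contig lengths are known separately so positions with no reads still count as zero. The function names and the threshold are made up for illustration.

```python
from collections import defaultdict


def mean_depth_per_contig(depth_lines, contig_lengths):
    """Return {contig: mean depth} from 'contig<TAB>pos<TAB>depth' lines.

    contig_lengths maps contig name -> length, so positions missing from
    the depth lines (zero coverage) are still counted in the mean.
    """
    totals = defaultdict(int)
    for line in depth_lines:
        line = line.strip()
        if not line:
            continue
        contig, _pos, depth = line.split("\t")
        totals[contig] += int(depth)
    return {
        name: totals[name] / length
        for name, length in contig_lengths.items()
        if length > 0
    }


def contigs_above(depth_lines, contig_lengths, min_mean_depth):
    """Names of contigs whose mean depth is at least min_mean_depth (hypothetical cutoff)."""
    means = mean_depth_per_contig(depth_lines, contig_lengths)
    return sorted(name for name, d in means.items() if d >= min_mean_depth)
```

The resulting list of names can then be used to pull the matching records out of the contig FASTA file.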


      • #4
        MIRA does this automatically, as do most k-mer-based assemblers, which report coverage as k-mer coverage.
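
When the assembler does write coverage into the FASTA header, the filtering script mentioned above can be quite short. The sketch below is only an illustration: it assumes the header contains a field like `cov_12.5` (the default pattern is an assumption and should be changed to match whatever the assembler actually writes), and the function names are hypothetical. Note that if the reported value is k-mer coverage, the cutoff has to be chosen on that scale rather than as read coverage.

```python
import re

# Assumed header field, e.g. ">contig1_cov_12.5"; adjust to the real header format.
COV_PATTERN = re.compile(r"cov_([0-9]+(?:\.[0-9]+)?)")


def read_fasta(lines):
    """Yield (header, sequence) pairs from an iterable of FASTA lines."""
    header, seq = None, []
    for line in lines:
        line = line.rstrip("\n")
        if line.startswith(">"):
            if header is not None:
                yield header, "".join(seq)
            header, seq = line[1:], []
        elif header is not None:
            seq.append(line.strip())
    if header is not None:
        yield header, "".join(seq)


def filter_by_header_coverage(lines, min_cov, pattern=COV_PATTERN):
    """Keep records whose header coverage is at least min_cov.

    Records with no recognizable coverage field are dropped.
    """
    kept = []
    for header, seq in read_fasta(lines):
        match = pattern.search(header)
        if match and float(match.group(1)) >= min_cov:
            kept.append((header, seq))
    return kept
```

To run it on a file, something like `filter_by_header_coverage(open("contigs.fa"), 10)` would return the retained records, which can then be written back out in FASTA format.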
