Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • Whitelist subreads instead of ZMWs?

    Hello -

    I was wondering whether it is possible, in the whitelist.txt to use the format:
    Code:
    <movie>/<zmw>/<coord>_<coord>
    rather than the typical:
    Code:
    <movie>/<zmw>
    If not, what other ways are there to eliminate specific subreads, rather than the entire zmw?

    Thanks,
    A

  • #2
    The short answer is no.

    Whitelisting has been primarily used for contamination removal so per molecule has always been the expected use case.

    What is the intention of removing one subread and not another from the same molecule? Could you achieve the same outcome using whitelist and minimum subread length?
    Last edited by rhall; 03-30-2015, 09:07 AM. Reason: typo

    Comment


    • #3
      Hello rhall and thanks for your reply.

      To answer as shortly as possible:
      Due to some assembly issues, I want to select for assembly only those subreads that map to a reference genome for a minimum ratio of their length.

      If a 12 Kb subread only gets 3 Kb mapping to the reference, filter out.
      If a 3.5 Kb subread from the same hole gets 3 Kb mapping to the reference, keep it.

      I was thinking/wondering, that it might be an overkill to remove the whole ZMW.

      Comment


      • #4
        So long as you are not really coverage limited, filtering by ZMW is probably the most straight forward.
        Otherwise you could use an assembler that allows fasta input, https://github.com/PacificBiosciences/FALCON or http://wgs-assembler.sourceforge.net...index.php/PBcR

        Comment


        • #5
          I see. Will consider my options!

          Thanks for the fast replies.

          Cheers!

          Comment

          Latest Articles

          Collapse

          • seqadmin
            Essential Discoveries and Tools in Epitranscriptomics
            by seqadmin


            The field of epigenetics has traditionally concentrated more on DNA and how changes like methylation and phosphorylation of histones impact gene expression and regulation. However, our increased understanding of RNA modifications and their importance in cellular processes has led to a rise in epitranscriptomics research. “Epitranscriptomics brings together the concepts of epigenetics and gene expression,” explained Adrien Leger, PhD, Principal Research Scientist on Modified Bases...
            Yesterday, 07:01 AM
          • seqadmin
            Current Approaches to Protein Sequencing
            by seqadmin


            Proteins are often described as the workhorses of the cell, and identifying their sequences is key to understanding their role in biological processes and disease. Currently, the most common technique used to determine protein sequences is mass spectrometry. While still a valuable tool, mass spectrometry faces several limitations and requires a highly experienced scientist familiar with the equipment to operate it. Additionally, other proteomic methods, like affinity assays, are constrained...
            04-04-2024, 04:25 PM

          ad_right_rmr

          Collapse

          News

          Collapse

          Topics Statistics Last Post
          Started by seqadmin, 04-11-2024, 12:08 PM
          0 responses
          55 views
          0 likes
          Last Post seqadmin  
          Started by seqadmin, 04-10-2024, 10:19 PM
          0 responses
          52 views
          0 likes
          Last Post seqadmin  
          Started by seqadmin, 04-10-2024, 09:21 AM
          0 responses
          45 views
          0 likes
          Last Post seqadmin  
          Started by seqadmin, 04-04-2024, 09:00 AM
          0 responses
          55 views
          0 likes
          Last Post seqadmin  
          Working...
          X