Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • How to correct wrong adapter splitting in PacBio films?

    While processing IsoSeq data, I observed that close to 10% of the films had adapter splitting problems.
    The pattern is shown in the graphic hereunder with 3 examples :



    Each graph correspond to one film (ZMW) which contains 20 ordered reads.
    The graph shows the read length found after adapter splitting.
    The one on the top is as awaited.
    The second one shows a couple of reads which have not been split in position 13.
    The last shows four errors.

    When films contain multiple reads (let's say over 5, which is common in IsoSeq), it should be quiet easy to compare each read length to the median and decide if there has been a problem in the adapter splitting.

  • #2
    Hi,

    The CCS algorithm is aware of possible incorrect adapter calling issues. It won't look at subreads that severely diverges from the rest of the subreads in terms of length and quality.

    What you see here is not entirely uncommon, but in most cases will be correctly dealt with by the CCS algorithm by discarding those anomaly subreads.

    Comment


    • #3
      Here you have a plot of the median length of the reads versus the length of the RoI for a library.



      Most of the RoI lengths seem to match the median length of the ZMW reads . Is it possible to correct the RoI in which this is not the case?

      Comment


      • #4
        I am assuming this plot is from the older version of CCS? CCS2 by default removes any subread that is over twice the size of the median subread (see here: https://github.com/PacificBioscience...nsensus.h#L288).

        Comment

        Latest Articles

        Collapse

        • seqadmin
          Essential Discoveries and Tools in Epitranscriptomics
          by seqadmin




          The field of epigenetics has traditionally concentrated more on DNA and how changes like methylation and phosphorylation of histones impact gene expression and regulation. However, our increased understanding of RNA modifications and their importance in cellular processes has led to a rise in epitranscriptomics research. “Epitranscriptomics brings together the concepts of epigenetics and gene expression,” explained Adrien Leger, PhD, Principal Research Scientist...
          04-22-2024, 07:01 AM
        • seqadmin
          Current Approaches to Protein Sequencing
          by seqadmin


          Proteins are often described as the workhorses of the cell, and identifying their sequences is key to understanding their role in biological processes and disease. Currently, the most common technique used to determine protein sequences is mass spectrometry. While still a valuable tool, mass spectrometry faces several limitations and requires a highly experienced scientist familiar with the equipment to operate it. Additionally, other proteomic methods, like affinity assays, are constrained...
          04-04-2024, 04:25 PM

        ad_right_rmr

        Collapse

        News

        Collapse

        Topics Statistics Last Post
        Started by seqadmin, Yesterday, 11:49 AM
        0 responses
        13 views
        0 likes
        Last Post seqadmin  
        Started by seqadmin, 04-24-2024, 08:47 AM
        0 responses
        16 views
        0 likes
        Last Post seqadmin  
        Started by seqadmin, 04-11-2024, 12:08 PM
        0 responses
        61 views
        0 likes
        Last Post seqadmin  
        Started by seqadmin, 04-10-2024, 10:19 PM
        0 responses
        60 views
        0 likes
        Last Post seqadmin  
        Working...
        X