Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • reduced representation bisulfite seq MspI library prep - non target reads?

    Hi,

    Just got my first testdata from a RRBS library prepared with MspI digestion with the NEXTFLEX® Bisulfite Library Prep Kit for Illumina, and I am have some questions.

    The library was sequenced paired-end on Illumina Miseq. I was expecting that most of my reads had remnants of the MspI cutsite at the start, so I wrote a little script to check that. The expectation is that forward reads should start with 'CGG' or 'TGG', if the first 'C' was methylated or not, respectively. For the reverse reads I was expecting mainly 'CAA', but also a few 'CGA' (unmethylated cytosines were used for end-repair, and I am assuming that bisulfite conversion rate is not 100%), as well as a few 'CAG' or 'CGG' (theoretically possible).
    So, ideally I should have the following combinations (forward - reverse):
    CGG-CAA
    TGG-CAA
    CGG-CGA
    TGG-CGA
    CGG-CAG
    TGG-CAG
    CGG-CGG
    TGG-CGG

    My expections were met in the way that CGG-CAA and TGG-CAA was by far the most common, ~20% of read pairs and ~6% of read pairs, respectively. The other combinations account for less than 1% in total. What puzzles me though is that I only get ~30% read pairs that have any of the above patterns.

    I figured that perhaps some of the fragments I get after the MspI digestion are somehow sheared during the library prep and loose the cutsite on one side, so I relaxed the search criteria to also count read pairs that have the expected pattern only in the forward read OR in the reverse read. An additional ~28% of the read pairs fell into this category.

    Which leaves me with >40% of read pairs that don't have any of the expected patterns, neither in the forward nor the reverse read.

    So, my question: Is this normal? Sure there will be some sequencing error, but the read quality is very good overall and the error rate should be very low at the start of the reads, so that can't affect 40% reads. Am I missing something here?

    I'd really appreciate if anyone could share their experience!

    cheers,
    Christoph

  • #2
    Hi,

    Just reopened this in the Epigenetics section here - think it's perhaps a better fit there - sorry and thanks!

    cheers,
    Christoph

    Comment

    Latest Articles

    Collapse

    • seqadmin
      Current Approaches to Protein Sequencing
      by seqadmin


      Proteins are often described as the workhorses of the cell, and identifying their sequences is key to understanding their role in biological processes and disease. Currently, the most common technique used to determine protein sequences is mass spectrometry. While still a valuable tool, mass spectrometry faces several limitations and requires a highly experienced scientist familiar with the equipment to operate it. Additionally, other proteomic methods, like affinity assays, are constrained...
      04-04-2024, 04:25 PM
    • seqadmin
      Strategies for Sequencing Challenging Samples
      by seqadmin


      Despite advancements in sequencing platforms and related sample preparation technologies, certain sample types continue to present significant challenges that can compromise sequencing results. Pedro Echave, Senior Manager of the Global Business Segment at Revvity, explained that the success of a sequencing experiment ultimately depends on the amount and integrity of the nucleic acid template (RNA or DNA) obtained from a sample. “The better the quality of the nucleic acid isolated...
      03-22-2024, 06:39 AM

    ad_right_rmr

    Collapse

    News

    Collapse

    Topics Statistics Last Post
    Started by seqadmin, 04-11-2024, 12:08 PM
    0 responses
    18 views
    0 likes
    Last Post seqadmin  
    Started by seqadmin, 04-10-2024, 10:19 PM
    0 responses
    22 views
    0 likes
    Last Post seqadmin  
    Started by seqadmin, 04-10-2024, 09:21 AM
    0 responses
    17 views
    0 likes
    Last Post seqadmin  
    Started by seqadmin, 04-04-2024, 09:00 AM
    0 responses
    49 views
    0 likes
    Last Post seqadmin  
    Working...
    X