Unconfigured Ad

Collapse
X
 
  • Filter
  • Time
  • Show
Clear All
new posts
  • Elsie
    Member
    • Mar 2011
    • 85

    Agilent Methyl-Seq - enrichment + BS + sequencing

    Dear all, long time listener first time caller here.
    I've got some Agilent Methyl-Seq data, I'm looking for feedback from anyone with experience with this data - i.e. enrichment for regions of interest, then BS then sequencing. I've been using Bismark (great!, thanks very much Babraham!) and have just given Trim Galore! a whirl. Any tips on parameters used, trimming etc, I've only got about 66% mapping efficiency and would like to improve upon that.
    thanks.
  • fkrueger
    Senior Member
    • Sep 2009
    • 627

    #2
    Hi Elsie,

    66% mapping efficiency doesn't sound too bad, but could you give us some more information about your data such as read length, whether it was single or paired-end, the mapping parameters used or also the number of sequence that did not map at all or mapped ambiguously (this should be stated in the mapping report)?

    Comment

    • Elsie
      Member
      • Mar 2011
      • 85

      #3
      Hi fkrueger,
      thanks for the reply.
      100bp reads, paired-end, default bismark + trim galore other than specifying paired-ends and I used the default directional.
      Sequence pairs with no alignments under any condition: 13154097
      Sequence pairs did not map uniquely: 871370
      Maybe this is as good as it gets? First time with NGS data so I don't have any feeling yet for good/bad data.
      thanks.

      Comment

      • fkrueger
        Senior Member
        • Sep 2009
        • 627

        #4
        Compared to the sequences that did not align at all it seems that only a small number of sequences could did not map uniquely, which shows that you don't seem to have a problem with highly repetitive sequences (which you probably wouldn't expect from a sequence capture). For shotgun BS-Seq data one can typically expect around 75-80% mapping efficiency with 100bp paired-end reads, not quite sure how this figure is affected by the Agilent sequence capture though.

        The most common problems with low mapping efficiency for paired-end sequencing (apart from quality and adapter issues) are either that the sequenced fragments are getting so short that both reads of the pair completely contain each other or that the specified insert size is too small (controlled by the -X parameter, 500bp is the default). Trim Galore has an option '--trim1' to avoid the former case:

        Code:
        -t/--trim1           Trims 1 bp off every read from its 3' end. 
                             This may be needed for FastQ files that 
                             are to be aligned as paired-end data with 
                             Bowtie. This is because Bowtie (1) regards
                             alignments like this:
        
                             R1 --------------------------->
                             R2 <---------------------------
                             
                             as invalid (whenever a start/end coordinate
                             is contained within the other read).
        We have seen in the past that simply inlcuding '--trim1' for 100bp paired-end reads managed to increase the mapping efficiency in a sample with fairly small insert sizes from ~49% to 78%!

        Just as a side note: if you find that a lot of your fragments are overlapping in the middle you should use the methylation_extractor option '--no_overlap' to avoid having a coverage bias in overlapping parts of the read. I hope this helps.

        Comment

        • Elsie
          Member
          • Mar 2011
          • 85

          #5
          Hi Felix,
          thanks very much for your suggestions.
          Just tried --trim1 and now have 0% mapping efficiency!, argh!
          Emailed Agilent to see what they do with this data (their datasheet suggests Bismark), they said try GeneSpring even though GeneSpring cannot yet cope with this sort of data!
          thanks for your help.

          Comment

          • fkrueger
            Senior Member
            • Sep 2009
            • 627

            #6
            Then there seems to be something going very wrong... Could you maybe email me the details (precise commands) of what you have done so I can try and assist you further?

            Comment

            • mariebreen
              Junior Member
              • Feb 2013
              • 3

              #7
              Methyl-Seq Analysis

              Hi all,

              I too am having trouble with Methyl-Seq analysis.

              I was wondering how many samples you multiplexed for Methyl-Seq? I have been trying to pool 4 samples and thought this may be the cause of my problems.

              I am using Bismark but my initial % on target is 7% which I hope cannot be correct!

              Any help would be greatly appreciated,

              Marie

              Comment

              • Elsie
                Member
                • Mar 2011
                • 85

                #8
                Hi mariebreen

                Sorry can't help with the pooling question, I just received the data with minimal background information (it is more a test of the technology than anything else). I found the comments from Felix extremely helpful and basically followed his suggestions/Bismark manual. FYI Genespring can now cope with this sort of data so maybe you could give that a whirl and see what your results look like?

                cheers.

                Comment

                Latest Articles

                Collapse

                • SEQadmin2
                  From Collection to Sequencing: Why Sample Preparation and Preservation Define Sequencing Data
                  by SEQadmin2


                  Data variability is still an issue in sequencing technologies despite the advances in reproducibility and accuracy of these platforms. But the problem does not originate in the sequencing itself, but in the previous steps, before the sample reaches the sequencer.


                  The first step is collection, followed by preservation and sample preparation for analysis. Most scientists overlook those steps, but not being careful might just be skewing the experiment’s results.
                  ...
                  06-02-2026, 10:05 AM
                • SEQadmin2
                  Single-Cell Sequencing at an Inflection Point: Early Impacts of New Platforms and Emerging Trends
                  by SEQadmin2


                  With the launch of new single-cell sequencing platforms in 2026, the field stands at an exciting inflection point. This article surveys the most impactful advances in the field and discusses how they’re reshaping research in cancer, immunology, and beyond.


                  Introduction

                  Single-cell sequencing technologies have undergone remarkable advances over the past decade, transitioning from low-throughput experimental approaches to highly scalable platforms capable of...
                  05-22-2026, 06:42 AM
                • SEQadmin2
                  Environmental Genomics in the Age of NGS: From Microbes to Conservation Strategies
                  by SEQadmin2

                  Studying ecosystems means dealing with complex, multi-species communities that are hard to observe at scale. This complexity, however, hides many important questions to be answered, from how biogeochemical cycles work and how climate change can affect species distribution to how conservation strategies can work best.


                  Genomics, particularly since the expansion of NGS, has transformed ecosystem ecology. By sequencing environmental DNA, we can now assess biodiversity without direct...
                  05-06-2026, 09:04 AM

                ad_right_rmr

                Collapse

                News

                Collapse

                Topics Statistics Last Post
                Started by SEQadmin2, Today, 08:59 AM
                0 responses
                8 views
                0 reactions
                Last Post SEQadmin2  
                Started by SEQadmin2, 06-02-2026, 12:03 PM
                0 responses
                21 views
                0 reactions
                Last Post SEQadmin2  
                Started by SEQadmin2, 06-02-2026, 11:40 AM
                0 responses
                17 views
                0 reactions
                Last Post SEQadmin2  
                Started by SEQadmin2, 05-28-2026, 11:40 AM
                0 responses
                29 views
                0 reactions
                Last Post SEQadmin2  
                Working...