Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • Some queries on DNA methylation when processing the BS-seq data

    I saw some papers on DNA methylation using BS-seq,now I meet some questions on DNA methylation when working with the BS-seq data:

    1)first,what is the minimum required threshold of percent methylation at a site to dedine a methylcytosine,I want to use 40%,but I don't know if it was appropriate;

    2)second,somebody mentioned methylation genes in some papers,so how to define a methylated genes,what the criteria is ? Because you know,the raw results of BS-seq is the methylation level at each site,and both of the methylation sites and the C are not even between a gene;

    3)Also,I want to know how to define a differentially methylated region(DMR),this question is in fact the same as my second one because of the uneven methylation cytosine in the chromosomes;and in plant how long (300 bp?) to set the length of DMR in your experiences

    Thanks for your time!
    Best wishes,

  • #2
    I don't think you're going to find definitive answers to your questions because a lot of this is going to vary depending on the exact system you're working with.

    From the data we've seen the majority of cytosines we measured had a methylation percentage either above 80% or below 20%. Our standard filters are therefore 75% and 25%. However we've also gone back to look at regions with a known methylation state and have found that in those regions even if you apply no filtering at all (so set your cutoff at exactly 50%) the number of incorrect calls we make is still very low (below 5%). The errors also tend not to be clustered so the chance of miscalling a region containing several Cs is even lower.

    In terms of identifying DMRs you can either take a purely statistical approach where you look at the proportion of meth vs unmeth in two samples and see if those two groups are significantly different, or you set a cutoff on the amount of change you want to see and then test only regions which pass the initial filter. We took the second approach, which seemed to work out OK. Ideally you don't want to define a size of DMR since they could be of variable length. We've not seen convincing DMRs which were very short though, so you could set a lower cutoff of a couple of hundred bases to remove noise from your results.

    Comment


    • #3
      To simonandrews

      Originally posted by simonandrews View Post
      I don't think you're going to find definitive answers to your questions because a lot of this is going to vary depending on the exact system you're working with.

      From the data we've seen the majority of cytosines we measured had a methylation percentage either above 80% or below 20%. Our standard filters are therefore 75% and 25%. However we've also gone back to look at regions with a known methylation state and have found that in those regions even if you apply no filtering at all (so set your cutoff at exactly 50%) the number of incorrect calls we make is still very low (below 5%). The errors also tend not to be clustered so the chance of miscalling a region containing several Cs is even lower.

      In terms of identifying DMRs you can either take a purely statistical approach where you look at the proportion of meth vs unmeth in two samples and see if those two groups are significantly different, or you set a cutoff on the amount of change you want to see and then test only regions which pass the initial filter. We took the second approach, which seemed to work out OK. Ideally you don't want to define a size of DMR since they could be of variable length. We've not seen convincing DMRs which were very short though, so you could set a lower cutoff of a couple of hundred bases to remove noise from your results.
      Hi,there!
      Thanks for you reply,and I have one question you didn't answer:how to define a methylated gene,or how to define a methylated region.Imaging I have the methylation information (0-1) at each context(CpG,CHG,CHH )for each chromosomes,how to define a methylated region(what the algoritm is) in your experience?
      Thanks again!

      Comment


      • #4
        Methylation level/percentage

        I want to ask what does it mean by methylation level ? is it the percentage of the sequence in whole genome methylated ? or something else.

        Comment


        • #5
          Originally posted by sanamjeet View Post
          I want to ask what does it mean by methylation level ? is it the percentage of the sequence in whole genome methylated ? or something else.
          It will be the percentage of all cytosines which are methylated. In some cases the cytosines will be divided into different contexts (CG CHG CHH etc) and you can quote a methylation level for each context.

          Comment

          Latest Articles

          Collapse

          • seqadmin
            Current Approaches to Protein Sequencing
            by seqadmin


            Proteins are often described as the workhorses of the cell, and identifying their sequences is key to understanding their role in biological processes and disease. Currently, the most common technique used to determine protein sequences is mass spectrometry. While still a valuable tool, mass spectrometry faces several limitations and requires a highly experienced scientist familiar with the equipment to operate it. Additionally, other proteomic methods, like affinity assays, are constrained...
            04-04-2024, 04:25 PM
          • seqadmin
            Strategies for Sequencing Challenging Samples
            by seqadmin


            Despite advancements in sequencing platforms and related sample preparation technologies, certain sample types continue to present significant challenges that can compromise sequencing results. Pedro Echave, Senior Manager of the Global Business Segment at Revvity, explained that the success of a sequencing experiment ultimately depends on the amount and integrity of the nucleic acid template (RNA or DNA) obtained from a sample. “The better the quality of the nucleic acid isolated...
            03-22-2024, 06:39 AM

          ad_right_rmr

          Collapse

          News

          Collapse

          Topics Statistics Last Post
          Started by seqadmin, 04-11-2024, 12:08 PM
          0 responses
          22 views
          0 likes
          Last Post seqadmin  
          Started by seqadmin, 04-10-2024, 10:19 PM
          0 responses
          24 views
          0 likes
          Last Post seqadmin  
          Started by seqadmin, 04-10-2024, 09:21 AM
          0 responses
          19 views
          0 likes
          Last Post seqadmin  
          Started by seqadmin, 04-04-2024, 09:00 AM
          0 responses
          50 views
          0 likes
          Last Post seqadmin  
          Working...
          X