Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • Some queries on DNA methylation when processing the BS-seq data

    I saw some papers on DNA methylation using BS-seq,now I meet some questions on DNA methylation when working with the BS-seq data:

    1)first,what is the minimum required threshold of percent methylation at a site to dedine a methylcytosine,I want to use 40%,but I don't know if it was appropriate;

    2)second,somebody mentioned methylation genes in some papers,so how to define a methylated genes,what the criteria is ? Because you know,the raw results of BS-seq is the methylation level at each site,and both of the methylation sites and the C are not even between a gene;

    3)Also,I want to know how to define a differentially methylated region(DMR),this question is in fact the same as my second one because of the uneven methylation cytosine in the chromosomes;and in plant how long (300 bp?) to set the length of DMR in your experiences

    Thanks for your time!
    Best wishes,

  • #2
    I don't think you're going to find definitive answers to your questions because a lot of this is going to vary depending on the exact system you're working with.

    From the data we've seen the majority of cytosines we measured had a methylation percentage either above 80% or below 20%. Our standard filters are therefore 75% and 25%. However we've also gone back to look at regions with a known methylation state and have found that in those regions even if you apply no filtering at all (so set your cutoff at exactly 50%) the number of incorrect calls we make is still very low (below 5%). The errors also tend not to be clustered so the chance of miscalling a region containing several Cs is even lower.

    In terms of identifying DMRs you can either take a purely statistical approach where you look at the proportion of meth vs unmeth in two samples and see if those two groups are significantly different, or you set a cutoff on the amount of change you want to see and then test only regions which pass the initial filter. We took the second approach, which seemed to work out OK. Ideally you don't want to define a size of DMR since they could be of variable length. We've not seen convincing DMRs which were very short though, so you could set a lower cutoff of a couple of hundred bases to remove noise from your results.

    Comment


    • #3
      To simonandrews

      Originally posted by simonandrews View Post
      I don't think you're going to find definitive answers to your questions because a lot of this is going to vary depending on the exact system you're working with.

      From the data we've seen the majority of cytosines we measured had a methylation percentage either above 80% or below 20%. Our standard filters are therefore 75% and 25%. However we've also gone back to look at regions with a known methylation state and have found that in those regions even if you apply no filtering at all (so set your cutoff at exactly 50%) the number of incorrect calls we make is still very low (below 5%). The errors also tend not to be clustered so the chance of miscalling a region containing several Cs is even lower.

      In terms of identifying DMRs you can either take a purely statistical approach where you look at the proportion of meth vs unmeth in two samples and see if those two groups are significantly different, or you set a cutoff on the amount of change you want to see and then test only regions which pass the initial filter. We took the second approach, which seemed to work out OK. Ideally you don't want to define a size of DMR since they could be of variable length. We've not seen convincing DMRs which were very short though, so you could set a lower cutoff of a couple of hundred bases to remove noise from your results.
      Hi,there!
      Thanks for you reply,and I have one question you didn't answer:how to define a methylated gene,or how to define a methylated region.Imaging I have the methylation information (0-1) at each context(CpG,CHG,CHH )for each chromosomes,how to define a methylated region(what the algoritm is) in your experience?
      Thanks again!

      Comment


      • #4
        Methylation level/percentage

        I want to ask what does it mean by methylation level ? is it the percentage of the sequence in whole genome methylated ? or something else.

        Comment


        • #5
          Originally posted by sanamjeet View Post
          I want to ask what does it mean by methylation level ? is it the percentage of the sequence in whole genome methylated ? or something else.
          It will be the percentage of all cytosines which are methylated. In some cases the cytosines will be divided into different contexts (CG CHG CHH etc) and you can quote a methylation level for each context.

          Comment

          Latest Articles

          Collapse

          • seqadmin
            Essential Discoveries and Tools in Epitranscriptomics
            by seqadmin


            The field of epigenetics has traditionally concentrated more on DNA and how changes like methylation and phosphorylation of histones impact gene expression and regulation. However, our increased understanding of RNA modifications and their importance in cellular processes has led to a rise in epitranscriptomics research. “Epitranscriptomics brings together the concepts of epigenetics and gene expression,” explained Adrien Leger, PhD, Principal Research Scientist on Modified Bases...
            Yesterday, 07:01 AM
          • seqadmin
            Current Approaches to Protein Sequencing
            by seqadmin


            Proteins are often described as the workhorses of the cell, and identifying their sequences is key to understanding their role in biological processes and disease. Currently, the most common technique used to determine protein sequences is mass spectrometry. While still a valuable tool, mass spectrometry faces several limitations and requires a highly experienced scientist familiar with the equipment to operate it. Additionally, other proteomic methods, like affinity assays, are constrained...
            04-04-2024, 04:25 PM

          ad_right_rmr

          Collapse

          News

          Collapse

          Topics Statistics Last Post
          Started by seqadmin, 04-11-2024, 12:08 PM
          0 responses
          39 views
          0 likes
          Last Post seqadmin  
          Started by seqadmin, 04-10-2024, 10:19 PM
          0 responses
          41 views
          0 likes
          Last Post seqadmin  
          Started by seqadmin, 04-10-2024, 09:21 AM
          0 responses
          35 views
          0 likes
          Last Post seqadmin  
          Started by seqadmin, 04-04-2024, 09:00 AM
          0 responses
          55 views
          0 likes
          Last Post seqadmin  
          Working...
          X