Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • Bismark Methyl-seq Analysis

    We have sequenced a genome using Illumina's True-seq bisulfite sequencing kit. After getting back the seq, we are analyzing methylation rate using Bismark. I Need help with the interpretation of the result and proper way of normalization.

    Before sequencing: Sample DNA was divided into 2 groups:
    1. Bisulfite treatment was carried out and DNA was subsequently sequenced (group 1, methylated group)
    2. DNA was sequenced without bisulfite treatment (group 2, control group)

    Both group was sequenced in paired-end fashion.

    I am using Bismark to analyze the seq and trying to get the methylation rate in this particular genome. After running Bismark on Methylated files I got this finale percentages:

    C methylated in CpG context: 0.6%

    C methylated in CHG context: 0.5%

    C methylated in CHH context: 0.7%

    Whereas after running Bismark on my Control files I got these percentages:

    C methylated in CpG context: 99.6%

    C methylated in CHG context: 99.3%

    C methylated in CHH context: 99.9%

    So, how would I interpret my data?

    a. Is 0.6 % (CpG) the actual methylation percentage in my genome?

    b. I have found in some literatures that if CpG, CHG, and CHH percentages are very close, that means that genome actually does not do methylation. Is it true?

    c. What was the purpose of using the control group (group 2)? Do I still need any spike-in control to normalize the data? If so, what that could be?

    Thank you very much for reading this long post!!

    Bests!!!

  • #2
    Hi Juulluu,

    Just briefly before answering your question, have you trimmed off 7-8bp from the 5’ end of reads as is recommended for TruSeq libraries? If not I would recommend doing so, please see also here:
    A tool to map bisulfite converted sequence reads and determine cytosine methylation states - File not found · FelixKrueger/Bismark


    The statistics being output at the end of a Bismark run are calculated from every single methylation call that has been performed during the entire run. While the number can be used as a proxy to what the average genome methylation levels are, this comes with the caveat that if you had some kind of overrepresented sequence in there, e.g. a certain class of repeats or the like, it might get a larger share towards the total average than the rest of genome. It would probably be more informative to calculate the percentage methylation over shorter stretches of your genome (e.g. using a fixed number of CpG residues or, more crudely, maybe 3-5kb running windows) and look at a bean plot representation of your genome. Just out of interest, which organism was it?

    But yes, from the numbers you are showing one could already say that:
    a) The overall methylation levels in your genome appear to be quite low. There still is a possibility that there is some localised methylation at specific loci in your genome (e.g. promoter regions), but you can’t tell this from a total average number but need to start visualising and exploring your data in more detail
    b) Good bisulfite conversion is normally > 99% efficicient. Since you are seeing total methylation levels between 0.5 and 0.7% you can say that the overall conversion must have been at least 99.5% efficient
    c) The methylated residues (0.5-0.7%) may either be true genomic methylation or a bisufulfite conversion error, but more likely a mix of the two. In other words the true methylation level may be even lower than that.
    d) Well, the three contexts appear to be very similar, so my first impression would be that there is at least discernible increase in CpG methylation which you see in mammals
    e) I am not sure why you included a non-bisulfite treated control. Not converted DNA should ideally come back as 100% methylated, which is nearly true. The fact that they are not exactly 100% can probably be explained by mis-alignments, or alignments of reads to regions in the genome that are not in the genome-build but were found in the library. It also shows that even here you are getting (conversion error) values between 0.1 and 0.7%, which is a very similar range as you real methylation data predicts. So unless you can convincingly show that there is some true, more localised methylation going on in your genome it will be a little difficult to argue that the methylation you are seeing is not within the expected noise.

    Comment

    Latest Articles

    Collapse

    • seqadmin
      Strategies for Sequencing Challenging Samples
      by seqadmin


      Despite advancements in sequencing platforms and related sample preparation technologies, certain sample types continue to present significant challenges that can compromise sequencing results. Pedro Echave, Senior Manager of the Global Business Segment at Revvity, explained that the success of a sequencing experiment ultimately depends on the amount and integrity of the nucleic acid template (RNA or DNA) obtained from a sample. “The better the quality of the nucleic acid isolated...
      03-22-2024, 06:39 AM
    • seqadmin
      Techniques and Challenges in Conservation Genomics
      by seqadmin



      The field of conservation genomics centers on applying genomics technologies in support of conservation efforts and the preservation of biodiversity. This article features interviews with two researchers who showcase their innovative work and highlight the current state and future of conservation genomics.

      Avian Conservation
      Matthew DeSaix, a recent doctoral graduate from Kristen Ruegg’s lab at The University of Colorado, shared that most of his research...
      03-08-2024, 10:41 AM

    ad_right_rmr

    Collapse

    News

    Collapse

    Topics Statistics Last Post
    Started by seqadmin, Yesterday, 06:37 PM
    0 responses
    8 views
    0 likes
    Last Post seqadmin  
    Started by seqadmin, Yesterday, 06:07 PM
    0 responses
    8 views
    0 likes
    Last Post seqadmin  
    Started by seqadmin, 03-22-2024, 10:03 AM
    0 responses
    49 views
    0 likes
    Last Post seqadmin  
    Started by seqadmin, 03-21-2024, 07:32 AM
    0 responses
    67 views
    0 likes
    Last Post seqadmin  
    Working...
    X