Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • ChIP-Seq Integration & Interpretation

    Hi. I have three ChIP-Seq data for the same sample. Peak calling was done separately. How to combine the peaks of these three experiments together?

    One more question, How to associate genes with these binding sites?

    Thanks in advance!

  • #2
    these are technical replicates, right? So you could simply do the peak calling on the combined data. If you want to do the peak calling first the easiest - maybe not the best - way is to define an overlapping set of peaks.

    About the association, there are some tools around that can help you (i guess Galaxy). On the other hand that is not a too complicated job, isn't it? Calculating the distances to some gene features and stuff.

    Comment


    • #3
      I think this is what you are looking for:


      There are some other ideas on my blog at:
      Everything Ethan knows about biology - The Ethan-ome


      On the first question, I agree combining the data is a good way to go. You can also get the common peaks between the samples by using the BedTools intersectBed program. It is also available on Galaxy if you are not comfortable with the command line.
      --------------
      Ethan

      Comment


      • #4
        Are the replicates purely technical (mutliple lanes of the same library) or was the experiment repeated three times (three different libraries)? This makes a big difference. If it is just three lanes of the same library, then I agree the data should be merged and peaks called once. However without replicating the experiment, you will never be able to get reliable results -- you have no clue as to the variation inherent in the experiment, and whether your library represents an outlier.

        If the experiment was repeated three times, then you should not combine the reads, but rather work with the three peaksets. Start by checking their overlaps (3-way venn diagram). To minimize false positives, you could take only the peaks that all three replicates agree on, or be less stringent and take those that overlap in at least two of the three libraries. Another approach is to take all the peaks, then go back to the aligned reads and re-score each peak for each replicate (whether or not it was identified as a peak in that replicate).

        See also this paper for ideas on how to combine peaksets: http://projecteuclid.org/DPubS?servi...oas/1318514284

        If you are using R and Bioconductor, there are a number of packages that can help with this type of analysis. For example DiffBind helps with overlap and re-scoring, and ChIPpeakAnno will map peaks to genes.
        Last edited by rory; 12-02-2011, 07:51 AM. Reason: typo

        Comment

        Latest Articles

        Collapse

        • seqadmin
          Essential Discoveries and Tools in Epitranscriptomics
          by seqadmin




          The field of epigenetics has traditionally concentrated more on DNA and how changes like methylation and phosphorylation of histones impact gene expression and regulation. However, our increased understanding of RNA modifications and their importance in cellular processes has led to a rise in epitranscriptomics research. “Epitranscriptomics brings together the concepts of epigenetics and gene expression,” explained Adrien Leger, PhD, Principal Research Scientist...
          04-22-2024, 07:01 AM
        • seqadmin
          Current Approaches to Protein Sequencing
          by seqadmin


          Proteins are often described as the workhorses of the cell, and identifying their sequences is key to understanding their role in biological processes and disease. Currently, the most common technique used to determine protein sequences is mass spectrometry. While still a valuable tool, mass spectrometry faces several limitations and requires a highly experienced scientist familiar with the equipment to operate it. Additionally, other proteomic methods, like affinity assays, are constrained...
          04-04-2024, 04:25 PM

        ad_right_rmr

        Collapse

        News

        Collapse

        Topics Statistics Last Post
        Started by seqadmin, Yesterday, 11:49 AM
        0 responses
        15 views
        0 likes
        Last Post seqadmin  
        Started by seqadmin, 04-24-2024, 08:47 AM
        0 responses
        16 views
        0 likes
        Last Post seqadmin  
        Started by seqadmin, 04-11-2024, 12:08 PM
        0 responses
        61 views
        0 likes
        Last Post seqadmin  
        Started by seqadmin, 04-10-2024, 10:19 PM
        0 responses
        60 views
        0 likes
        Last Post seqadmin  
        Working...
        X