Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • MACS Chip Seq Biological Replicates

    Hey All,
    I have 2 biological replicates for a treatment chip seq experiment. When I analyze these 2 replicates independently (using MACS 1.4.2) I get about 3000 peaks identified for both replicates however when I merge them together I only get about 2000. How exactly does merging the replicates together change how MACS identifies peaks?
    Any help would be appreciated
    Leanne

  • #2
    The same 'problem'

    Hi!
    I wonder if you got an answer to your problem with 'loosing' peaks when merging files?
    We are experiencing the same thing.

    Thanks!
    Jenny

    Comment


    • #3
      Hi,
      Unfortunately I did not ever get an answer to that question.

      Sorry, If you end up finding out why you should let me know

      Leanne

      Comment


      • #4
        I'll ChIP in on this one...

        In general, it is not a good idea to combine the reads from biological replicates. One reason is that you lose the ability to distinguish the biological and experimental variability inherent in ChIP experiments. This is similar to the reason you wouldn't combine all the reads from biological replicates in an RNA-seq experiment. Remember we expect ChIP experiments to exhibit even greater variability than RNA!

        The other reason is that it tends to break peak calling, as you discovered. Peak calling involves separating signal (peaks) from noise (background). In a ChIP, the majority of reads are background. So when you combine replicate data, the background levels all get higher and/or more filled in, which makes it in general more difficult to distinguish real peaks. If the replicates lined up cleanly -- for example technical replicates -- the peaks would fill in as well, but the biological and experimental variance means that the peaks just tend to either be a) relatively "lower"/less dense or b) less well-formed (messier "peaks"). The good aspect of this is that any peaks that are able to be identified (your reduced set of 2000) should have a lower false discovery rate.

        For a two-replicate experiment, you can follow the ENCODE guidelines to call peaks separately and combine them into a low-FDR set using the IDR calculation. I'd expect this to give even less than 2000 peaks as it can be quite a stringent test. It may also help to look at the overlap of the two peaksets to see how similar the 3,000 peaks from each replicate are.

        Cheers-
        Rory

        Comment


        • #5
          Ok, at least I know now. Of course I'll let you know if I come across the answer!

          BW

          Jenny

          Comment


          • #6
            Oh, Rory, I only saw you reply after I posted mine.

            Thanks for the info :-)

            BW

            Jenny

            Comment


            • #7
              Hi Rory,

              Could you please tell me how to combine the peaks from replicates into a low-FDR set using the IDR calculation?

              Thanks,

              Irem

              Comment


              • #8
                This page, "ENCODE: TF ChIP-seq peak calling using the Irreproducibility Discovery Rate (IDR) framework" , contains the instructions referenced in the "Software Tools Used to Create the ENCODE Resource" page on the UCSC browser...

                -R

                Comment


                • #9
                  Rory,
                  I looked at the page you mentioned and that program specifically says if you used macs14 that you should not use it to calculate the IDR. Do you know of any programs that can accurately calculate the IDR for peaks found by mac14?

                  Thanks
                  Leanne

                  Comment

                  Latest Articles

                  Collapse

                  • seqadmin
                    Current Approaches to Protein Sequencing
                    by seqadmin


                    Proteins are often described as the workhorses of the cell, and identifying their sequences is key to understanding their role in biological processes and disease. Currently, the most common technique used to determine protein sequences is mass spectrometry. While still a valuable tool, mass spectrometry faces several limitations and requires a highly experienced scientist familiar with the equipment to operate it. Additionally, other proteomic methods, like affinity assays, are constrained...
                    04-04-2024, 04:25 PM
                  • seqadmin
                    Strategies for Sequencing Challenging Samples
                    by seqadmin


                    Despite advancements in sequencing platforms and related sample preparation technologies, certain sample types continue to present significant challenges that can compromise sequencing results. Pedro Echave, Senior Manager of the Global Business Segment at Revvity, explained that the success of a sequencing experiment ultimately depends on the amount and integrity of the nucleic acid template (RNA or DNA) obtained from a sample. “The better the quality of the nucleic acid isolated...
                    03-22-2024, 06:39 AM

                  ad_right_rmr

                  Collapse

                  News

                  Collapse

                  Topics Statistics Last Post
                  Started by seqadmin, 04-11-2024, 12:08 PM
                  0 responses
                  31 views
                  0 likes
                  Last Post seqadmin  
                  Started by seqadmin, 04-10-2024, 10:19 PM
                  0 responses
                  33 views
                  0 likes
                  Last Post seqadmin  
                  Started by seqadmin, 04-10-2024, 09:21 AM
                  0 responses
                  28 views
                  0 likes
                  Last Post seqadmin  
                  Started by seqadmin, 04-04-2024, 09:00 AM
                  0 responses
                  53 views
                  0 likes
                  Last Post seqadmin  
                  Working...
                  X