Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • MACS output format (NAME_peaks.bed)

    Hi all,

    I am analyzing a ChIP-Seq output file with TF enrichment peaks from the published dataset. It looks like the data are in the MACS output format. The files are named "NAME_peaks.bed" and the content looks like this:

    chr1 2011287 2011686 . 34 +

    I guess the columns, left to right, are the chromosome name, peak start, peak end, the last one is the strand, but I don't know what is in between. Do you have any ideas what is in the 4th-5th columns? Also, do you have any ideas why the distance between the start and end is so large, and how one would extract the transcription factor binding site location from this broad peak?

    Thanks!
    Last edited by rebrendi; 03-27-2012, 07:29 AM.

  • #2
    It's a BED file, so just read the spec. From the MACS readme file, the 5th column is the "-10*log10pvalue of peak region".

    Comment


    • #3
      dpryan, thank you.

      concerning my second question: is it safe to assume that the TF binding site is located in the middle of the identified broad peak (start+end)/2 ?

      Comment


      • #4
        I wouldn't assume that. You could use something like meme for motif discovery.

        Comment


        • #5
          dpryan, I am not looking for motifs, just need the coordinates for each TF binding site (assuming that each peak contains one binding site). There is no single consensus motif for this TF, that is known. Do you think there is a simple way to use these peak.bed files to detect the coordinates of the binding sites?

          Comment


          • #6
            basic question: you sure your file is a MACS output file?

            Comment


            • #7
              Originally posted by mudshark View Post
              basic question: you sure your file is a MACS output file?
              nope. it is provided as is by the authors

              Comment


              • #8
                so it has been published, any description on how the peak calling was performed? did you ask the authors?
                are the raw data also published? maybe you better do the peak calling yourself then

                Comment


                • #9
                  Originally posted by mudshark View Post
                  so it has been published, any description on how the peak calling was performed? did you ask the authors?
                  are the raw data also published? maybe you better do the peak calling yourself then
                  you are right, have to contact the authors...
                  Last edited by rebrendi; 03-27-2012, 07:31 AM.

                  Comment


                  • #10
                    HI,
                    Once we are done with identifying the peaks for chip seq data, how can we validate those results in wetlab. Since these all are bioinformatics predictions, are there any techniques other than PCR to validate the peaks we called??

                    Comment


                    • #11
                      that all depends on your target, model organism etc etc.
                      reporter gen assays are probably most straight forward in any case.

                      Comment

                      Latest Articles

                      Collapse

                      • seqadmin
                        Current Approaches to Protein Sequencing
                        by seqadmin


                        Proteins are often described as the workhorses of the cell, and identifying their sequences is key to understanding their role in biological processes and disease. Currently, the most common technique used to determine protein sequences is mass spectrometry. While still a valuable tool, mass spectrometry faces several limitations and requires a highly experienced scientist familiar with the equipment to operate it. Additionally, other proteomic methods, like affinity assays, are constrained...
                        04-04-2024, 04:25 PM
                      • seqadmin
                        Strategies for Sequencing Challenging Samples
                        by seqadmin


                        Despite advancements in sequencing platforms and related sample preparation technologies, certain sample types continue to present significant challenges that can compromise sequencing results. Pedro Echave, Senior Manager of the Global Business Segment at Revvity, explained that the success of a sequencing experiment ultimately depends on the amount and integrity of the nucleic acid template (RNA or DNA) obtained from a sample. “The better the quality of the nucleic acid isolated...
                        03-22-2024, 06:39 AM

                      ad_right_rmr

                      Collapse

                      News

                      Collapse

                      Topics Statistics Last Post
                      Started by seqadmin, 04-11-2024, 12:08 PM
                      0 responses
                      24 views
                      0 likes
                      Last Post seqadmin  
                      Started by seqadmin, 04-10-2024, 10:19 PM
                      0 responses
                      25 views
                      0 likes
                      Last Post seqadmin  
                      Started by seqadmin, 04-10-2024, 09:21 AM
                      0 responses
                      21 views
                      0 likes
                      Last Post seqadmin  
                      Started by seqadmin, 04-04-2024, 09:00 AM
                      0 responses
                      52 views
                      0 likes
                      Last Post seqadmin  
                      Working...
                      X