Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • MACS output format (NAME_peaks.bed)

    Hi all,

    I am analyzing a ChIP-Seq output file with TF enrichment peaks from the published dataset. It looks like the data are in the MACS output format. The files are named "NAME_peaks.bed" and the content looks like this:

    chr1 2011287 2011686 . 34 +

    I guess the columns, left to right, are the chromosome name, peak start, peak end, the last one is the strand, but I don't know what is in between. Do you have any ideas what is in the 4th-5th columns? Also, do you have any ideas why the distance between the start and end is so large, and how one would extract the transcription factor binding site location from this broad peak?

    Thanks!
    Last edited by rebrendi; 03-27-2012, 07:29 AM.

  • #2
    It's a BED file, so just read the spec. From the MACS readme file, the 5th column is the "-10*log10pvalue of peak region".

    Comment


    • #3
      dpryan, thank you.

      concerning my second question: is it safe to assume that the TF binding site is located in the middle of the identified broad peak (start+end)/2 ?

      Comment


      • #4
        I wouldn't assume that. You could use something like meme for motif discovery.

        Comment


        • #5
          dpryan, I am not looking for motifs, just need the coordinates for each TF binding site (assuming that each peak contains one binding site). There is no single consensus motif for this TF, that is known. Do you think there is a simple way to use these peak.bed files to detect the coordinates of the binding sites?

          Comment


          • #6
            basic question: you sure your file is a MACS output file?

            Comment


            • #7
              Originally posted by mudshark View Post
              basic question: you sure your file is a MACS output file?
              nope. it is provided as is by the authors

              Comment


              • #8
                so it has been published, any description on how the peak calling was performed? did you ask the authors?
                are the raw data also published? maybe you better do the peak calling yourself then

                Comment


                • #9
                  Originally posted by mudshark View Post
                  so it has been published, any description on how the peak calling was performed? did you ask the authors?
                  are the raw data also published? maybe you better do the peak calling yourself then
                  you are right, have to contact the authors...
                  Last edited by rebrendi; 03-27-2012, 07:31 AM.

                  Comment


                  • #10
                    HI,
                    Once we are done with identifying the peaks for chip seq data, how can we validate those results in wetlab. Since these all are bioinformatics predictions, are there any techniques other than PCR to validate the peaks we called??

                    Comment


                    • #11
                      that all depends on your target, model organism etc etc.
                      reporter gen assays are probably most straight forward in any case.

                      Comment

                      Latest Articles

                      Collapse

                      • seqadmin
                        Advancing Precision Medicine for Rare Diseases in Children
                        by seqadmin




                        Many organizations study rare diseases, but few have a mission as impactful as Rady Children’s Institute for Genomic Medicine (RCIGM). “We are all about changing outcomes for children,” explained Dr. Stephen Kingsmore, President and CEO of the group. The institute’s initial goal was to provide rapid diagnoses for critically ill children and shorten their diagnostic odyssey, a term used to describe the long and arduous process it takes patients to obtain an accurate...
                        12-16-2024, 07:57 AM
                      • seqadmin
                        Recent Advances in Sequencing Technologies
                        by seqadmin



                        Innovations in next-generation sequencing technologies and techniques are driving more precise and comprehensive exploration of complex biological systems. Current advancements include improved accessibility for long-read sequencing and significant progress in single-cell and 3D genomics. This article explores some of the most impactful developments in the field over the past year.

                        Long-Read Sequencing
                        Long-read sequencing has seen remarkable advancements,...
                        12-02-2024, 01:49 PM

                      ad_right_rmr

                      Collapse

                      News

                      Collapse

                      Topics Statistics Last Post
                      Started by seqadmin, 12-17-2024, 10:28 AM
                      0 responses
                      27 views
                      0 likes
                      Last Post seqadmin  
                      Started by seqadmin, 12-13-2024, 08:24 AM
                      0 responses
                      43 views
                      0 likes
                      Last Post seqadmin  
                      Started by seqadmin, 12-12-2024, 07:41 AM
                      0 responses
                      29 views
                      0 likes
                      Last Post seqadmin  
                      Started by seqadmin, 12-11-2024, 07:45 AM
                      0 responses
                      42 views
                      0 likes
                      Last Post seqadmin  
                      Working...
                      X