Seqanswers Leaderboard Ad

Collapse
X
 
  • Filter
  • Time
  • Show
Clear All
new posts
  • crazyhottommy
    Senior Member
    • Apr 2012
    • 187

    How to make an average conservation plot from ChIP-seq data

    Hi all,

    I want to know how to make an average conservation plot
    http://ceas.cbi.pku.edu.cn/ this website can make something I want, but it can only plot
    a set of regions at one time, I want to plot two sets of regions, and compare them.

    " 2. GC content and evolutionary conservation of each ChIP-region and their
    average. CEAS uses PhastCons conservation scores from UCSC Genome
    Bioinformatics, which is based on multiz alignment of human, chimp, mouse,
    rat, dog, chicken, fugu, and zebrafish genomic DNA. CEAS generates thumbnail
    conservation plot for each ChIP-region and the average conservation plot for
    all the ChIP-regions, which can be directly used in ChIP-chip biologists'
    manuscript.
    "


    Any python scripts or bioconductor package can do it?

    Thanks
  • paolo.kunder
    Member
    • Aug 2011
    • 93

    #2
    Hi,

    I encountered the same problem few weeks ago,

    It seems that there are not many "expert" regarding PhastCons in this forum.

    I explain you what I did,

    I took my Chip-seq regions, in bed file format, intersect them with PhastCons element in (mouse example)

    Code:
    http://hgdownload-test.cse.ucsc.edu/goldenPath/mm9/phastCons30way/

    and calculated for each genomic position the score of conservation.
    Then averaged all the score.

    Consider that you will need to transform a bit the PhastCons file format,

    for example:

    original format ( just replaced some words with empty)
    Code:
    chrom=chr1 start=3000306
    0.006
    0.010
    0.014
    chrom=chrX start=40000306
    0.014
    chrom=chr9 start=80000306 
    0.1
    0.2
    processed format
    Code:
    chr1 3000306 3000307 0.006
    chr1 3000307 3000308 0.010
    chr1 3000308 3000309 0.019
    chrX 40000306 40000307 0.014
    chr9 80000306 80000307 0.1
    chr9 80000307 80000308 0.2
    with the following script:
    Code:
    awk '/^chrom/{split($1,a,"=");split($2,b,"=");next} { printf "%s\t%10d\t%10d\t%f\n",a[2],b[2],b[2]+1,$1;b[2]++}' filename


    Another possibility that I am investigating is using circos
    HTML Code:
    http://circos.ca/
    but You will need to study first how the software works,

    Cheers,
    Paolo

    Comment

    • crazyhottommy
      Senior Member
      • Apr 2012
      • 187

      #3
      Thank you so much!!



      Originally posted by paolo.kunder View Post
      Hi,

      I encountered the same problem few weeks ago,

      It seems that there are not many "expert" regarding PhastCons in this forum.

      I explain you what I did,

      I took my Chip-seq regions, in bed file format, intersect them with PhastCons element in (mouse example)

      Code:
      http://hgdownload-test.cse.ucsc.edu/goldenPath/mm9/phastCons30way/

      and calculated for each genomic position the score of conservation.
      Then averaged all the score.

      Consider that you will need to transform a bit the PhastCons file format,

      for example:

      original format ( just replaced some words with empty)
      Code:
      chrom=chr1 start=3000306
      0.006
      0.010
      0.014
      chrom=chrX start=40000306
      0.014
      chrom=chr9 start=80000306 
      0.1
      0.2
      processed format
      Code:
      chr1 3000306 3000307 0.006
      chr1 3000307 3000308 0.010
      chr1 3000308 3000309 0.019
      chrX 40000306 40000307 0.014
      chr9 80000306 80000307 0.1
      chr9 80000307 80000308 0.2
      with the following script:
      Code:
      awk '/^chrom/{split($1,a,"=");split($2,b,"=");next} { printf "%s\t%10d\t%10d\t%f\n",a[2],b[2],b[2]+1,$1;b[2]++}' filename


      Another possibility that I am investigating is using circos
      HTML Code:
      http://circos.ca/
      but You will need to study first how the software works,

      Cheers,
      Paolo

      Comment

      • szaman
        Junior Member
        • Jul 2015
        • 1

        #4
        Making the step size 200 bp

        Hello Paolo,

        Your code is extraordinarily useful, however, I am trying to make the step size 200 bp and while doing so, I want to average the phastCon score over each 200 bp region. I am not even sure how to get started (really not even sure which language to use, I only know rudimentary unix, perl, & R). As much as I love one-liner codes, I do not think that it possible in this situation. I am working with hg38.phastCons20way.wigFix (phastcon scores calculates for human genome wide based on 19 species alignments) and it is a huge file. Any guidance would be much appreciated.

        Comment

        Latest Articles

        Collapse

        • seqadmin
          Pathogen Surveillance with Advanced Genomic Tools
          by seqadmin




          The COVID-19 pandemic highlighted the need for proactive pathogen surveillance systems. As ongoing threats like avian influenza and newly emerging infections continue to pose risks, researchers are working to improve how quickly and accurately pathogens can be identified and tracked. In a recent SEQanswers webinar, two experts discussed how next-generation sequencing (NGS) and machine learning are shaping efforts to monitor viral variation and trace the origins of infectious...
          03-24-2025, 11:48 AM
        • seqadmin
          New Genomics Tools and Methods Shared at AGBT 2025
          by seqadmin


          This year’s Advances in Genome Biology and Technology (AGBT) General Meeting commemorated the 25th anniversary of the event at its original venue on Marco Island, Florida. While this year’s event didn’t include high-profile musical performances, the industry announcements and cutting-edge research still drew the attention of leading scientists.

          The Headliner
          The biggest announcement was Roche stepping back into the sequencing platform market. In the years since...
          03-03-2025, 01:39 PM

        ad_right_rmr

        Collapse

        News

        Collapse

        Topics Statistics Last Post
        Started by seqadmin, 03-20-2025, 05:03 AM
        0 responses
        46 views
        0 reactions
        Last Post seqadmin  
        Started by seqadmin, 03-19-2025, 07:27 AM
        0 responses
        56 views
        0 reactions
        Last Post seqadmin  
        Started by seqadmin, 03-18-2025, 12:50 PM
        0 responses
        47 views
        0 reactions
        Last Post seqadmin  
        Started by seqadmin, 03-03-2025, 01:15 PM
        0 responses
        197 views
        0 reactions
        Last Post seqadmin  
        Working...