Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • Variant Effect Predictor Line Count

    Hi all,
    I am generating a VCF file and then running the Variant Effect Predictor (VEF) tool on it. This is in return giving me a new text file with the a list of variants and its potential effects. In one of the columns is the effect such as intronic change, exonic etc.

    I wanted to get a line count for the total number of lines which have say for e.e exonic in a certain coulmn of the text file. Ideally, i would like to have a list of all variations and the number of lines for e.g.

    Exonic = 200
    Intronic = 600
    ...
    ...

    If that's too complicated then i could simply have a single entity and run it multiple times.

    Thanks in advance.
    A

  • #2
    grep -v \# VEP_Annotation_File.ann | awk '{print $14}' | awk '{count[$1]++} END {for(j in count) print count[j], j}' | sort -nr

    Works for me - but we have a modified VEP so I'm not sure the column name ($14) is the same in your case.

    Output:

    410 INTRONIC
    277 DOWNSTREAM
    138 UPSTREAM
    119 3PRIME_UTR
    99 WITHIN_NON_CODING_GENE,INTRONIC
    51 INTERGENIC
    46 NMD_TRANSCRIPT,INTRONIC
    42 REGULATORY_REGION
    28 WITHIN_NON_CODING_GENE
    24 NON_SYNONYMOUS_CODING
    15 5PRIME_UTR
    9 SPLICE_SITE,INTRONIC
    4 SYNONYMOUS_CODING
    3 NMD_TRANSCRIPT,3PRIME_UTR
    3 ESSENTIAL_SPLICE_SITE
    2 NMD_TRANSCRIPT,SYNONYMOUS_CODING
    2 CODING_UNKNOWN
    1 STOP_GAINED
    1 SPLICE_SITE,WITHIN_NON_CODING_GENE,INTRONIC
    Last edited by Bukowski; 08-07-2012, 01:04 AM.

    Comment


    • #3
      worked

      hi there,
      thank you, thats awesome, it worked.

      In one of the columns the chromosomal location is mentioned as 1:1000 (for e.g.), can this script be slightly tweaked such that i can get a list based for each chromosome, independant upon the consequence type.

      For e.g.

      chromosome 1 number of consequences
      chromosome 2 number of consequences

      thanks again.
      a
      Last edited by ashkot; 08-07-2012, 03:48 PM.

      Comment


      • #4
        You could just split your input file up into chromosomes and run it over each one couldn't you? It's all trivially achieved with a bit of shell scripting.

        Comment

        Latest Articles

        Collapse

        • seqadmin
          Recent Advances in Sequencing Analysis Tools
          by seqadmin


          The sequencing world is rapidly changing due to declining costs, enhanced accuracies, and the advent of newer, cutting-edge instruments. Equally important to these developments are improvements in sequencing analysis, a process that converts vast amounts of raw data into a comprehensible and meaningful form. This complex task requires expertise and the right analysis tools. In this article, we highlight the progress and innovation in sequencing analysis by reviewing several of the...
          05-06-2024, 07:48 AM
        • seqadmin
          Essential Discoveries and Tools in Epitranscriptomics
          by seqadmin




          The field of epigenetics has traditionally concentrated more on DNA and how changes like methylation and phosphorylation of histones impact gene expression and regulation. However, our increased understanding of RNA modifications and their importance in cellular processes has led to a rise in epitranscriptomics research. “Epitranscriptomics brings together the concepts of epigenetics and gene expression,” explained Adrien Leger, PhD, Principal Research Scientist...
          04-22-2024, 07:01 AM

        ad_right_rmr

        Collapse

        News

        Collapse

        Topics Statistics Last Post
        Started by seqadmin, Today, 06:35 AM
        0 responses
        12 views
        0 likes
        Last Post seqadmin  
        Started by seqadmin, Yesterday, 02:46 PM
        0 responses
        18 views
        0 likes
        Last Post seqadmin  
        Started by seqadmin, 05-07-2024, 06:57 AM
        0 responses
        17 views
        0 likes
        Last Post seqadmin  
        Started by seqadmin, 05-06-2024, 07:17 AM
        0 responses
        19 views
        0 likes
        Last Post seqadmin  
        Working...
        X