Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • Filtering homozygous variants in vcf file with vcftools

    Hi everybody,

    I'm trying to modify a vcf file to keep only homozygous variants.
    I've tried using vcftools, but i can't get the syntax right.

    I thought using the "keep-filtered" option of vcftools should be the right approach. Also I'm trying to find homozygous variants by the "GT 1/1" tag.

    I've tried
    vcftools --keep-filtered GT:1/1 --vcf in.vcf --out out.vcf
    but it kept none.

    edit:
    vcf file looks like this:
    c130956_g1_i1 50 . A C 5.49 . DP=2;RPB=-9.668049e-01;AF1=0.5049;AC1=1;DP4=1,0,1,0;MQ=33;FQ=-10.5;PV4=1,1,1,1 GT:PL: DP:GQ 0/1:34,0,17:2:22
    c130956_g1_i1 62 . T A 49 . DP=3;VDB=5.867524e-02;AF1=1;AC1=2;DP4=0,0,2,1;MQ=30;FQ=-36 GT:PL: DP:GQ 1/1:81,9,0:3:16




    What is the right syntax here?
    Last edited by Seqql; 06-03-2015, 06:08 AM.

  • #2
    --keep-filtered is for the FILTER column

    to answer this, I would use my tool https://github.com/lindenb/jvarkit/wiki/VCFFilterJS


    java -jar dist/vcffilterjs.jar -e 'variant.getGenotype(0).isHomVar()' file.vcf

    Comment


    • #3
      Thank you, this is looknig very promising.

      I got the filtered vcf file, but i've run into a problem.
      I can't load the file into IGV because of the FILTER field, values like "11,30" aren't accepted. Looks like IGV only accepts one number with/out dots ("11.30" is ok).

      Is there any quick fix for this I could apply to the code? If not i may just delete the whole column.

      Comment


      • #4
        11,30 is generated by my program (?) If yes, this is a locale problem:
        try to set the LANG variable


        $ export LANG=en_US.UTF-8
        $ echo $LANG
        en_US.UTF-8




        or/and (?) specify it on the command line:

        java -Duser.language=en -Duser.country=US (...)
        Last edited by lindenb; 06-03-2015, 08:16 AM.

        Comment


        • #5
          Good thought, it worked.
          Thanks for the help

          Comment


          • #6
            Heterozygous filtering

            Hello everybody,

            can you help me how to filter heterozygous variants in vcf file?

            thank in advance for answer

            with best regards

            werno

            Comment

            Latest Articles

            Collapse

            • seqadmin
              Essential Discoveries and Tools in Epitranscriptomics
              by seqadmin




              The field of epigenetics has traditionally concentrated more on DNA and how changes like methylation and phosphorylation of histones impact gene expression and regulation. However, our increased understanding of RNA modifications and their importance in cellular processes has led to a rise in epitranscriptomics research. “Epitranscriptomics brings together the concepts of epigenetics and gene expression,” explained Adrien Leger, PhD, Principal Research Scientist...
              Yesterday, 07:01 AM
            • seqadmin
              Current Approaches to Protein Sequencing
              by seqadmin


              Proteins are often described as the workhorses of the cell, and identifying their sequences is key to understanding their role in biological processes and disease. Currently, the most common technique used to determine protein sequences is mass spectrometry. While still a valuable tool, mass spectrometry faces several limitations and requires a highly experienced scientist familiar with the equipment to operate it. Additionally, other proteomic methods, like affinity assays, are constrained...
              04-04-2024, 04:25 PM

            ad_right_rmr

            Collapse

            News

            Collapse

            Topics Statistics Last Post
            Started by seqadmin, 04-11-2024, 12:08 PM
            0 responses
            57 views
            0 likes
            Last Post seqadmin  
            Started by seqadmin, 04-10-2024, 10:19 PM
            0 responses
            53 views
            0 likes
            Last Post seqadmin  
            Started by seqadmin, 04-10-2024, 09:21 AM
            0 responses
            45 views
            0 likes
            Last Post seqadmin  
            Started by seqadmin, 04-04-2024, 09:00 AM
            0 responses
            55 views
            0 likes
            Last Post seqadmin  
            Working...
            X