Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • Filtering homozygous variants in vcf file with vcftools

    Hi everybody,

    I'm trying to modify a vcf file to keep only homozygous variants.
    I've tried using vcftools, but i can't get the syntax right.

    I thought using the "keep-filtered" option of vcftools should be the right approach. Also I'm trying to find homozygous variants by the "GT 1/1" tag.

    I've tried
    vcftools --keep-filtered GT:1/1 --vcf in.vcf --out out.vcf
    but it kept none.

    edit:
    vcf file looks like this:
    c130956_g1_i1 50 . A C 5.49 . DP=2;RPB=-9.668049e-01;AF1=0.5049;AC1=1;DP4=1,0,1,0;MQ=33;FQ=-10.5;PV4=1,1,1,1 GT:PL: DP:GQ 0/1:34,0,17:2:22
    c130956_g1_i1 62 . T A 49 . DP=3;VDB=5.867524e-02;AF1=1;AC1=2;DP4=0,0,2,1;MQ=30;FQ=-36 GT:PL: DP:GQ 1/1:81,9,0:3:16




    What is the right syntax here?
    Last edited by Seqql; 06-03-2015, 06:08 AM.

  • #2
    --keep-filtered is for the FILTER column

    to answer this, I would use my tool https://github.com/lindenb/jvarkit/wiki/VCFFilterJS


    java -jar dist/vcffilterjs.jar -e 'variant.getGenotype(0).isHomVar()' file.vcf

    Comment


    • #3
      Thank you, this is looknig very promising.

      I got the filtered vcf file, but i've run into a problem.
      I can't load the file into IGV because of the FILTER field, values like "11,30" aren't accepted. Looks like IGV only accepts one number with/out dots ("11.30" is ok).

      Is there any quick fix for this I could apply to the code? If not i may just delete the whole column.

      Comment


      • #4
        11,30 is generated by my program (?) If yes, this is a locale problem:
        try to set the LANG variable


        $ export LANG=en_US.UTF-8
        $ echo $LANG
        en_US.UTF-8




        or/and (?) specify it on the command line:

        java -Duser.language=en -Duser.country=US (...)
        Last edited by lindenb; 06-03-2015, 08:16 AM.

        Comment


        • #5
          Good thought, it worked.
          Thanks for the help

          Comment


          • #6
            Heterozygous filtering

            Hello everybody,

            can you help me how to filter heterozygous variants in vcf file?

            thank in advance for answer

            with best regards

            werno

            Comment

            Latest Articles

            Collapse

            • seqadmin
              Techniques and Challenges in Conservation Genomics
              by seqadmin



              The field of conservation genomics centers on applying genomics technologies in support of conservation efforts and the preservation of biodiversity. This article features interviews with two researchers who showcase their innovative work and highlight the current state and future of conservation genomics.

              Avian Conservation
              Matthew DeSaix, a recent doctoral graduate from Kristen Ruegg’s lab at The University of Colorado, shared that most of his research...
              03-08-2024, 10:41 AM
            • seqadmin
              The Impact of AI in Genomic Medicine
              by seqadmin



              Artificial intelligence (AI) has evolved from a futuristic vision to a mainstream technology, highlighted by the introduction of tools like OpenAI's ChatGPT and Google's Gemini. In recent years, AI has become increasingly integrated into the field of genomics. This integration has enabled new scientific discoveries while simultaneously raising important ethical questions1. Interviews with two researchers at the center of this intersection provide insightful perspectives into...
              02-26-2024, 02:07 PM

            ad_right_rmr

            Collapse

            News

            Collapse

            Topics Statistics Last Post
            Started by seqadmin, 03-14-2024, 06:13 AM
            0 responses
            34 views
            0 likes
            Last Post seqadmin  
            Started by seqadmin, 03-08-2024, 08:03 AM
            0 responses
            72 views
            0 likes
            Last Post seqadmin  
            Started by seqadmin, 03-07-2024, 08:13 AM
            0 responses
            81 views
            0 likes
            Last Post seqadmin  
            Started by seqadmin, 03-06-2024, 09:51 AM
            0 responses
            68 views
            0 likes
            Last Post seqadmin  
            Working...
            X