  • mspivakov
    Junior Member
    • Feb 2011
    • 2

    Python API for VCF files

    Hi everyone,

    I wanted to share a prototype Python module for parsing and filtering VCF files. It was tested on pooled files generated by GATK UnifiedGenotyper (i.e., files that contain both pool-level and sample-level fields), but it should need little or no tweaking for use with VCFs generated by other tools.
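
For readers new to the format, here is a minimal standard-library sketch of what parsing and filtering a VCF file involves. It is not the module from this post; the function names `parse_info` and `parse_vcf` and the inline example data are assumptions made for illustration only.

```python
import io

def parse_info(field):
    """Turn 'DP=14;AF=0.5;DB' into {'DP': '14', 'AF': '0.5', 'DB': True}."""
    info = {}
    if field == ".":
        return info
    for item in field.split(";"):
        key, sep, value = item.partition("=")
        info[key] = value if sep else True  # flag entries carry no value
    return info

def parse_vcf(handle):
    """Yield one dict per data line; sample names come from the #CHROM line."""
    samples = []
    for line in handle:
        line = line.rstrip("\n")
        if not line or line.startswith("##"):
            continue  # meta-information lines are skipped in this sketch
        fields = line.split("\t")
        if line.startswith("#CHROM"):
            samples = fields[9:]
            continue
        chrom, pos, vid, ref, alt, qual, flt, info = fields[:8]
        keys = fields[8].split(":") if len(fields) > 8 else []
        yield {
            "CHROM": chrom,
            "POS": int(pos),
            "ID": vid,
            "REF": ref,
            "ALT": alt.split(","),
            "QUAL": None if qual == "." else float(qual),
            "FILTER": flt,
            "INFO": parse_info(info),
            # per-sample fields, keyed by the FORMAT column
            "samples": {
                name: dict(zip(keys, col.split(":")))
                for name, col in zip(samples, fields[9:])
            },
        }

# Made-up two-sample example, not real data.
EXAMPLE = (
    "##fileformat=VCFv4.0\n"
    "#CHROM\tPOS\tID\tREF\tALT\tQUAL\tFILTER\tINFO\tFORMAT\tS1\tS2\n"
    "20\t14370\trs6054257\tG\tA\t29\tPASS\tDP=14;AF=0.5;DB\tGT:DP\t0/1:8\t0/0:6\n"
    "20\t17330\t.\tT\tA\t3\tq10\tDP=11;AF=0.017\tGT:DP\t0/0:5\t0/1:6\n"
)

records = list(parse_vcf(io.StringIO(EXAMPLE)))
# Keep records that passed the caller's filters and have QUAL of at least 20.
passing = [r for r in records if r["FILTER"] == "PASS" and (r["QUAL"] or 0) >= 20]
for r in passing:
    print(r["CHROM"], r["POS"], r["REF"], r["ALT"], r["samples"]["S1"]["GT"])
```

Values are left as strings here; a fuller parser would use the `##INFO` and `##FORMAT` header declarations to convert them to the declared types.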

    The module and a short tutorial can be downloaded from here:


    Please let me know if you find it useful or have any comments/suggestions.

    Cheers,
    Mikhail
    Last edited by mspivakov; 02-04-2011, 01:08 PM.
  • mspivakov
    Junior Member
    • Feb 2011
    • 2

    #2
    Have now fixed a minor bug in VCFfilter.
    Please re-download the module if you've already tried it.

    • casbon
      Junior Member
      • Sep 2011
      • 7

      #3
      I have a Python VCF parser, PyVCF, up on GitHub, if that helps. Check the PyVCF documentation. There is an extensible VCF filter as well.
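
To make "extensible filter" a bit more concrete, here is a small sketch of the general pattern: each filter is a class with a name and a keep/reject test, and new filters are added by subclassing. This is not PyVCF's actual filter API; `RecordFilter`, `MinQual`, `MinDepth` and `apply_filters` are hypothetical names, and the records are plain dicts with made-up values.

```python
class RecordFilter:
    """Hypothetical base class: subclasses set `name` and override `keep`."""
    name = "base"

    def keep(self, record):
        raise NotImplementedError

class MinQual(RecordFilter):
    """Reject records whose QUAL is missing or below a threshold."""
    name = "minqual"

    def __init__(self, threshold):
        self.threshold = threshold

    def keep(self, record):
        return record["QUAL"] is not None and record["QUAL"] >= self.threshold

class MinDepth(RecordFilter):
    """Reject records whose INFO DP value is below a threshold."""
    name = "mindp"

    def __init__(self, threshold):
        self.threshold = threshold

    def keep(self, record):
        return int(record["INFO"].get("DP", 0)) >= self.threshold

def apply_filters(records, filters):
    """Yield (record, failed_names); an empty list means every filter passed."""
    for record in records:
        failed = [f.name for f in filters if not f.keep(record)]
        yield record, failed

# Made-up records standing in for parsed VCF lines.
records = [
    {"POS": 14370, "QUAL": 29.0, "INFO": {"DP": "14"}},
    {"POS": 17330, "QUAL": 3.0, "INFO": {"DP": "11"}},
    {"POS": 1110696, "QUAL": 67.0, "INFO": {"DP": "4"}},
]
results = list(apply_filters(records, [MinQual(20), MinDepth(10)]))
for record, failed in results:
    print(record["POS"], "PASS" if not failed else ";".join(failed))
```

Collecting the names of failed filters, rather than dropping records, mirrors how the VCF FILTER column records which filters a site failed.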

      • casbon
        Junior Member
        • Sep 2011
        • 7

        #4
        Just a quick update: the latest PyVCF is now out as the 0.4 release. http://pyvcf.readthedocs.org/en/late...Y.html#release

        I'm now considering VCF 4.1 and SV support. If you have any experience with these types of data, please help us flesh out the API:
        There is some new metadata in the VCF 4.1 spec, notably contigs. I have added test data and tests, but still need to write the code for this.
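
As an illustration of the contig metadata mentioned above, VCF 4.1 headers can carry structured lines of the form `##contig=<ID=...,length=...>`. The sketch below shows one way such lines could be parsed with the standard library. It is not PyVCF code; `parse_structured_meta` is a hypothetical helper, and the header lines are sample values written for this example.

```python
import re

# ##key=<comma-separated key=value pairs>
META_RE = re.compile(r"^##(?P<key>\w+)=<(?P<body>.*)>$")
# A value is either a double-quoted string (which may contain commas)
# or a run of characters up to the next comma.
PAIR_RE = re.compile(r'(\w+)=("(?:[^"\\]|\\.)*"|[^,]*)')

def parse_structured_meta(line):
    """Return (key, fields) for a '##key=<...>' line, or None otherwise."""
    match = META_RE.match(line.strip())
    if match is None:
        return None
    fields = {}
    for name, value in PAIR_RE.findall(match.group("body")):
        if value.startswith('"'):
            value = value[1:-1]  # drop the surrounding quotes
        fields[name] = value
    return match.group("key"), fields

# Sample header lines for illustration only.
HEADER = [
    "##fileformat=VCFv4.1",
    '##contig=<ID=20,length=62435964,assembly=B36,species="Homo sapiens">',
    "##contig=<ID=21,length=46944323>",
]

contigs = {}
for line in HEADER:
    parsed = parse_structured_meta(line)
    if parsed and parsed[0] == "contig":
        fields = parsed[1]
        length = fields.get("length")  # length is treated as optional here
        contigs[fields["ID"]] = int(length) if length is not None else None

print(contigs)
```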

        • casbon
          Junior Member
          • Sep 2011
          • 7

          #5
          I just released PyVCF 0.5.0. This release brings VCF 4.1 support, including support for structural variation and breakends. It includes many improvements to the VCF writer, a new utility, vcf_melt, and a new set of filters. You can see the full list of changes.

          Thanks very much to @dzerbino, @lennax, @libor-m, @ilyaminkin and @martijnvermaat for help on this release.
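
For anyone unfamiliar with breakends: in VCF 4.1 a breakend ALT allele encodes the mate position between square brackets, in one of four forms (`t[p[`, `t]p]`, `]p]t`, `[p[t`). The sketch below shows one way to pick such a string apart with a regular expression. It is not how PyVCF represents breakends; `parse_breakend` and the returned field names are assumptions made for this example.

```python
import re

# Bases on exactly one side, and a mate position "chrom:pos" between two
# identical brackets.
BND_RE = re.compile(
    r"^(?P<lead>[ACGTNacgtn.]*)"
    r"(?P<open>[\[\]])(?P<chrom>[^\[\]]+):(?P<pos>\d+)(?P<close>[\[\]])"
    r"(?P<trail>[ACGTNacgtn.]*)$"
)

def parse_breakend(alt):
    """Return a dict describing a breakend ALT string, or None if it is not one."""
    match = BND_RE.match(alt)
    if match is None or match.group("open") != match.group("close"):
        return None
    lead, trail = match.group("lead"), match.group("trail")
    if bool(lead) == bool(trail):
        return None  # bases must appear on exactly one side of the brackets
    return {
        "mate_chrom": match.group("chrom"),
        "mate_pos": int(match.group("pos")),
        "bases": lead or trail,
        # True when the local bases come first (forms t[p[ and t]p])
        "ref_first": bool(lead),
        # '[' means the joined piece extends to the right of the mate position
        "mate_extends_right": match.group("open") == "[",
    }

for alt in ["G]17:198982]", "]13:123456]T", "C[2:321682[", "[17:198983[A", "A", "<DEL>"]:
    print(alt, parse_breakend(alt))
```

Ordinary alleles such as `A` and symbolic alleles such as `<DEL>` do not match and return `None`, so a caller can fall back to its normal ALT handling.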

          • gourneau
            Junior Member
            • Jan 2011
            • 2

            #6
            Thanks for the lib, casbon.

            • FrankiB
              Member
              • Dec 2013
              • 23

              #7
              Does it work with Python 3.3? If so, where can I find installation instructions?
              Last edited by FrankiB; 04-07-2014, 11:32 AM.

              • TiborNagy
                Senior Member
                • Mar 2010
                • 329

                #8
                If you have setuptools installed, you can use:
                python setup.py install
