  • mspivakov
    Junior Member
    • Feb 2011
    • 2

    Python API for VCF files

    Hi everyone,

    I wanted to share a prototype Python module for parsing and filtering VCF files. It was tested on pooled files generated by GATK UnifiedGenotyper (i.e., files that contain both pool-level and sample-level fields), but it should need little or no tweaking for use with VCFs generated by other tools.

    The module and a short tutorial can be downloaded from here:


    Please let me know if you find it useful or have any comments/suggestions.

    Cheers,
    Mikhail
    Last edited by mspivakov; 02-04-2011, 01:08 PM.
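For readers new to the format, here is a minimal sketch of what parsing and filtering a VCF involves, using only the Python standard library. It is not the API of the module announced above; `parse_info`, `parse_vcf`, `filter_by_qual` and the example data are hypothetical names and values chosen for illustration, and the code assumes tab-separated, well-formed input.

```python
import io

FIXED_COLUMNS = ["CHROM", "POS", "ID", "REF", "ALT", "QUAL", "FILTER", "INFO"]


def parse_info(text):
    """Turn 'DP=14;AF=0.5;DB' into {'DP': '14', 'AF': '0.5', 'DB': True}."""
    info = {}
    if text == ".":
        return info
    for item in text.split(";"):
        key, sep, value = item.partition("=")
        info[key] = value if sep else True  # flags have no '=value' part
    return info


def parse_vcf(handle):
    """Yield one dict per data line of a VCF stream (hypothetical helper)."""
    samples = []
    for line in handle:
        line = line.rstrip("\n")
        if not line or line.startswith("##"):
            continue  # skip blank lines and meta-information lines
        if line.startswith("#CHROM"):
            samples = line.split("\t")[9:]  # sample names follow FORMAT
            continue
        cols = line.split("\t")
        record = dict(zip(FIXED_COLUMNS, cols[:8]))
        record["POS"] = int(record["POS"])
        record["QUAL"] = None if record["QUAL"] == "." else float(record["QUAL"])
        record["INFO"] = parse_info(record["INFO"])  # site-level fields
        keys = cols[8].split(":") if len(cols) > 8 else []
        record["samples"] = {  # sample-level fields, keyed by sample name
            name: dict(zip(keys, call.split(":")))
            for name, call in zip(samples, cols[9:])
        }
        yield record


def filter_by_qual(records, min_qual):
    """Keep records whose QUAL is present and at least min_qual."""
    return [r for r in records if r["QUAL"] is not None and r["QUAL"] >= min_qual]


# Made-up two-record example, not real data.
EXAMPLE = (
    "##fileformat=VCFv4.0\n"
    "#CHROM\tPOS\tID\tREF\tALT\tQUAL\tFILTER\tINFO\tFORMAT\tS1\tS2\n"
    "20\t14370\trs6054257\tG\tA\t29\tPASS\tDP=14;AF=0.5;DB\tGT:DP\t0/1:8\t0/0:6\n"
    "20\t17330\t.\tT\tA\t3\tq10\tDP=11;AF=0.017\tGT:DP\t0/0:5\t0/1:6\n"
)

records = list(parse_vcf(io.StringIO(EXAMPLE)))
kept = filter_by_qual(records, min_qual=10)
print([(r["CHROM"], r["POS"]) for r in kept])  # prints [('20', 14370)]
```

Values are kept as strings except POS and QUAL; a real parser would use the header's type declarations to convert INFO and FORMAT fields.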
  • mspivakov
    Junior Member
    • Feb 2011
    • 2

    #2
    I have now fixed a minor bug in VCFfilter.
    Please re-download the module if you've already tried it.


    • casbon
      Junior Member
      • Sep 2011
      • 7

      #3
      I have a Python VCF parser up on GitHub, if that helps. Check the PyVCF documentation. There is an extensible VCF filter as well.


      • casbon
        Junior Member
        • Sep 2011
        • 7

        #4
        Just a quick update: I have released the latest PyVCF as the 0.4 release http://pyvcf.readthedocs.org/en/late...Y.html#release

        I'm now considering VCF 4.1 and SV (structural variation) support. If you have any experience with these types of data, please help us flesh out the API:
        The VCF 4.1 spec adds some new metadata, notably contigs. I have added test data and tests, but still need to write the code for this.
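To make the contig metadata concrete, here is a rough sketch of parsing a VCF 4.1 `##contig` header line with the standard library only. This is not PyVCF code; `parse_contig_line` is a hypothetical helper, and the example line is modelled on the kind of entry shown in the spec. It assumes values are either bare or double-quoted, with no escaped quotes inside.

```python
import re

# key=value pairs; a value is either a double-quoted string (which may
# contain commas) or a bare token that runs up to the next comma.
_PAIR = re.compile(r'(\w+)=("[^"]*"|[^,]*)')


def parse_contig_line(line):
    """Parse '##contig=<ID=...,length=...>' into a dict (hypothetical helper)."""
    prefix, suffix = "##contig=<", ">"
    line = line.strip()
    if not (line.startswith(prefix) and line.endswith(suffix)):
        raise ValueError("not a contig meta line: %r" % line)
    body = line[len(prefix):-len(suffix)]
    fields = {key: value.strip('"') for key, value in _PAIR.findall(body)}
    if "length" in fields:
        fields["length"] = int(fields["length"])  # length is an integer
    return fields


line = '##contig=<ID=20,length=62435964,assembly=B36,species="Homo sapiens">'
contig = parse_contig_line(line)
print(contig["ID"], contig["length"], contig["species"])
# prints: 20 62435964 Homo sapiens
```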


        • casbon
          Junior Member
          • Sep 2011
          • 7

          #5
          I just released PyVCF 0.5.0. This release brings VCF 4.1 support, including support for structural variation and breakends. It includes many improvements to the VCF writer, a new utility, vcf_melt, and a new set of filters. You can see the full list of changes.

          Thanks very much to @dzerbino, @lennax, @libor-m, @ilyaminkin and @martijnvermaat for help on this release.
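For anyone who has not met breakends before, here is a small sketch of how a VCF 4.1 breakend ALT string such as `G]17:198982]` can be split into its parts. It uses only the standard library and is not how PyVCF represents breakends; `parse_breakend` and the keys of the returned dict are hypothetical names. It assumes the mate contig name contains no colon or square bracket.

```python
import re

# Breakend ALT layout: optional bases, a bracket, mate 'chrom:pos',
# the same bracket again, optional bases. Bases sit on one side only.
_BND = re.compile(
    r"^(?P<before>[ACGTNacgtn.]*)"
    r"(?P<bracket>[\[\]])"
    r"(?P<chrom>[^:\[\]]+):(?P<pos>\d+)"
    r"(?P=bracket)"
    r"(?P<after>[ACGTNacgtn.]*)$"
)


def parse_breakend(alt):
    """Return the parts of a breakend ALT, or None if alt is not a breakend."""
    m = _BND.match(alt)
    if m is None:
        return None
    return {
        "mate_chrom": m.group("chrom"),
        "mate_pos": int(m.group("pos")),
        "bracket": m.group("bracket"),
        "sequence": m.group("before") or m.group("after"),
        # True when the bases come before the bracketed mate position
        "sequence_first": bool(m.group("before")),
    }


print(parse_breakend("G]17:198982]"))
print(parse_breakend("A"))  # an ordinary SNV allele is not a breakend
```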


          • gourneau
            Junior Member
            • Jan 2011
            • 2

            #6
            Thanks for the lib, casbon.


            • FrankiB
              Member
              • Dec 2013
              • 23

              #7
              Does it work with Python 3.3? If so, where can I find installation instructions?
              Last edited by FrankiB; 04-07-2014, 11:32 AM.


              • TiborNagy
                Senior Member
                • Mar 2010
                • 329

                #8
                If you have setuptools installed, you can use:
                python setup.py install
