  • vcftools site filter using bedfile

    Hi all

    I am trying to filter my vcf file in vcftools using a bed file. I am consistently getting an output vcf with 0 sites retained. Following is the command line I used:

    vcftools --gzvcf myfile.vcf.gz --recode --bed bedfile.bed --out outfile

    The log file looks like this:
    VCFtools - v0.1.6
    (C) Adam Auton 2009
    Parameters as interpreted:
    --out PsvaMvd7a
    --recode
    --gzvcf PsvaMvd7.vcf.gz
    --bed CF2_probes_mod_sorted.bed

    Scanning PsvaMvd7.vcf.gz ...
    Currently scanning CHROM: MT
    Currently scanning CHROM: 38
    Currently scanning CHROM: 35
    Currently scanning CHROM: 36
    Currently scanning CHROM: 37
    Currently scanning CHROM: 33
    Currently scanning CHROM: 32
    Currently scanning CHROM: 26.......
    Keeping 1718454 entries (out of 1718454 read)
    Done
    Filtering sites by BED file
    Kept 14 out of 14 Individuals
    Kept 0 out of 1718454 Sites
    Error:No data left for analysis!

    My bed file looks like this:

    "track name=""tiled_region"" description=""NimbleGen Tiled Regions"""
    chr1 11398149 11398377
    chr1 11398197 11398414
    chr1 11398377 11398448
    chr1 11453954 11454357
    chr1 11453999 11454327
    chr1 11456016 11456508
    chr1 11456061 11456496
    chr1 11537417 11537695
    chr1 11537488 11537637

    Appreciate any help. Thanks.
    Last edited by info_nowise; 04-19-2012, 05:58 AM. Reason: added more info

  • #2
    Are you 100% sure that's wrong?

    BEDTools can also get you the intersect between a .vcf and a .bed
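
To make concrete what "the intersect between a .vcf and a .bed" means, here is a minimal pure-Python sketch. It is not how BEDTools or vcftools are implemented, and the function names `load_bed_intervals` and `filter_vcf_lines` are hypothetical. It assumes whitespace-separated BED lines, tab-separated VCF records, and the usual coordinate conventions (BED intervals 0-based and half-open, VCF POS 1-based). Chromosome names are compared as exact strings.

```python
from collections import defaultdict


def load_bed_intervals(bed_lines):
    """Collect (start, end) pairs per chromosome from BED lines."""
    intervals = defaultdict(list)
    for line in bed_lines:
        fields = line.split()
        # Skip track/browser/comment lines: anything without numeric coordinates.
        if len(fields) < 3 or not (fields[1].isdigit() and fields[2].isdigit()):
            continue
        intervals[fields[0]].append((int(fields[1]), int(fields[2])))
    return intervals


def filter_vcf_lines(vcf_lines, intervals):
    """Yield header lines plus records whose POS lies inside a BED interval."""
    for line in vcf_lines:
        if line.startswith("#"):
            yield line
            continue
        chrom, pos = line.split("\t", 2)[:2]
        pos = int(pos)
        # BED is 0-based half-open, VCF POS is 1-based: start < POS <= end.
        # The chromosome lookup is an exact string match.
        if any(start < pos <= end for start, end in intervals.get(chrom, ())):
            yield line
```

This linear scan over intervals is only meant to show the logic; for a VCF with millions of sites a dedicated tool is the better choice.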


    • #3
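      Yes, I am sure. However, I figured out the problem: the chromosome names in my bed file start with 'chr' (chr1, ...), while the vcf uses bare names (38, 35, MT, ... in the log above), so no site ever matched. I got rid of the letters 'chr' in the bed file, leaving only the numbers, and vcftools worked!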
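
For anyone who wants to script the same fix, here is a minimal Python sketch. The function name `strip_chr_prefix` is hypothetical, and the sketch assumes the prefix is exactly `chr` at the start of each interval line. Lines without numeric start/end columns (such as the track line at the top of the bed file) are passed through unchanged. Note that a `chrM` entry would become `M`, not the `MT` shown in the log, so mitochondrial intervals would need separate handling.

```python
def strip_chr_prefix(bed_lines, prefix="chr"):
    """Return BED lines with `prefix` removed from the chromosome column."""
    cleaned = []
    for line in bed_lines:
        fields = line.split()
        # An interval line has at least three columns with numeric start/end.
        is_interval = (
            len(fields) >= 3 and fields[1].isdigit() and fields[2].isdigit()
        )
        if is_interval and line.startswith(prefix):
            line = line[len(prefix):]
        cleaned.append(line)
    return cleaned


# Example usage (file names are placeholders):
# with open("bedfile.bed") as src, open("bedfile_nochr.bed", "w") as dst:
#     dst.writelines(strip_chr_prefix(src))
```

The rewritten file can then be passed to `--bed` in the original command.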



      • #4
        Thanks for the tip, though, swbarnes2!
