
  • failed when merging the example VCF files using VCFtools

    Dear SEQanswers,

    I am learning VCFtools by running its commands on the VCF files in the examples folder of the VCFtools installation. Please help me fix the three problems below, and give me some tips or pointers on merging VCF files.

    tabix is in my PATH.

    (1) Merging the three example VCF files fails.

    commands:
    merge-vcf merge-test-a.vcf merge-test-b.vcf merge-test-c.vcf > merg.vcf

    results:
    [main] fail to load the index file.
    The command "tabix -l merge-test-a.vcf" exited with an error. Is the
    file tabix indexed?

    at /Users/jianfengmao/programe_files/VCFtools/lib/Vcf.pm line 167
    Vcf::throw('Vcf4_0=HASH(0x10082df18)', 'The command "tabix -l
    merge-test-a.vcf" exited with an error....') called at
    /Users/jianfengmao/programe_files/VCFtools/lib/Vcf.pm line 1687
    VcfReader::get_chromosomes('Vcf4_0=HASH(0x10082df18)') called at
    /Users/jianfengmao/programe_files/VCFtools/bin/merge-vcf line 139
    main::init_cols('HASH(0x10082a3d0)', 'Vcf4_0=HASH(0x10082e110)')
    called at /Users/jianfengmao/programe_files/VCFtools/bin/merge-vcf
    line 219
    main::merge_vcf_files('HASH(0x10082a3d0)') called at
    /Users/jianfengmao/programe_files/VCFtools/bin/merge-vcf line 12

    (2) I then compressed and indexed the VCF files, but merging still fails.

    bgzip merge-test-a.vcf
    bgzip merge-test-b.vcf
    bgzip merge-test-c.vcf

    tabix -p vcf merge-test-a.vcf.gz
    tabix -p vcf merge-test-b.vcf.gz
    tabix -p vcf merge-test-c.vcf.gz

    ###############################################
    merge Command:
    merge-vcf merge-test-a.vcf.gz merge-test-b.vcf.gz merge-test-c.vcf.gz | bgzip -c > merg.vcf.gz

    results:
    zcat: merge-test-a.vcf.gz.Z: No such file or directory
    Error reading VCF file.

    at /Users/jianfengmao/programe_files/VCFtools/lib/Vcf.pm line 167
    Vcf::throw('Vcf=HASH(0x1008f32a8)', 'Error reading VCF file.\x{a}')
    called at /Users/jianfengmao/programe_files/VCFtools/lib/Vcf.pm line
    280
    Vcf::next_line('Vcf=HASH(0x1008f32a8)') called at
    /Users/jianfengmao/programe_files/VCFtools/lib/Vcf.pm line 219
    Vcf::_open('Vcf=HASH(0x1008f32a8)') called at
    /Users/jianfengmao/programe_files/VCFtools/lib/Vcf.pm line 161
    Vcf::new('Vcf', 'file', 'merge-test-a.vcf.gz') called at
    /Users/jianfengmao/programe_files/VCFtools/bin/merge-vcf line 125
    main::init_cols('HASH(0x10082a3d0)', 'Vcf4_0=HASH(0x10082e110)')
    called at /Users/jianfengmao/programe_files/VCFtools/bin/merge-vcf
    line 219
    main::merge_vcf_files('HASH(0x10082a3d0)') called at
    /Users/jianfengmao/programe_files/VCFtools/bin/merge-vcf line 12
    ###############################################
    merge Command:
    merge-vcf merge-test-a.vcf.gz merge-test-b.vcf.gz merge-test-c.vcf.gz > merg.vcf.gz

    results:
    zcat: merge-test-a.vcf.gz.Z: No such file or directory
    Error reading VCF file.

    at /Users/jianfengmao/programe_files/VCFtools/lib/Vcf.pm line 167
    Vcf::throw('Vcf=HASH(0x1008f32a8)', 'Error reading VCF file.\x{a}')
    called at /Users/jianfengmao/programe_files/VCFtools/lib/Vcf.pm line
    280
    Vcf::next_line('Vcf=HASH(0x1008f32a8)') called at
    /Users/jianfengmao/programe_files/VCFtools/lib/Vcf.pm line 219
    Vcf::_open('Vcf=HASH(0x1008f32a8)') called at
    /Users/jianfengmao/programe_files/VCFtools/lib/Vcf.pm line 161
    Vcf::new('Vcf', 'file', 'merge-test-a.vcf.gz') called at
    /Users/jianfengmao/programe_files/VCFtools/bin/merge-vcf line 125
    main::init_cols('HASH(0x10082a3d0)', 'Vcf4_0=HASH(0x10082e110)')
    called at /Users/jianfengmao/programe_files/VCFtools/bin/merge-vcf
    line 219
    main::merge_vcf_files('HASH(0x10082a3d0)') called at
    /Users/jianfengmao/programe_files/VCFtools/bin/merge-vcf line 12

    (3) vcf-stats and vcf-validator work on all three uncompressed VCF
    files (merge-test-a.vcf, merge-test-b.vcf, merge-test-c.vcf), but not
    on the compressed files.

    Command:
    vcf-validator merge-test-a.vcf.gz

    Results:
    zcat: merge-test-c.vcf.gz.Z: No such file or directory
    Error reading VCF file.

    at /Users/jianfengmao/programe_files/VCFtools/lib/Vcf.pm line 167
    Vcf::throw('Vcf=HASH(0x10082a0d0)', 'Error reading VCF file.\x{a}')
    called at /Users/jianfengmao/programe_files/VCFtools/lib/Vcf.pm line
    280
    Vcf::next_line('Vcf=HASH(0x10082a0d0)') called at
    /Users/jianfengmao/programe_files/VCFtools/lib/Vcf.pm line 219
    Vcf::_open('Vcf=HASH(0x10082a0d0)') called at
    /Users/jianfengmao/programe_files/VCFtools/lib/Vcf.pm line 161
    Vcf::new('Vcf', 'file', 'merge-test-c.vcf.gz') called at
    /Users/jianfengmao/programe_files/VCFtools/bin/vcf-validator line 53
    main::do_validation('HASH(0x100804ed0)') called at
    /Users/jianfengmao/programe_files/VCFtools/bin/vcf-validator line 14

  • #2
    Some of the VCFtools utilities seem to work only with uncompressed files, or only with compressed ones. Another example I found is the --remove-filtered <string> and --keep-filtered <string> options, which only worked with uncompressed files for me.

    As for your zcat problem: Mac OS X's version of zcat requires that the file name end in .Z, which is why the error mentions merge-test-a.vcf.gz.Z. I was under the impression that the latest VCFtools had fixed this issue, but if not, I think you can just rename the files with .Z appended to the end for the sake of merging.
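
    A minimal sketch of two ways around the .Z issue, under some assumptions: the helper names read_gz and link_with_z are made up for illustration and are not part of VCFtools, and I have not checked how a given VCFtools release opens compressed files. The first helper uses gzip -dc, which decompresses to stdout without looking for a .Z suffix. The second creates a symlink carrying the .Z name instead of renaming the file, so the original .vcf.gz name (and its tabix index) stays in place.

```shell
#!/bin/sh
# Hypothetical helpers; neither name comes from VCFtools.

# Decompress a gzip (or bgzip) file to stdout.
# Unlike the zcat shipped with Mac OS X, `gzip -dc` does not append ".Z"
# to the file name it is given.
read_gz() {
    gzip -dc "$1"
}

# Create NAME.Z as a symlink next to NAME, so that a zcat which insists on
# the ".Z" suffix can find the data. The link target is the bare file name
# because a relative symlink is resolved from the link's own directory.
link_with_z() {
    ln -s "$(basename "$1")" "$1.Z"
}

# Example use (commented out, since it needs the example files):
#   for f in merge-test-a.vcf.gz merge-test-b.vcf.gz merge-test-c.vcf.gz; do
#       link_with_z "$f"
#   done
#   read_gz merge-test-a.vcf.gz | head
```

    Whether merge-vcf then succeeds depends on the VCFtools version, so treat the symlink as a stopgap and try a newer release first.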
