Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • vcftools typical usage

    Hi all,
    I've noticed that vcftools exist... seriously, if you search "vcftools" in seqanswer forum you'll find only 3 threads. BTW I don't want to start a flame about this :-)
    I just wonder if anybody here has some experience of these tools and wants to share some usage tips and typical scenarios. vcftools binary comes with many options, and so do the perl scripts. The vcftools pages apparently have no wiki pages (such the samtools or GATK ones)...

    d

  • #2
    All I've used it for

    Code:
    vcftools --vcf some1000genomesFile.genotypes.vcf --counts --minDP 3
    will give you something like:
    Code:
    CHROM	POS	N_ALLELES	N_CHR	{ALLELE:COUNT}
    1	533	2	92	G:87	C:5
    1	41342	2	54	T:38	A:16
    1	41791	2	62	G:58	A:4
    1	44449	2	50	T:48	C:2
    1	44539	2	36	C:35	T:1
    Code:
    awk '{print "chr"$1"\t"$2-1"\t"$2"\t"$5"\t"$6}'
    and you have a bed file of 1000 genomes variants:
    Code:
    chr1	532	533	G:87	C:5
    chr1	41341	41342	T:38	A:16
    chr1	41790	41791	G:58	A:4
    chr1	44448	44449	T:48	C:2

    Comment

    Latest Articles

    Collapse

    • seqadmin
      Current Approaches to Protein Sequencing
      by seqadmin


      Proteins are often described as the workhorses of the cell, and identifying their sequences is key to understanding their role in biological processes and disease. Currently, the most common technique used to determine protein sequences is mass spectrometry. While still a valuable tool, mass spectrometry faces several limitations and requires a highly experienced scientist familiar with the equipment to operate it. Additionally, other proteomic methods, like affinity assays, are constrained...
      04-04-2024, 04:25 PM
    • seqadmin
      Strategies for Sequencing Challenging Samples
      by seqadmin


      Despite advancements in sequencing platforms and related sample preparation technologies, certain sample types continue to present significant challenges that can compromise sequencing results. Pedro Echave, Senior Manager of the Global Business Segment at Revvity, explained that the success of a sequencing experiment ultimately depends on the amount and integrity of the nucleic acid template (RNA or DNA) obtained from a sample. “The better the quality of the nucleic acid isolated...
      03-22-2024, 06:39 AM

    ad_right_rmr

    Collapse

    News

    Collapse

    Topics Statistics Last Post
    Started by seqadmin, 04-11-2024, 12:08 PM
    0 responses
    30 views
    0 likes
    Last Post seqadmin  
    Started by seqadmin, 04-10-2024, 10:19 PM
    0 responses
    32 views
    0 likes
    Last Post seqadmin  
    Started by seqadmin, 04-10-2024, 09:21 AM
    0 responses
    28 views
    0 likes
    Last Post seqadmin  
    Started by seqadmin, 04-04-2024, 09:00 AM
    0 responses
    53 views
    0 likes
    Last Post seqadmin  
    Working...
    X