  • vcftools typical usage

    Hi all,
    I've noticed that vcftools exists... seriously, if you search for "vcftools" on the SEQanswers forum you'll find only 3 threads. BTW, I don't want to start a flame war about this :-)
    I just wonder if anybody here has some experience with these tools and wants to share some usage tips and typical scenarios. The vcftools binary comes with many options, and so do the Perl scripts. The vcftools site apparently has no wiki pages (such as the samtools or GATK ones)...

    d

  • #2
    All I've used it for is this:

    Code:
    vcftools --vcf some1000genomesFile.genotypes.vcf --counts --minDP 3
    will give you something like this (vcftools writes it to out.frq.count unless you set a different prefix with --out):
    Code:
    CHROM	POS	N_ALLELES	N_CHR	{ALLELE:COUNT}
    1	533	2	92	G:87	C:5
    1	41342	2	54	T:38	A:16
    1	41791	2	62	G:58	A:4
    1	44449	2	50	T:48	C:2
    1	44539	2	36	C:35	T:1
    Code:
    awk 'NR > 1 {print "chr"$1"\t"($2-1)"\t"$2"\t"$5"\t"$6}' out.frq.count
    and, with the header line skipped, you have a BED file of 1000 Genomes variants:
    Code:
    chr1	532	533	G:87	C:5
    chr1	41341	41342	T:38	A:16
    chr1	41790	41791	G:58	A:4
    chr1	44448	44449	T:48	C:2
    chr1	44538	44539	C:35	T:1
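
    The awk one-liner above prints only fields 5 and 6, so a site with more than two alleles would lose its extra ALLELE:COUNT columns. A minimal sketch of a variant that keeps every allele column, assuming the tab-separated layout shown above; counts_to_bed is a hypothetical helper name, not part of vcftools:

```shell
# Hypothetical helper (not part of vcftools): read vcftools --counts output
# on stdin and write BED-like lines, keeping every ALLELE:COUNT column.
counts_to_bed() {
    awk 'BEGIN { OFS = "\t" }
         NR > 1 {                         # skip the CHROM/POS header line
             line = "chr" $1 OFS ($2 - 1) OFS $2
             for (i = 5; i <= NF; i++)    # allele columns start at field 5
                 line = line OFS $i
             print line
         }'
}

# Example using the first data row shown above
printf 'CHROM\tPOS\tN_ALLELES\tN_CHR\t{ALLELE:COUNT}\n1\t533\t2\t92\tG:87\tC:5\n' | counts_to_bed
```

    With a real run you would feed it the counts file instead, e.g. counts_to_bed < out.frq.count, assuming the default output prefix.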
