
  • Read Depth in vcf (samtools / bcftools)

    I generated one VCF file from 4 different BAM files (4 samples) and compared some of the variants in the VCF with what I see in the Integrative Genomics Viewer (IGV).

    I don't understand why the value of DP in the FORMAT string (i.e. the per-sample value) is often lower than what I see in IGV. For example, the VCF tells me that a certain variant in a certain sample has a read depth (DP) of 3, but in IGV I see more reads covering that position. I thought that bases of bad quality might have been left out, but the reads and base qualities are good. Is there some other criterion I don't know about that excludes certain reads or bases from the DP field in the FORMAT string (and consequently from genotype assignment)?

    Any hint is appreciated.

  • #2
    I had the same question when I started my analysis. I then figured out that DP (depth of coverage) is not exactly the count of reads at that position. Please see the following links; I am sure you will find the answer there:



    Hope it helps
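
    To illustrate why a filtered depth can be lower than the number of reads drawn in a viewer, here is a minimal Python sketch. It does not call samtools or bcftools; the `PileupBase` record and both functions are hypothetical names made up for this example. The default thresholds are assumptions based on my understanding of the `samtools mpileup` defaults (minimum base quality 13, minimum mapping quality 0, anomalous/orphan read pairs skipped unless `-A` is given), so please check the manual for your version. BAQ, which can lower base qualities near indels, is not modelled here.

```python
from dataclasses import dataclass


@dataclass
class PileupBase:
    """One read base overlapping the position of interest (hypothetical record)."""
    base_quality: int
    mapping_quality: int
    is_duplicate: bool = False
    is_secondary: bool = False
    is_qc_fail: bool = False
    is_orphan: bool = False  # paired read that is not part of a proper pair


def raw_depth(bases):
    """Every overlapping read counts, roughly what a viewer draws with no filters."""
    return len(bases)


def filtered_depth(bases, min_base_quality=13, min_mapping_quality=0,
                   count_orphans=False):
    """Count only bases that pass the (assumed) pileup filters."""
    kept = 0
    for b in bases:
        if b.is_duplicate or b.is_secondary or b.is_qc_fail:
            continue  # flagged reads are skipped entirely
        if b.is_orphan and not count_orphans:
            continue  # anomalous pairs are skipped unless explicitly requested
        if b.mapping_quality < min_mapping_quality:
            continue
        if b.base_quality < min_base_quality:
            continue
        kept += 1
    return kept


# Made-up example: six reads cover the site, but only three pass every filter.
bases = [
    PileupBase(base_quality=30, mapping_quality=60),
    PileupBase(base_quality=30, mapping_quality=60),
    PileupBase(base_quality=30, mapping_quality=60),
    PileupBase(base_quality=30, mapping_quality=60, is_duplicate=True),
    PileupBase(base_quality=30, mapping_quality=60, is_orphan=True),
    PileupBase(base_quality=5, mapping_quality=60),
]

print(raw_depth(bases), filtered_depth(bases))
```

    In this toy data the duplicate, the orphan read and the low-quality base are all dropped, so reads with perfectly good base qualities can still be missing from the filtered count.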
