  • krawitz
    Member
    • Feb 2010
    • 35

    A heretically simple approach to variant calling

    Hi everyone,

    we had a look at the distribution of heterozygous allele frequencies in NGS datasets and found that their variance is larger than expected under a binomial distribution (http://www.ncbi.nlm.nih.gov/pubmed?term=22127862). For every variant caller this means that a binomial prior distribution is not the right choice and might lead to false-negative calls. We also found that a simple frequency classifier (heterozygous if covered by more than 20 reads and the variant allele frequency is between 14% and 86%) is more sensitive at comparable specificity on high-quality data than the default settings of most standard calling tools.

    Is anyone aware of a fast tool that can apply such a frequency filter directly to a .bam file?

    cheers,

    peter
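
For illustration, the frequency rule described above could be sketched as follows. This is only a minimal sketch and not the code used in the paper: the function name `is_heterozygous` and its parameters are hypothetical, and treating the 14% and 86% bounds as inclusive is an assumption, since the post does not say.

```python
def is_heterozygous(depth, variant_reads, min_depth=20, low=0.14, high=0.86):
    """Apply the simple frequency rule to a single site.

    depth         -- number of reads covering the site
    variant_reads -- number of those reads carrying the variant allele

    A site is called heterozygous if it is covered by MORE than
    `min_depth` reads and the variant allele frequency lies between
    `low` and `high` (bounds treated as inclusive: an assumption).
    """
    if depth <= min_depth:
        return False  # not enough coverage to make a call
    frequency = variant_reads / depth
    return low <= frequency <= high


if __name__ == "__main__":
    # 12 variant reads out of 40 -> 30% variant allele frequency
    print(is_heterozygous(depth=40, variant_reads=12))
```

The thresholds are keyword arguments so they can be tuned per dataset; the defaults simply mirror the numbers quoted in the post.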
  • Hena
    Member
    • Nov 2009
    • 19

    #2
    You can ask samtools mpileup to print out the nucleotide pileups for each position in the BAM file. Parsing that should be fairly simple with a script.
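
A rough sketch of such a script, assuming the standard six-column mpileup text output for a single sample (chromosome, position, reference base, depth, read bases, base qualities). The function names `count_alleles` and `het_candidates` are made up for this example. Two choices here are assumptions rather than anything stated in the thread: coverage is taken as the number of reads showing a reference match or an A/C/G/T mismatch (indels, `*` placeholders and N calls are ignored), and only the most frequent non-reference base is tested against the 14-86% window.

```python
import re
import sys
from collections import Counter


def count_alleles(bases):
    """Count reference matches and mismatching bases in an mpileup base string."""
    ref_reads = 0
    alt_reads = Counter()
    i = 0
    while i < len(bases):
        c = bases[i]
        if c == "^":
            i += 2  # read-start marker is followed by one mapping-quality character
            continue
        if c in "+-":
            # indel: sign, length digits, then that many inserted/deleted bases
            digits = re.match(r"\d+", bases[i + 1:]).group()
            i += 1 + len(digits) + int(digits)
            continue
        if c in ".,":
            ref_reads += 1  # match on forward / reverse strand
        elif c.upper() in "ACGT":
            alt_reads[c.upper()] += 1  # mismatch, strands merged
        i += 1  # everything else ('$', '*', '<', '>', N) is skipped
    return ref_reads, alt_reads


def het_candidates(lines, min_depth=20, low=0.14, high=0.86):
    """Yield (chrom, pos, ref, alt, depth, freq) for sites passing the filter."""
    for line in lines:
        fields = line.rstrip("\n").split("\t")
        if len(fields) < 5:
            continue  # skip malformed or empty lines
        chrom, pos, ref, bases = fields[0], int(fields[1]), fields[2], fields[4]
        ref_reads, alt_reads = count_alleles(bases)
        if not alt_reads:
            continue
        depth = ref_reads + sum(alt_reads.values())
        alt, alt_count = alt_reads.most_common(1)[0]
        freq = alt_count / depth
        if depth > min_depth and low <= freq <= high:
            yield chrom, pos, ref.upper(), alt, depth, freq


if __name__ == "__main__":
    # Reads mpileup text from standard input, one site per line.
    for site in het_candidates(sys.stdin):
        print("\t".join(str(x) for x in site))
```

Saved under a name of your choice, the script reads the mpileup text from standard input, so the pileup can be piped straight into it without writing an intermediate file.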


    • krawitz
      Member
      • Feb 2010
      • 35

      #3
      The samtools mpileup output can be piped into VarScan to apply a coverage and frequency filter:
      samtools mpileup -f reference.fasta myData.bam | java -jar VarScan.v2.2.jar pileup2snp --min-coverage 20 --min-var-freq 0.14
      See: http://varscan.sourceforge.net/using-varscan.html


      • NGSfan
        Senior Member
        • Apr 2009
        • 181

        #4
        Very nice paper.

        Sometimes simpler is better.


        • Rocketknight
          Member
          • Sep 2011
          • 86

          #5
          This is awesome. Moving away from big, fancy, well-established tools to something like the "14-86%" rule is scary, though.
