Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • Samtools mpileup/bcftools

    Hi all,
    I read through samtools manuals several times, but I'm still not clear on how exactly samtools & bcftools decide to call a SNP. I've tried to run through multiple combination of arguments with mpileup (-B, -C, -q, etc) & bcftools, but still ran into the problem below. I even ran bcftools view on the bcf file without the varFilter step, but the problem persists.

    I have 2 samples, an original & an "evolved" cell line. Based on numerous runs, I found that there are many SNPs being called only in the "evolved" cell line but not on the original, making it look like they're "novel" SNP. However, when I view them on IGV, I can see the SNP in the original cell line and there don't seem to be significant differences between the mapping quality or base quality at the SNP position in these 2 samples.
    It's not important to me if reads below a certain mapping quality don't get count, but the trouble is that it seems to be inconsistent. In one sample, SNPs on reads having mapping quality of 0 don't get count, but then they would get counted in the other sample, making it difficult to identify the true novel SNP. Is there anyway to force the SNP count to be more consistent?
    -Ann

  • #2
    Hi,

    I'm also having a similar problem! I have PCR confirmed SNPs that I can see in IGV when I look at the BAM file, however they are not present after varFilter. I'm guessing that there are parameters that I need to change to include them, but there doesn't seem to be any trend in map quality or coverage that I can distinguish ones that are called vs. missed in the vcf output.

    I mapped my PE Illumina reads with BWA and then running samtools mpileup:
    $ samtools mpileup -uf x.fasta Initial-sorted.bam | bcftools/bcftools view -bvcg - > initial.raw.bcf

    Then ran:
    $bcftools view initial.raw.bcf | vcfutils.pl varFilter -D500 > initial_snps.vcf

    I've tried adding -E switch to the samtools mpileup and get alot more SNPs, however this still does not include the SNPs confirmed by PCR. Also tried increasing the -D value when on the bcftools commands - again increases SNP count, but still not ones I know of.

    Please help! Not sure what else to do....

    Comment


    • #3
      Try VarScan http://varscan.sourceforge.net/

      It takes the 'mpileup -f reference' output (piped or in a file) and then calls variants based on statistical analysis. Their paper does a good job at describing the algorithm. Plus the command line parameters are much more straightforward.

      I hope this helps!

      Comment


      • #4
        Thanks so much! The VarScan results seemed to agree much better with my pcr data!

        Comment

        Latest Articles

        Collapse

        • seqadmin
          Recent Advances in Sequencing Analysis Tools
          by seqadmin


          The sequencing world is rapidly changing due to declining costs, enhanced accuracies, and the advent of newer, cutting-edge instruments. Equally important to these developments are improvements in sequencing analysis, a process that converts vast amounts of raw data into a comprehensible and meaningful form. This complex task requires expertise and the right analysis tools. In this article, we highlight the progress and innovation in sequencing analysis by reviewing several of the...
          Yesterday, 07:48 AM
        • seqadmin
          Essential Discoveries and Tools in Epitranscriptomics
          by seqadmin




          The field of epigenetics has traditionally concentrated more on DNA and how changes like methylation and phosphorylation of histones impact gene expression and regulation. However, our increased understanding of RNA modifications and their importance in cellular processes has led to a rise in epitranscriptomics research. “Epitranscriptomics brings together the concepts of epigenetics and gene expression,” explained Adrien Leger, PhD, Principal Research Scientist...
          04-22-2024, 07:01 AM

        ad_right_rmr

        Collapse

        News

        Collapse

        Topics Statistics Last Post
        Started by seqadmin, Today, 06:57 AM
        0 responses
        9 views
        0 likes
        Last Post seqadmin  
        Started by seqadmin, Yesterday, 07:17 AM
        0 responses
        13 views
        0 likes
        Last Post seqadmin  
        Started by seqadmin, 05-02-2024, 08:06 AM
        0 responses
        19 views
        0 likes
        Last Post seqadmin  
        Started by seqadmin, 04-30-2024, 12:17 PM
        0 responses
        22 views
        0 likes
        Last Post seqadmin  
        Working...
        X