  • Samtools SNP calling

    Hello there,
    I have an issue with samtools SNP calling. I work with captured genomic sequences.
    The fold coverage is very high, at 600X. I used BWA (mismatch penalty -7) to map the reads to the genome and samtools to call SNPs. I ran mpileup, then realised that a known SNP was not called, so I used pileup to investigate what is happening in that region. The output is as follows:
    CFA15 1299612 T T 255 0 59 309 g,GG,,.g,,.G.g.,,,gg.G,gg,g,,.G.,,,,,,GG...gGG,,,g.ggggg,.ggG.gg..g,Gg.Gggg.G
    .g,,.gg,,,,gg.,g,g.,,.Gg,.,gg,gg,ggggG,..Ggg,,g,g,,,.g,gG,gGgg.g,GG,Gg,,..,,g.G,,,,,,,.G,,gg,gG.gg,gGGGg.GGGG,g,gGg,G,,g,,g,.,g,g..GG.G.,gggg
    GggG.G,,g,g,.,,..,gGG..G,G,,..g,gg,g,.,Ggg,.G,g,.,gGGGGg,G.GGg,.gggG,g,,,g,G.G..G...,g^]g^],^],^]g^]G !T!!]^^!^^^!^!^^^^!!^!^!!^!^^^!^^^^^^
    ^!!^^^!!!^^^!^!!!!!^^!!!^!!^^!^!!^!!!!^!^!^^^!!^^^^!!^^!^!^^^^!!^^^!!^!!^!!!!!^^^!!!^^!^!^^^^!^!!^!!!!^!^!!^!!^^^^^^!^!^^^^^^^^!^^!!^!!^!!^!!
    !!!^!!!!^!^!!!^!^^!^^!^^^!^!^^!!^!^^!!!!!!!!^!^^!]!^^^^^^^!!!^^!^!^^^B!^!!^!^^^!!!^^!^!^^^!!!!,!^!^!!!^^!!!!^!^__!_!_!EE!EEA>!!EE!!
    What I do not understand is why samtools is not reporting the consensus sequence as K. Is this the reason why the position is not called as a variant?
    Thanks a lot for the answers.

  • #2
    All those !'s are the lowest possible base quality (Phred 0). It doesn't want to call a G because it thinks all the G calls are horribly unreliable.
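
To make the point about the !'s concrete, here is a minimal Python sketch (not anything samtools itself runs) that decodes a pileup quality column and counts only the bases above a quality cutoff. It assumes Phred+33 encoding and the usual pileup read-base syntax; the function names and the toy input are made up for illustration, and the cutoff of 13 is assumed to match mpileup's default -Q.

```python
from collections import Counter


def phred33(quals: str) -> list[int]:
    """Decode a Phred+33 quality string into integer scores ('!' -> 0)."""
    return [ord(ch) - 33 for ch in quals]


def pileup_bases(read_bases: str, ref: str) -> list[str]:
    """Expand the pileup read-base column into one uppercase base per read."""
    bases = []
    i = 0
    while i < len(read_bases):
        ch = read_bases[i]
        if ch == "^":
            # Read start: the next character is a mapping quality, not a base.
            i += 2
            continue
        if ch == "$":
            # Read end marker carries no base.
            i += 1
            continue
        if ch in "+-":
            # Indel: sign, length digits, then that many inserted/deleted bases.
            j = i + 1
            while j < len(read_bases) and read_bases[j].isdigit():
                j += 1
            i = j + int(read_bases[i + 1:j])
            continue
        # '.' and ',' mean "matches the reference" on either strand.
        bases.append(ref.upper() if ch in ".," else ch.upper())
        i += 1
    return bases


def count_bases(read_bases: str, quals: str, ref: str, min_qual: int = 13) -> Counter:
    """Count bases whose quality is at least min_qual (assumed cutoff)."""
    scores = phred33(quals)
    bases = pileup_bases(read_bases, ref)
    if len(bases) != len(scores):
        raise ValueError("base and quality columns have different lengths")
    return Counter(b for b, q in zip(bases, scores) if q >= min_qual)


# Toy column: three G reads at quality '!' (0), two reference reads at 'E' (36).
print(count_bases("g,G.^]g$", "!E!E!", "T"))              # G reads are filtered out
print(count_bases("g,G.^]g$", "!E!E!", "T", min_qual=0))  # G reads reappear
```

With the cutoff in place every G drops out and only the reference base is left, which is the same picture as the pileup line above: plenty of G reads, but none with a quality the caller trusts.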

    Did you run mpileup with the default settings? At my fingertips, I've got a similar case, with two SNPs that were definitely Sanger-confirmed but also had lousy quality scores. When I re-ran mpileup with -B, the quality scores improved to what the .sam file said they ought to be, and my two SNPs popped up.
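
For reference, -B switches off mpileup's BAQ (base alignment quality) recalculation, which is presumably what pushed the qualities down. A rough sketch of how the re-run might be put together from Python is below; the file names `ref.fa` and `aln.sorted.bam` are placeholders, and the helper only builds the command rather than running it.

```python
import shlex


def mpileup_command(reference: str, bam: str, disable_baq: bool = True) -> list[str]:
    """Build a samtools mpileup argument list, optionally with -B (no BAQ)."""
    cmd = ["samtools", "mpileup"]
    if disable_baq:
        cmd.append("-B")  # disable BAQ so base qualities stay as in the alignment file
    cmd += ["-f", reference, bam]  # -f names the indexed reference FASTA
    return cmd


# Placeholder file names; substitute your own reference and sorted BAM.
cmd = mpileup_command("ref.fa", "aln.sorted.bam")
print(shlex.join(cmd))
# To actually run it (requires samtools on PATH):
#   import subprocess; subprocess.run(cmd, check=True)
```

Comparing the output with and without -B at the position in question should show whether BAQ is what is flattening the G qualities.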

    If you ran pileup, you should really get the newest version of samtools, and run mpileup. People will be less willing to troubleshoot software they know is superseded.


    • #3
      Do you have to format the original data? If not, I also understand...


      • #4
        Thank you very much. It worked.
