Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • MAQ SNP calling

    > Hi,
    >
    > From my understanding MAQ is tested for 20-50x coverage regions for SNP
    > calling. However, we routinely have projects where depth of coverage
    > from Solexa reads is ~5000x.
    >
    > I see quite a few 'SNPs' that are not called by MAQ.
    > These are the duplicate samples, where MAQ does NOT call a SNP
    >
    > Pos Depth %a %c %g %t
    > 64 6632 0.11 92.01 0.69 7.19
    > 64 6687 0.07 90.88 0.40 8.64
    >
    > While for these duplicates it makes a SNP call!
    > 64 8594 0.05 91.91 0.54 7.49
    > 64 7889 0.13 91.15 0.49 8.23
    >
    > Any suggestions on what threshold/filter to use to make consistent
    > calling with high depth of coverage regions?
    --
    bioinfosm

  • #2
    SNP calling in high depth/coverage solexa data

    Any other options to use apart from MAQ in calling SNPs on such data?
    --
    bioinfosm

    Comment


    • #3
      Sort of on-topic....

      I'm not sure what the answer is to your question about using MAQ, but I have had the pleasure of doing SNP calls on a MAQ .map file, for which I wrote my own SNP caller. Doing your own SNP calls is actually very easy, but leaves you with the same questions about filters (though, no consistency issues, that I'm aware of.).
      The more you know, the more you know you don't know. —Aristotle

      Comment


      • #4
        Yes, MAQ SNP calling is amazingly good! I just wish it scaled to the high-depth data I have. The other alternative which I will try today is take a random subset of my data and artificially get manageable depth
        --
        bioinfosm

        Comment


        • #5
          How Maq works for small indels?

          Comment


          • #6
            Originally posted by Rao View Post
            How Maq works for small indels?
            from the PDF manual/published paper: maq for indels needs paired end reads. maps read 1, then does gapped alignment of read 2. maq indelpe will list the indels called from the read2 gapped alignment.

            try novocraft software if you want more indels from single end or paired end reads.

            Comment


            • #7
              Originally posted by dvh View Post
              from the PDF manual/published paper: maq for indels needs paired end reads. maps read 1, then does gapped alignment of read 2. maq indelpe will list the indels called from the read2 gapped alignment.

              try novocraft software if you want more indels from single end or paired end reads.
              Thanks dvh,
              How about SOAP

              Comment


              • #8
                I used SOAP to detect upto 4bp indels in the solexa data. I usually use it on the reads that MAQ is not able to align to reference.
                --
                bioinfosm

                Comment


                • #9
                  Another case of MAQ SNP calling that I would like to discuss here:

                  cns.final.snp
                  PKD1 37187 T C 255 255 1.12 63 62
                  PKD1 37189 T C 255 255 1.56 63 62

                  These are so close to each other, and look very likely to be False Positives. I was of the idea that MAQ throws such away, but not so..
                  --
                  bioinfosm

                  Comment


                  • #10
                    Missed SNP call; obvious from pileup output

                    I see this obvious SNP from the pileup file with 382 reads mapping the position of which 99.74% say its a C, as opposed to the reference T.

                    Still, the position is not reported in cns.snp file! As I understand, the maq cns2snp command on consensus.cns does not do any filtering and reports all SNPs. This ought to have been reported..

                    Anything I am missing here?
                    --
                    bioinfosm

                    Comment


                    • #11
                      MAQ SNP calling doubts

                      Originally posted by bioinfosm View Post
                      Another case of MAQ SNP calling that I would like to discuss here:

                      cns.final.snp
                      PKD1 37187 T C 255 255 1.12 63 62
                      PKD1 37189 T C 255 255 1.56 63 62

                      These are so close to each other, and look very likely to be False Positives. I was of the idea that MAQ throws such away, but not so..
                      Hi Bioinfoism,

                      This is probably 4 years late, but have you resolve the SNP calling issue with MAQ? I am actually facing the same scenario here where neighbouring SNPs were called. Any ideas on what is causing this? Also, since neighbouring SNPs were called, can the alignment be trusted?

                      Rookie question. But thanks in advance

                      Comment

                      Latest Articles

                      Collapse

                      • seqadmin
                        Essential Discoveries and Tools in Epitranscriptomics
                        by seqadmin




                        The field of epigenetics has traditionally concentrated more on DNA and how changes like methylation and phosphorylation of histones impact gene expression and regulation. However, our increased understanding of RNA modifications and their importance in cellular processes has led to a rise in epitranscriptomics research. “Epitranscriptomics brings together the concepts of epigenetics and gene expression,” explained Adrien Leger, PhD, Principal Research Scientist...
                        04-22-2024, 07:01 AM
                      • seqadmin
                        Current Approaches to Protein Sequencing
                        by seqadmin


                        Proteins are often described as the workhorses of the cell, and identifying their sequences is key to understanding their role in biological processes and disease. Currently, the most common technique used to determine protein sequences is mass spectrometry. While still a valuable tool, mass spectrometry faces several limitations and requires a highly experienced scientist familiar with the equipment to operate it. Additionally, other proteomic methods, like affinity assays, are constrained...
                        04-04-2024, 04:25 PM

                      ad_right_rmr

                      Collapse

                      News

                      Collapse

                      Topics Statistics Last Post
                      Started by seqadmin, 04-11-2024, 12:08 PM
                      0 responses
                      59 views
                      0 likes
                      Last Post seqadmin  
                      Started by seqadmin, 04-10-2024, 10:19 PM
                      0 responses
                      57 views
                      0 likes
                      Last Post seqadmin  
                      Started by seqadmin, 04-10-2024, 09:21 AM
                      0 responses
                      51 views
                      0 likes
                      Last Post seqadmin  
                      Started by seqadmin, 04-04-2024, 09:00 AM
                      0 responses
                      56 views
                      0 likes
                      Last Post seqadmin  
                      Working...
                      X