Seqanswers Leaderboard Ad

Collapse
X
 
  • Filter
  • Time
  • Show
Clear All
new posts
  • Jane M
    Senior Member
    • Aug 2011
    • 239

    Which method to confirm SNPs and indels?

    Dear all,

    I have analyzed my DNA-seq data, in the common way: alignment, de-duplication, realignement, recalibration and variant calling, to get SNPs and indels. Now, we want to confirm these variants with biology. I am wondering what is the best way to do it.
    I haven't found a topic about it on the forum that is why I open this discussion.

    There is Sanger sequencing, but it could give false negative depending on the material used (if I understood well).
    It is also possible to use Ion Torrent.
    I am not a biologist and I am not familiar with these 2 methods. In my institute, we start working with DNA-seq data.

    Could you please let me know what do you do, which difficulties did you meet with these methods? Which traps to avoid?

    Thanks in advance,
    Jane
  • swbarnes2
    Senior Member
    • May 2008
    • 910

    #2
    Sanger sequencing is going to be the gold standard for accuracy in DNA sequence. It's kind of overkill when you only care about a single base, but you'll have the added context of a few hundred bases around your SNP.

    Comment

    • Jane M
      Senior Member
      • Aug 2011
      • 239

      #3
      Thank you swbarnes2 for your answer. I heard that with Sanger sequencing, we can get false negatives because of the tak (DNA polymerase). I would like to ask people used to do Sanger sequencing how frequent are these cases.

      Finally, in my study, the biologists intend to use Ion Torrent to save time. What do you think about this method? Do you have any feedback?

      Comment

      • brofallon
        Member
        • May 2011
        • 26

        #4
        Sanger

        We Sanger suspected variants pretty frequently here and we don't get too many false negatives. Ion Torrent is another next-gen method and is both expensive and probably not much more reliable than other NGS methods.

        Comment

        • swbarnes2
          Senior Member
          • May 2008
          • 910

          #5
          Do you understand how sanger sequencing works? As long as your primers are really sitting down where you believe they should be, and nowhere else, you will get the right sequence. It might not be pretty, but there's no bias towards the letter in your reference. How would the enzyme know what is the reference letter, and what is the variant you expect to see?

          Comment

          • Jane M
            Senior Member
            • Aug 2011
            • 239

            #6
            Thank you both for your answers!

            brofallon, when you get or suspect false positives, what do you do? Do you resequence with an other enzyme or by changing something else? Or you don't further study this SNP?

            swbarnes2, I know a bit how Sanger sequencing works. I understand that there is no bias towards the letter in the reference. I just need an idea about how frequent are the false positives with Sanger sequencing.
            My question is: are we always able to detect when Sanger sequencing doesn't work, so that if we do not validate a SNP, we cannot conclude that there is no SNPs. We can only say that Sanger sequencing didn't validate or invalidate this variant?

            If I understand correctly the point of view of brofallon, he doesn't recommend to use Ion Torrent to validate variants. I would like to know if this technology has already been used in this case (instead of Sanger sequencing). Did you read some papers about it?

            Comment

            • gsgs
              Senior Member
              • Oct 2009
              • 139

              #7
              as I understand, you get multiple partial sequences of length ~500 and they overlap
              and each SNP position is covered -say- 10 times. Now you can have one nucleotide
              there -say- 8 times and another one 2 times.
              Assume one read gives us 90% probability then 8:2 should give 99.99996%
              if I calculated correctly.
              Provided that the reads are independent and there is no systematic error,
              some constellation that makes the error more likely ...
              ...............
              I don't know. There are probably lots of statistics available already
              and papers written about it

              Comment

              • Jane M
                Senior Member
                • Aug 2011
                • 239

                #8
                I guess this answer was not intended for this topic ...?

                Comment

                • swbarnes2
                  Senior Member
                  • May 2008
                  • 910

                  #9
                  Originally posted by Jane M View Post
                  Thank you both for your answers!

                  brofallon, when you get or suspect false positives, what do you do? Do you resequence with an other enzyme or by changing something else? Or you don't further study this SNP?

                  swbarnes2, I know a bit how Sanger sequencing works. I understand that there is no bias towards the letter in the reference. I just need an idea about how frequent are the false positives with Sanger sequencing.
                  Okay, but you did ask repeatedly about the false negative rate. But no, if your primers are sitting down where they are supposed to be, the only way for you to get the wrong sequence would be a freak polymerase error. The odds of that happening exactly on the base that you care about are tiny. It's not going to happen.

                  Now, you might get poor quality data, but traces come with quality scores. If your sequences blasts to where it ought to, and the quality around your base is good, then that's what your letter is. If you are trying to make mixed calls, that's trickier, but people have been dealing with the quality of sanger trace files for more than 10 years. It's ground that is well trod.

                  Comment

                  • brofallon
                    Member
                    • May 2011
                    • 26

                    #10
                    For our part, if we find a questionable variant in NGS data, and then Sanger sequence the area and find that the variant is not there, we typically assume the variant is a false positive and go no further. As swbarnes mentioned, if we have reason to believe the Sanger wasn't of good quality, we might redesign the primers and try it again, but as long as things don't seem too suspicious we trust the Sanger results.

                    Comment

                    • gsgs
                      Senior Member
                      • Oct 2009
                      • 139

                      #11
                      what's your statistical error-rate ?
                      you or others must have measured that ...

                      where to find a list of labs with their error-rates

                      Comment

                      Latest Articles

                      Collapse

                      • seqadmin
                        Pathogen Surveillance with Advanced Genomic Tools
                        by seqadmin




                        The COVID-19 pandemic highlighted the need for proactive pathogen surveillance systems. As ongoing threats like avian influenza and newly emerging infections continue to pose risks, researchers are working to improve how quickly and accurately pathogens can be identified and tracked. In a recent SEQanswers webinar, two experts discussed how next-generation sequencing (NGS) and machine learning are shaping efforts to monitor viral variation and trace the origins of infectious...
                        03-24-2025, 11:48 AM
                      • seqadmin
                        New Genomics Tools and Methods Shared at AGBT 2025
                        by seqadmin


                        This year’s Advances in Genome Biology and Technology (AGBT) General Meeting commemorated the 25th anniversary of the event at its original venue on Marco Island, Florida. While this year’s event didn’t include high-profile musical performances, the industry announcements and cutting-edge research still drew the attention of leading scientists.

                        The Headliner
                        The biggest announcement was Roche stepping back into the sequencing platform market. In the years since...
                        03-03-2025, 01:39 PM

                      ad_right_rmr

                      Collapse

                      News

                      Collapse

                      Topics Statistics Last Post
                      Started by seqadmin, Today, 12:59 PM
                      0 responses
                      6 views
                      0 reactions
                      Last Post seqadmin  
                      Started by seqadmin, Yesterday, 10:17 AM
                      0 responses
                      7 views
                      0 reactions
                      Last Post seqadmin  
                      Started by seqadmin, 03-20-2025, 05:03 AM
                      0 responses
                      49 views
                      0 reactions
                      Last Post seqadmin  
                      Started by seqadmin, 03-19-2025, 07:27 AM
                      0 responses
                      60 views
                      0 reactions
                      Last Post seqadmin  
                      Working...