Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • how to tell SNPs synonymous v. nonsynonymous

    Hi,
    I have aligned my Illumina reads to hg19 and used SAMtools to give me a list of SNPs. I uploaded the list of SNPs to Galaxy and used tools there to filter out all dbSNPs and to select only the SNPs that are in coding exons. Now I would like to determine whether the each SNP is synonymous or nonsynonymous. Is there an automated way of doing this? It will take me a long time if I need to enter each SNP into the UCSC browser to see whether it changes the amino acid.

  • #2
    I have the same problem as well

    Comment


    • #3
      Finding the good ones

      Originally posted by Margot View Post
      Hi,
      I have aligned my Illumina reads to hg19 and used SAMtools to give me a list of SNPs. I uploaded the list of SNPs to Galaxy and used tools there to filter out all dbSNPs and to select only the SNPs that are in coding exons. Now I would like to determine whether the each SNP is synonymous or nonsynonymous. Is there an automated way of doing this? It will take me a long time if I need to enter each SNP into the UCSC browser to see whether it changes the amino acid.
      Hi Margot,

      I use SIFT (http://sift.jcvi.org/) to do this and I check it does support hg19/GRCh37. It works relatively fast and is fairly user friendly, though I wish they didn't use * for nonsense codons...really messes up my database but you can replace them in excel using find/replace with find=~* and replace=X

      Hope that helps,

      Jonathan

      Comment


      • #4
        another tool is http://gvs.gs.washington.edu/SeattleSeqAnnotation/

        Comment


        • #5
          Looks like the SeattleSeqAnnotation only supports NCBI36/hg18 1-indexed and doesn't support the new GRCh37/hg19 genome build, but that was only on a quick check so I might be wrong. Anyone with experience with both?

          Comment


          • #6
            As far as I know you are right see http://gvs.gs.washington.edu/Seattle...lpHowToUse.jsp

            Comment


            • #7
              Thanks for the help!

              Comment


              • #8
                I would suggest you to write a program by yourself.

                By definition, "synonymous" means that a change in a nucleotide position doesn't change the translated amino acid, whereas "non-synonymous" changes. So you need to decide the translational frame of the exon and the codon usage table of your organism and organelle, before you can determine whether each SNP is "synonymous" or "non-synonymous".
                Website: http://www.fengfengzhou.org/FengfengZhou/

                Comment


                • #9
                  Thanks for the information which I'm looking for. Sift needs SNPs' direction while gvs only needs ref_allele, first/second allele, no strand required.

                  samtools pileup does not provide strand information, does it matter that I just give 1 or -1 for sift to run? thanks.

                  Comment


                  • #10
                    Originally posted by bair View Post
                    Thanks for the information which I'm looking for. Sift needs SNPs' direction while gvs only needs ref_allele, first/second allele, no strand required.

                    samtools pileup does not provide strand information, does it matter that I just give 1 or -1 for sift to run? thanks.
                    For SIFT you can just mark all SNPs as forward strand if the strand is not known.

                    "Orientation:
                    Use 1 for positve strand and -1 for negative strand. If orientation is not known, use 1 as default."
                    The one trick I noticed is that you must always include the known reference allele as the first allele in the allele column otherwise it defaults to reporting a synonymous SNP. I caught this issue after two samples had the identical mutation but one was heterozygous with the reference allele (Call nonsynonymous) and the other was homozygous for the mutation (Called synonymous).

                    Comment


                    • #11
                      Thank you Jon, the trick is very helpful, I need to be careful to modify my input file.
                      Actually I have not setup sift yet, may worth to compare the outputs from sift and gvs.

                      Comment


                      • #12
                        Originally posted by Margot View Post
                        I uploaded the list of SNPs to Galaxy and used tools there to filter out all dbSNPs and to select only the SNPs that are in coding exons. Now I would like to determine whether the each SNP is synonymous or nonsynonymous.
                        Late answer I know, but this is a very common requirement so this may help someone. Unfortunately Galaxy lacks a simple SNV annotation tool, but with some work you *can* separate your synonymous and nonsynonymous Galaxy (hint: read the help text for the "Mutate Codons" tool).

                        Or you could use a variant annotation tool e.g.:
                        http://gvs.gs.washington.edu/SeattleSeqAnnotation/
                        http://www.svaproject.org/
                        http://www.openbioinformatics.org/annovar/
                        http://seqant.genetics.emory.edu/
                        http://www.ualberta.ca/~stothard/dow...SNP/index.html

                        Comment


                        • #13
                          Originally posted by BetterPrimate View Post
                          Late answer I know, but this is a very common requirement so this may help someone. Unfortunately Galaxy lacks a simple SNV annotation tool, but with some work you *can* separate your synonymous and nonsynonymous Galaxy (hint: read the help text for the "Mutate Codons" tool).

                          Or you could use a variant annotation tool e.g.:
                          http://gvs.gs.washington.edu/SeattleSeqAnnotation/
                          http://www.svaproject.org/
                          http://www.openbioinformatics.org/annovar/
                          http://seqant.genetics.emory.edu/
                          http://www.ualberta.ca/~stothard/dow...SNP/index.html
                          Thank you very much.

                          Comment


                          • #14
                            Use the latest ANNOVAR program... it's great!

                            Comment


                            • #15
                              The ensembl snp effect predictor is very good for this

                              http://bioinformatics.oxfordjournals...6/16/2069.full
                              http://www.ensembl.org/Homo_sapiens/...loadVariations

                              Comment

                              Latest Articles

                              Collapse

                              • seqadmin
                                Investigating the Gut Microbiome Through Diet and Spatial Biology
                                by seqadmin




                                The human gut contains trillions of microorganisms that impact digestion, immune functions, and overall health1. Despite major breakthroughs, we’re only beginning to understand the full extent of the microbiome’s influence on health and disease. Advances in next-generation sequencing and spatial biology have opened new windows into this complex environment, yet many questions remain. This article highlights two recent studies exploring how diet influences microbial...
                                02-24-2025, 06:31 AM
                              • seqadmin
                                Quality Control Essentials for Next-Generation Sequencing Workflows
                                by seqadmin




                                Like all molecular biology applications, next-generation sequencing (NGS) workflows require diligent quality control (QC) measures to ensure accurate and reproducible results. Proper QC begins at nucleic acid extraction and continues all the way through to data analysis. This article outlines the key QC steps in an NGS workflow, along with the commonly used tools and techniques.

                                Nucleic Acid Quality Control
                                Preparing for NGS starts with isolating the...
                                02-10-2025, 01:58 PM

                              ad_right_rmr

                              Collapse

                              News

                              Collapse

                              Topics Statistics Last Post
                              Started by seqadmin, 03-03-2025, 01:15 PM
                              0 responses
                              151 views
                              0 likes
                              Last Post seqadmin  
                              Started by seqadmin, 02-28-2025, 12:58 PM
                              0 responses
                              229 views
                              0 likes
                              Last Post seqadmin  
                              Started by seqadmin, 02-24-2025, 02:48 PM
                              0 responses
                              599 views
                              0 likes
                              Last Post seqadmin  
                              Started by seqadmin, 02-21-2025, 02:46 PM
                              0 responses
                              262 views
                              0 likes
                              Last Post seqadmin  
                              Working...
                              X