Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • how to construct phylogenetic tree using SNPs

    hi,
    i am puzzled of how to construct phylogenetic tree using SNPs from re-sequence data.

    may i extract validated SNPs of all to generate a artificial sequence and import it to MEGA or other programes?

    any details will be appreciated!

  • #2
    "may i extract validated SNPs of all to generate a artificial sequence and import it to MEGA or other programes?"

    That's exactly what you can do. It's not really even an artificial sequence, since the SNPs, by definition, are the informative sites that a phylogeny is based upon, i.e. all the other sequence would be is filler that is filtered out of by the programs.

    The only issue with just using the SNPs outside of a genomic context is that you will not know where they are in the codons and thus cannot use amino acid sequences for any analyse,s or know if a SNP represents a synonymous change (amino acid remains the same) or non-synonymous (changes amino acid sequence). Knowing if there is an amino acid change can be important for functional/medical genomics, for more complex codon-based models use to build phylogenies, and for selecting neutral (non-synonymous) changes to look at analyses using the molecular clock.

    Comment


    • #3
      I found this link the other day that constructs a phylogeny from SNPs in kmers:

      An Open Access Publisher & Scientific Events Organizer and other Open Access Resources


      You have to ask the author for the source code, but it appears to work with both assemblies and raw sequence reads.

      Comment


      • #4
        Originally posted by zmartine View Post
        "may i extract validated SNPs of all to generate a artificial sequence and import it to MEGA or other programes?"

        That's exactly what you can do. It's not really even an artificial sequence, since the SNPs, by definition, are the informative sites that a phylogeny is based upon, i.e. all the other sequence would be is filler that is filtered out of by the programs.

        The only issue with just using the SNPs outside of a genomic context is that you will not know where they are in the codons and thus cannot use amino acid sequences for any analyse,s or know if a SNP represents a synonymous change (amino acid remains the same) or non-synonymous (changes amino acid sequence). Knowing if there is an amino acid change can be important for functional/medical genomics, for more complex codon-based models use to build phylogenies, and for selecting neutral (non-synonymous) changes to look at analyses using the molecular clock.
        well, thanks.
        but will you give more details about constructing tree like how the SNPs and its position in chromosome are orgnized?

        Comment


        • #5
          Originally posted by themerlin View Post
          I found this link the other day that constructs a phylogeny from SNPs in kmers:

          An Open Access Publisher & Scientific Events Organizer and other Open Access Resources


          You have to ask the author for the source code, but it appears to work with both assemblies and raw sequence reads.
          that is out of my understanding

          Comment


          • #6
            Would it make sense to generate n artificial sequences with the SNPs and their codon contexts for
            all strains of interest ?

            Alternatively, I guess SNPs can be used as characters in a parsimony based approach ? Has anyone
            tried this ?

            Lastly, this program may also be helpful.

            Comment


            • #7
              Originally posted by zmartine View Post
              "may i extract validated SNPs of all to generate a artificial sequence and import it to MEGA or other programes?"

              That's exactly what you can do. It's not really even an artificial sequence, since the SNPs, by definition, are the informative sites that a phylogeny is based upon, i.e. all the other sequence would be is filler that is filtered out of by the programs.
              That's only true for maximum parsimony. Maximum likelihood and Bayesian methods consider invariant sites.

              Comment


              • #8
                See http://www.pnas.org/content/early/2009/10/21/0904691106 for an example of using maximum parsimony to analyze SNP data.

                Comment


                • #9
                  Hi,
                  No idea how construct phylogenetic tree using SNPs ???
                  MM

                  Comment


                  • #10
                    The workflow for creating phylogenetic tree by SNPs

                    Dear all,

                    I will really appreciate if anyone would share their workflow for creating the phylogenetic tree by SNPs? Is there any commercial software that can do it?

                    Thank you very much!

                    Victor

                    Comment


                    • #11
                      Haven't used it yet, but

                      Comment

                      Latest Articles

                      Collapse

                      • seqadmin
                        Essential Discoveries and Tools in Epitranscriptomics
                        by seqadmin




                        The field of epigenetics has traditionally concentrated more on DNA and how changes like methylation and phosphorylation of histones impact gene expression and regulation. However, our increased understanding of RNA modifications and their importance in cellular processes has led to a rise in epitranscriptomics research. “Epitranscriptomics brings together the concepts of epigenetics and gene expression,” explained Adrien Leger, PhD, Principal Research Scientist...
                        04-22-2024, 07:01 AM
                      • seqadmin
                        Current Approaches to Protein Sequencing
                        by seqadmin


                        Proteins are often described as the workhorses of the cell, and identifying their sequences is key to understanding their role in biological processes and disease. Currently, the most common technique used to determine protein sequences is mass spectrometry. While still a valuable tool, mass spectrometry faces several limitations and requires a highly experienced scientist familiar with the equipment to operate it. Additionally, other proteomic methods, like affinity assays, are constrained...
                        04-04-2024, 04:25 PM

                      ad_right_rmr

                      Collapse

                      News

                      Collapse

                      Topics Statistics Last Post
                      Started by seqadmin, Today, 08:47 AM
                      0 responses
                      10 views
                      0 likes
                      Last Post seqadmin  
                      Started by seqadmin, 04-11-2024, 12:08 PM
                      0 responses
                      60 views
                      0 likes
                      Last Post seqadmin  
                      Started by seqadmin, 04-10-2024, 10:19 PM
                      0 responses
                      57 views
                      0 likes
                      Last Post seqadmin  
                      Started by seqadmin, 04-10-2024, 09:21 AM
                      0 responses
                      53 views
                      0 likes
                      Last Post seqadmin  
                      Working...
                      X