  • Average nucleotide identity

    Hi all,
    I'm new to the forum and working on comparative genomics.

    I'm comparing around 50 bacterial genomes, most of which contain plasmids and a few of which don't.

    When I calculate average nucleotide identity (ANI) using OrthoANI, should I use only the chromosomal sequence, or the chromosome and plasmids together?

  • #2
    Depends on your goal, but... I'd use everything. I don't see that it matters whether a gene is on a plasmid or not when calculating ANI. It gets a bit more complicated when you have multiple plasmid copies, but for simplicity, I'd just calculate the ANI from the full haploid genome representation.

    • #3
      Thank you for your reply.
      Yes, you are right: some strains have 15 or 20 plasmids and some have none at all. That is why I would like to know whether it would be reasonable to use only the chromosome rather than both the chromosome and the plasmids.

      • #4
        There are lots of ways of calculating ANI. The best one depends on your goal. For example, if you want to say "These two bacteria are really closely related" then probably just the main chromosome is important, since plasmids can come and go pretty rapidly. If you want to say "These two bacteria are behaviorally similar" then you need to include all the plasmids as well. Note that ANI is not a sufficient metric in the latter case; you also need to calculate completeness. ANI generally only factors in things that align, so you might get 100% ANI between human chromosome 1 and the full human genome, but that does not mean they are equivalent.
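
        To make the identity-versus-completeness point concrete, here is a toy sketch in Python. It is not how OrthoANI or CompareSketch compute anything: shared k-mers stand in for "things that align", and the sequences, the k value, and the function names are all made up for illustration.

```python
def kmers(seq, k=4):
    """Return the set of all k-length substrings of seq."""
    return {seq[i:i + k] for i in range(len(seq) - k + 1)}


def shared_fraction(query, reference, k=4):
    """Fraction of the query's k-mers that also occur in the reference."""
    q, r = kmers(query, k), kmers(reference, k)
    return len(q & r) / len(q) if q else 0.0


# Hypothetical toy sequences: the fragment is a strict piece of the full genome,
# like one chromosome compared against a whole genome.
full_genome = "ACGTTGCAAGGCTTACCGATGGCATTCAGG"
fragment = full_genome[:15]

# Everything in the fragment is found in the full genome (identity-like score).
identity_like = shared_fraction(fragment, full_genome)

# Only part of the full genome is found in the fragment (completeness-like score).
completeness_like = shared_fraction(full_genome, fragment)

print(f"identity-like: {identity_like:.2f}")
print(f"completeness-like: {completeness_like:.2f}")
```

        The first score comes out perfect while the second does not, which is the reason to report both numbers when plasmid content differs between strains.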

        I encourage you to try BBMap's CompareSketch:

        First, for each genome fasta, run fuse.sh on it to combine the contigs into a single sequence, which makes the all-to-all comparison run at a per-genome rather than a per-sequence level (I'll probably make that automatic at some point). Then:

        Code:
        comparesketch.sh *.fasta alltoall records=100
        That does an all-to-all comparison and reports both ANI and completeness. It's alignment-free and will give different results from alignment-based methods (well, all ANI calculation methods will give different results) but it's useful in that it reports completeness also. A bacterium with 100% ANI and 90% genome completeness compared to another bacterium will be missing some functionality, even though they are very closely related.
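
        The per-genome fusing step described above could be scripted along these lines. This is only a sketch: it assumes fuse.sh accepts the usual BBTools in=/out= parameters (check the script's own help text), and the file names and the fused/ output directory are hypothetical. It only builds the command lines; nothing is executed.

```python
from pathlib import Path


def fuse_commands(fasta_paths, out_dir="fused"):
    """Build one fuse.sh command line per genome FASTA (nothing is executed)."""
    commands = []
    for path in map(Path, fasta_paths):
        # Assumption: fuse.sh takes in=/out= like other BBTools scripts.
        out_path = Path(out_dir) / path.name
        commands.append(
            ["fuse.sh", f"in={path.as_posix()}", f"out={out_path.as_posix()}"]
        )
    return commands


# Hypothetical input files, one per genome.
cmds = fuse_commands(["strainA.fasta", "strainB.fasta"])
for cmd in cmds:
    print(" ".join(cmd))
```

        Each list can be passed to subprocess.run(cmd, check=True) once the output directory exists, after which the comparesketch.sh command above would be run on the fused files instead of the originals.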
        Last edited by Brian Bushnell; 07-14-2017, 10:28 AM.
