  • sankarithirumal@gmail.com
    Junior Member
    • Jul 2017
    • 2

    Average nucleotide identity

    Hi all,
    I'm new to the forum and working on comparative genomics.

    I'm comparing around 50 bacterial genomes, most of which contain plasmids and a few of which don't.

    I would like to know: when I calculate average nucleotide identity (ANI) using OrthoANI, should I use only the chromosomal sequence, or the chromosome and plasmids together?
  • Brian Bushnell
    Super Moderator
    • Jan 2014
    • 2709

    #2
    Depends on your goal, but... I'd use everything. I don't see that it matters whether a gene is on a plasmid or not when calculating ANI. It gets a bit more complicated when you have multiple plasmid copies, but for simplicity, I'd just calculate the ANI from the full haploid genome representation.


    • sankarithirumal@gmail.com
      Junior Member
      • Jul 2017
      • 2

      #3
      Thank you for your reply.
      Yes, you are right: some strains have 15 or 20 plasmids and some have none at all. That is why I would like to know whether it is reasonable to use only the chromosome rather than the chromosome and plasmids together.


      • Brian Bushnell
        Super Moderator
        • Jan 2014
        • 2709

        #4
        There are lots of ways of calculating ANI. The best one depends on your goal. For example, if you want to say "These two bacteria are really closely related" then probably just the main chromosome is important, since plasmids can come and go pretty rapidly. If you want to say "These two bacteria are behaviorally similar" then you need to include all the plasmids as well. Note that ANI is not a sufficient metric in the latter case; you also need to calculate completeness. ANI generally only factors in things that align, so you might get 100% ANI between human chromosome 1 and the full human genome, but that does not mean they are equivalent.
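
        To make the ANI-versus-completeness point concrete, here is a toy calculation, not how any particular tool computes these numbers. It assumes you already have a list of aligned blocks as "aligned_length percent_identity" pairs; the function name ani_and_coverage and the "coverage" definition (aligned bases divided by genome length) are made up for illustration.

```shell
# Toy sketch: length-weighted identity over aligned blocks, plus the
# fraction of the genome those blocks cover.
# $1    = total length of the genome being compared against (bp)
# stdin = one aligned block per line: "aligned_length percent_identity"
ani_and_coverage() {
    awk -v total="$1" '
        { aligned += $1; weighted += $1 * $2 }
        END {
            if (aligned == 0) { print "ANI=NA coverage=0.0"; exit }
            # ANI only looks at what aligned; coverage shows how much was left out.
            printf "ANI=%.1f coverage=%.1f\n", weighted / aligned, 100 * aligned / total
        }'
}

# One perfect 1000 bp block against a 10000 bp genome:
printf '1000 100\n' | ani_and_coverage 10000
# prints: ANI=100.0 coverage=10.0
```

        The identity is perfect even though 90% of the larger genome has no counterpart, which is why a second number is needed alongside ANI.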

        I encourage you to try BBMap's CompareSketch:

        First, for each genome fasta, run fuse.sh on it to combine the contigs into a single sequence, which makes the all-to-all comparison run at a per-genome rather than a per-sequence level (I'll probably make that automatic at some point). Then:

        Code:
        comparesketch.sh *.fasta alltoall records=100
        That does an all-to-all comparison and reports both ANI and completeness. It's alignment-free and will give different results from alignment-based methods (well, all ANI calculation methods will give different results) but it's useful in that it reports completeness also. A bacterium with 100% ANI and 90% genome completeness compared to another bacterium will be missing some functionality, even though they are very closely related.
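
        The two steps above (fuse each genome, then compare) could be scripted roughly as follows. This is a sketch under assumptions: it presumes BBMap's fuse.sh and comparesketch.sh are on your PATH and that every genome is a .fasta file in one directory. The function names, the DRY_RUN switch, and the genomes/ and fused/ directory names are hypothetical, not part of BBMap.

```shell
# Print the command when DRY_RUN=1, otherwise execute it.
run() {
    if [ "${DRY_RUN:-0}" = 1 ]; then
        echo "$*"
    else
        "$@"
    fi
}

# $1 = directory of per-genome fasta files, $2 = directory for fused output.
fuse_and_compare() {
    indir=$1
    outdir=$2
    mkdir -p "$outdir"
    for fa in "$indir"/*.fasta; do
        [ -e "$fa" ] || continue   # skip if the glob matched nothing
        # Combine all contigs of one genome into a single sequence.
        run fuse.sh in="$fa" out="$outdir/$(basename "$fa")"
    done
    # All-to-all comparison on the fused genomes, as in the command above.
    run comparesketch.sh "$outdir"/*.fasta alltoall records=100
}
```

        Running `DRY_RUN=1 fuse_and_compare genomes fused` first prints the commands without executing them, so you can check the file names before committing to the real run.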
        Last edited by Brian Bushnell; 07-14-2017, 10:28 AM.
