GRAViTy-V2: A grounded viral taxonomy application

dc.contributor.authorMayne R.
dc.contributor.authorAiewsakun P.
dc.contributor.authorTurner D.
dc.contributor.authorAdriaenssens E.M.
dc.contributor.authorSimmonds P.
dc.contributor.correspondenceMayne R.
dc.contributor.otherMahidol University
dc.date.accessioned2025-01-01T18:28:11Z
dc.date.available2025-01-01T18:28:11Z
dc.date.issued2024-12-01
dc.description.abstractTaxonomic classification of viruses is essential for understanding their evolution. Genomic classification of viruses at higher taxonomic ranks, such as order or phylum, is typically based on alignment and comparison of amino acid sequence motifs in conserved genes. Classification at lower taxonomic ranks, such as genus or species, is usually based on nucleotide sequence identities between genomic sequences. Building on our whole-genome analytical classification framework, we here describe Genome Relationships Applied to Viral Taxonomy Version 2 (GRAViTy-V2), which encompasses a greatly expanded range of features and numerous optimisations, packaged as an application that may be used as a general-purpose virus classification tool. Using 28 datasets derived from the ICTV 2022 taxonomy proposals, GRAViTy-V2 output was compared against human expert-curated classifications used for assignments in the 2023 round of ICTV taxonomy changes. GRAViTy-V2 produced taxonomies equivalent to manually-curated versions down to the family level and in almost all cases, to genus and species levels. The majority of discrepant results arose from errors in coding sequence annotations in INDSC records, or from inclusion of incomplete genome sequences in the analysis. Analysis times ranged from 1-506 min (median 3.59) on datasets with 17-1004 genomes and mean genome length of 3000-1 000 000 bases.
dc.identifier.citationNAR Genomics and Bioinformatics Vol.6 No.4 (2024)
dc.identifier.doi10.1093/nargab/lqae183
dc.identifier.eissn26319268
dc.identifier.scopus2-s2.0-85213004600
dc.identifier.urihttps://repository.li.mahidol.ac.th/handle/123456789/102584
dc.rights.holderSCOPUS
dc.subjectMathematics
dc.subjectBiochemistry, Genetics and Molecular Biology
dc.subjectComputer Science
dc.titleGRAViTy-V2: A grounded viral taxonomy application
dc.typeArticle
mu.datasource.scopushttps://www.scopus.com/inward/record.uri?partnerID=HzOxMe3b&scp=85213004600&origin=inward
oaire.citation.issue4
oaire.citation.titleNAR Genomics and Bioinformatics
oaire.citation.volume6
oairecerif.author.affiliationFaculty of Science, Mahidol University
oairecerif.author.affiliationUniversity of the West of England
oairecerif.author.affiliationQuadram Institute Bioscience
oairecerif.author.affiliationNuffield Department of Medicine

Files

Collections