Classifying DNA barcode sequences of four insects belonging to Orthoptera order using tensor network

dc.contributor.authorBhadola P.
dc.contributor.authorGupta Y.M.
dc.contributor.otherMahidol University
dc.date.accessioned2023-06-18T16:35:41Z
dc.date.available2023-06-18T16:35:41Z
dc.date.issued2022-07-01
dc.description.abstractImportance of the work: Orthoptera species are one of the most rapidly increasing groups of insects being used as food and feed. However, identifying edible insects can be difficult due to their small size and the similar morphological features in closely related species. Therefore, classification of insects is often conducted by amplifying their DNA barcode sequence and comparing it with databases containing reference sequences. However, the absence of reference DNA sequences (such as cytochrome c oxidase subunit I (COI)) may confound predictions of the taxonomic community of interest and make it difficult to characterize biodiversity from DNA samples. Objective: To develop a quantum-inspired tensor network-based machine-learning model to categorize COI sequences for four insects belonging to the Orthoptera order. Materials & Methods: For alignment-free classification, each DNA barcode was represented as a tensor product of k-mers encoded in a D-dimensional space, which acts as the feature map and input for a tensor network layer for the classification. The developed model was tested with two different numbers of tensor units as well as different k-mer sizes. Results: The presented model was effective for making accurate predictions for unseen DNA barcodes and can be generalized for any DNA/RNA sequence categorization. The tensor network classifier could assign COI sequences of varying lengths to four different classes with an accuracy greater than 99% and with fewer hyper-parameters. Main finding: The developed model is free and publicly available through GitHub: https://github.com/yashmgupta/DNA-barcode-sequence-classification.
dc.identifier.citationAgriculture and Natural Resources Vol.56 No.4 (2022) , 705-712
dc.identifier.doi10.34044/J.ANRES.2022.56.4.05
dc.identifier.eissn2452316X
dc.identifier.issn24681458
dc.identifier.scopus2-s2.0-85139108861
dc.identifier.urihttps://repository.li.mahidol.ac.th/handle/20.500.14594/83187
dc.rights.holderSCOPUS
dc.subjectAgricultural and Biological Sciences
dc.titleClassifying DNA barcode sequences of four insects belonging to Orthoptera order using tensor network
dc.typeArticle
mu.datasource.scopushttps://www.scopus.com/inward/record.uri?partnerID=HzOxMe3b&scp=85139108861&origin=inward
oaire.citation.endPage712
oaire.citation.issue4
oaire.citation.startPage705
oaire.citation.titleAgriculture and Natural Resources
oaire.citation.volume56
oairecerif.author.affiliationNaresuan University
oairecerif.author.affiliationMahidol University

Files

Collections