An integrative approach to DNA barcoding, geometric morphometrics, and machine learning for field identification of Culex mosquitoes (Diptera: Culicidae), with implications for vector-borne disease surveillance

dc.contributor.authorLaojun S.
dc.contributor.authorChangbunjong T.
dc.contributor.authorKamoltham T.
dc.contributor.authorChaiphongpachara T.
dc.contributor.correspondenceLaojun S.
dc.contributor.otherMahidol University
dc.date.accessioned2025-11-04T18:24:42Z
dc.date.available2025-11-04T18:24:42Z
dc.date.issued2025-11-01
dc.description.abstractCulex mosquitoes are of considerable medical and veterinary importance, acting as vectors of arboviruses such as Japanese encephalitis, Rift Valley fever, and West Nile virus, as well as the filarial parasite Wuchereria bancrofti. Accurate identification of Culex species, however, remains challenging due to their close morphological similarity, frequent damage to field-collected specimens, and the limited availability of trained taxonomists. To address these challenges, this study employed an integrative framework combining DNA barcoding, wing geometric morphometrics (GM), and Random Forest (RF) to improve the identification of 12 Culex species (Cx. bicornutus, Cx. bitaeniorhynchus, Cx. brevipalpis, Cx. fuscocephala, Cx. gelidus, Cx. hutchinsoni, Cx. nigropunctatus, Cx. pseudovishnui, Cx. quinquefasciatus, Cx. sinensis, Cx. sitiens, and Cx. tritaeniorhynchus) in Thailand. DNA barcoding successfully validated the morphological identifications, with nucleotide sequences from representative specimens showing strong concordance with the GenBank and Barcode of Life Data Systems (BOLD) databases (≥96 %), confirming the reliability of morphological diagnoses. Complementarily, wing GM demonstrated stronger discriminatory power: Mahalanobis distance analysis revealed all species to be significantly different (p < 0.05), and a cross-validated reclassification test achieved 82.18 % performance with an adjusted total accuracy of 80 %. For field identification of unknown specimens, both Mahalanobis distance and RF produced comparable results, yielding very high accuracy (80 %–100 %) for eight species. Overall, the integration of DNA barcoding, wing GM, and machine learning offers a robust and practical framework for strengthening mosquito-borne disease surveillance. Nonetheless, as each method has distinct strengths and limitations, their application should be carefully adapted to specific epidemiological and operational contexts.
dc.identifier.citationActa Tropica Vol.271 (2025)
dc.identifier.doi10.1016/j.actatropica.2025.107885
dc.identifier.eissn18736254
dc.identifier.issn0001706X
dc.identifier.scopus2-s2.0-105020039942
dc.identifier.urihttps://repository.li.mahidol.ac.th/handle/123456789/112914
dc.rights.holderSCOPUS
dc.subjectMedicine
dc.subjectImmunology and Microbiology
dc.titleAn integrative approach to DNA barcoding, geometric morphometrics, and machine learning for field identification of Culex mosquitoes (Diptera: Culicidae), with implications for vector-borne disease surveillance
dc.typeArticle
mu.datasource.scopushttps://www.scopus.com/inward/record.uri?partnerID=HzOxMe3b&scp=105020039942&origin=inward
oaire.citation.titleActa Tropica
oaire.citation.volume271
oairecerif.author.affiliationMahidol University
oairecerif.author.affiliationSuan Sunandha Rajabhat University

Files

Collections