Unraveling the drivers of leptospirosis risk in Thailand using machine learning

dc.contributor.authorSuttirat P.
dc.contributor.authorChadsuthi S.
dc.contributor.authorModchang C.
dc.contributor.authorRocklöv J.
dc.contributor.correspondenceSuttirat P.
dc.contributor.otherMahidol University
dc.date.accessioned2025-11-01T18:06:48Z
dc.date.available2025-11-01T18:06:48Z
dc.date.issued2025-10-01
dc.description.abstractLeptospirosis poses a significant public health challenge in Thailand, driven by a complex mix of environmental and socioeconomic factors. This study develops an XGBoost machine learning model to predict leptospirosis outbreak risk at the provincial level in Thailand, integrating climatic, socioeconomic, and agricultural features. Using national surveillance data from 2007-2022, the model was trained to classify provinces as high or low risk based on the median incidence rate. The model's predictive performance was validated for the years 2018-2022, spanning pre-COVID-19, COVID-19, and post-COVID-19 periods. SHapley Additive exPlanation (SHAP) analysis was employed to identify key predictive factors. The optimized XGBoost model achieved high predictive accuracy for the pre-pandemic (AUC = 0.937 with 95% CI: 0.878 - 0.976) and post-pandemic (AUC = 0.951 with 95% CI: 0.861 - 0.999) testing periods. SHAP analysis revealed rice production factors, household size, and specific climatic variables as the strongest predictors of leptospirosis risk. However, model performance declined during the COVID-19 pandemic (2020-2021), suggesting surveillance disruption and potential underreporting. This study demonstrates the utility of machine learning for predicting leptospirosis risk in Thailand and highlights the complex interplay of environmental and socioeconomic factors in driving outbreaks. The adaptable modeling framework provides a foundation for developing early warning systems and targeted interventions to reduce the burden of this neglected tropical disease.
dc.identifier.citationPlos Neglected Tropical Diseases Vol.19 No.10 (2025) , e0013618
dc.identifier.doi10.1371/journal.pntd.0013618
dc.identifier.eissn19352735
dc.identifier.pmid41086238
dc.identifier.scopus2-s2.0-105019818472
dc.identifier.urihttps://repository.li.mahidol.ac.th/handle/123456789/112880
dc.rights.holderSCOPUS
dc.subjectMedicine
dc.titleUnraveling the drivers of leptospirosis risk in Thailand using machine learning
dc.typeArticle
mu.datasource.scopushttps://www.scopus.com/inward/record.uri?partnerID=HzOxMe3b&scp=105019818472&origin=inward
oaire.citation.issue10
oaire.citation.titlePlos Neglected Tropical Diseases
oaire.citation.volume19
oairecerif.author.affiliationUniversität Heidelberg
oairecerif.author.affiliationUmeå Universitet
oairecerif.author.affiliationFaculty of Science, Mahidol University
oairecerif.author.affiliationNaresuan University
oairecerif.author.affiliationCentre of Excellence in Mathematics

Files

Collections