Prognostic model of complications in type 2 diabetes : traditional statistical model versus machine learning
| dc.contributor.advisor | Oraluck Pattanaprateep | |
| dc.contributor.advisor | Ammarin Thakkinstian | |
| dc.contributor.advisor | Anuchate Pattanateepapon | |
| dc.contributor.advisor | Wichai Aekplakorn | |
| dc.contributor.author | Sigit Ari Saputro, 1990- | |
| dc.date.accessioned | 2026-02-06T07:50:51Z | |
| dc.date.available | 2026-02-06T07:50:51Z | |
| dc.date.copyright | 2021 | |
| dc.date.created | 2026 | |
| dc.date.issued | 2021 | |
| dc.description.abstract | Diabetes and its complications have become the major public health concerns globally. Many prognostic models of diabetic complications have been developed, but their performances are still varied. Only limited studies derived the models by using the appropriate methods and were satisfactorily reported. The accurate prognostic model related risk factors are essentially needed for early detection and prevention strategies notably for Type 2 diabetic (T2D) patients. This study aimed to validate the existing models that have been published in the literature and develop a new model based on the traditional statistical model and machine learnings (MLs). This study was a retrospective cross-sectional design. We obtained the nationwide survey data and linked it with the standard hospital databases. We synthesized the existing model by the recent systematic review. Traditional metrics (i.e., discrimination, calibration, and reclassification) were compared to evaluate their predictive performances. We updated the equations and developed the new models using the substantial prognostic factors for adjusting the model performance in our settings. A total of 2, 3, 3, and 4 prognostic studies for developing DR, CKD, ESRD and CVDs were included into external validations. We updated the equations by including the routine clinical factors which significantly improved C-statistics. After the recalibration and update, the O/E ratios were getting closer whereas the C-statistics also significantly increased. In validation, most CKD and ESRD models had moderate performance whereas DR and CVDs still varied from low to poor performances. We derived a successful Thai DR risk score with good discrimination and calibration. The traditional statistical models by logistic regression and seven MLs (i.e., Neural Network, Support Vector Machine, Decision Tree, Extreme Gradient Boosting, Random Forest, Naïve Bayesian, and LASSO Logistic Regression) were performed well and compared for their evaluation metrics. The Thai DR risk score is easy to use in identifying the T2D patient who have the high risk to develop DR in the routine clinical practice. Further external validations are needed to clarify the performance with the parallel development of the software applications | |
| dc.format.mimetype | application/pdf | |
| dc.identifier.uri | https://repository.li.mahidol.ac.th/handle/123456789/114202 | |
| dc.language.iso | eng | |
| dc.publisher | Mahidol University | |
| dc.rights | ผลงานนี้เป็นลิขสิทธิ์ของมหาวิทยาลัยมหิดล ขอสงวนไว้สำหรับเพื่อการศึกษาเท่านั้น ต้องอ้างอิงแหล่งที่มา ห้ามดัดแปลงเนื้อหา และห้ามนำไปใช้เพื่อการค้า | |
| dc.rights.holder | Mahidol University | |
| dc.subject | Diabetes -- Complications -- Risk factors | |
| dc.subject | Prognosis | |
| dc.subject | Machine learning -- Medical applications | |
| dc.subject | Diabetes, Type 2 -- Prognosis. | |
| dc.title | Prognostic model of complications in type 2 diabetes : traditional statistical model versus machine learning | |
| dc.type | Doctoral Thesis | |
| dcterms.accessRights | open access | |
| thesis.degree.department | Faculty of Medicine Ramathibodi Hospital | |
| thesis.degree.discipline | Data Science for Health Care | |
| thesis.degree.grantor | Mahidol University | |
| thesis.degree.level | Doctoral degree | |
| thesis.degree.name | Doctor of Philosophy |
