Bilingual Audio Depression Identification Model by Machine Learning
| dc.contributor.author | Poomrittigul S. | |
| dc.contributor.author | Kiatrungrit K. | |
| dc.contributor.author | Homsiang P. | |
| dc.contributor.author | Treebupachatsakul T. | |
| dc.contributor.correspondence | Poomrittigul S. | |
| dc.contributor.other | Mahidol University | |
| dc.date.accessioned | 2025-09-27T18:14:23Z | |
| dc.date.available | 2025-09-27T18:14:23Z | |
| dc.date.issued | 2025-01-01 | |
| dc.description.abstract | The number of depression patients worldwide, particularly in Thailand, is increasing on an upward trend. Depression screening commonly relies on self-report questionnaires. However, these instruments provide subjective assessments. Recent advancements in machine learning technology offer potential improvements in diagnostic accuracy through more objective measures. This study aims to evaluate the effectiveness of machine learning models in classifying depression using a bilingual audio dataset comprising Thai and English languages. Such models have the potential to assist clinicians by providing objective preliminary screening for depression based on vocal analysis, enhancing diagnostic precision and clinical decision-making. Various machine learning models were implemented including KNN, MLP, Random Forest, Decision Tree, SGD, Logistic Regression, SVM, AdaBoost, and Gaussian Naïve Bayes using MFCC-converted audio datasets. The results indicate that machine learning models effectively classify and identify depression even in bilingual audio datasets compared to individual language models, with the highest accuracy reaching 0.95 from MLP and KNN when testing the trained model by a single Thai audio. | |
| dc.identifier.citation | 2025 International Technical Conference on Circuits Systems Computers and Communications Itc Cscc 2025 (2025) | |
| dc.identifier.doi | 10.1109/ITC-CSCC66376.2025.11137688 | |
| dc.identifier.scopus | 2-s2.0-105016359557 | |
| dc.identifier.uri | https://repository.li.mahidol.ac.th/handle/123456789/112290 | |
| dc.rights.holder | SCOPUS | |
| dc.subject | Computer Science | |
| dc.subject | Engineering | |
| dc.title | Bilingual Audio Depression Identification Model by Machine Learning | |
| dc.type | Conference Paper | |
| mu.datasource.scopus | https://www.scopus.com/inward/record.uri?partnerID=HzOxMe3b&scp=105016359557&origin=inward | |
| oaire.citation.title | 2025 International Technical Conference on Circuits Systems Computers and Communications Itc Cscc 2025 | |
| oairecerif.author.affiliation | King Mongkut's Institute of Technology Ladkrabang | |
| oairecerif.author.affiliation | Ramathibodi Hospital |
