The Significance of Time Duration and Feature Extraction of Voice Signal Dataset for Depression Classification
Issued Date
2024-01-01
Resource Type
Scopus ID
2-s2.0-105000365814
Journal Title
16th Biomedical Engineering International Conference, BMEiCON 2024
Rights Holder(s)
SCOPUS
Bibliographic Citation
16th Biomedical Engineering International Conference, BMEiCON 2024 (2024)
Suggested Citation
Tongnopparat P., Kiatrungrit K., Treebupachatsakul T., Poomrittigul S. The Significance of Time Duration and Feature Extraction of Voice Signal Dataset for Depression Classification. 16th Biomedical Engineering International Conference, BMEiCON 2024 (2024). doi:10.1109/BMEiCON64021.2024.10896333 Retrieved from: https://repository.li.mahidol.ac.th/handle/20.500.14594/108610
Title
The Significance of Time Duration and Feature Extraction of Voice Signal Dataset for Depression Classification
Corresponding Author(s)
Other Contributor(s)
Abstract
This project aims to study the effect of time duration of audio datasets and feature extractions for depression classification of machine learning models. The three different time durations of audio signals including dataset (1): 1-6 min, dataset (2): 1 min, and dataset (3): 30 seconds were investigated. There were 5 machine learning models applied with 12 feature extractions including 4 features of time domains, 4 features of frequency domains, and 4 features of time-frequency domains. The statistic significant of time duration of dataset and feature extraction were examined by one-way ANDV A compared to a p-critical of 0.05. The results of performance metrics indicate the highest accuracy, and fl-score was achieved by logistic regression with MFCC on dataset (1), which was 88.24 % and 85.71%, respectively. Moreover, the precision and recall reached 1 by several models and feature extraction, especially in frequency domains and time-frequency domains. Also, the time durations and feature extraction on each dataset didn't have statistical significance on model performance.