Comparative Evaluation of Large Language Models for Automated Text Classification and Data Labeling in Drug Recommendation
| dc.contributor.author | Rattanachayabun K. | |
| dc.contributor.author | Jamrat S. | |
| dc.contributor.author | Samanchuen T. | |
| dc.contributor.correspondence | Rattanachayabun K. | |
| dc.contributor.other | Mahidol University | |
| dc.date.accessioned | 2026-06-09T18:24:33Z | |
| dc.date.available | 2026-06-09T18:24:33Z | |
| dc.date.issued | 2025-01-01 | |
| dc.description.abstract | Data labeling remains one of the most laborintensive stages in artificial intelligence (AI) development, particularly in natural language processing. This study investigates the potential of large language models (LLMs) to automate text classification and reduce the burden of manual annotation. Four state-of-the-art LLMs-GPT-4o, Llama 3.3, Llama 3.1, and Gemma 2-were evaluated on a drug recommendation dataset using three prompting strategies: zero-shot, one-shot, and few-shot learning. Model performance was assessed using F1-scores across therapeutic consideration, use as directed, and contraindication categories. The results indicate that incorporating even a single labeled example substantially improves classification accuracy compared with zero-shot prompting, while the performance gain from one-shot to few-shot prompting is marginal. Among the tested models, Llama 3.3 achieved the most consistent results, whereas Gemma 2 demonstrated strong zeroshot generalization and GPT-4o provided stable cross-strategy performance. The findings highlight the feasibility of employing LLMs for automated data labeling and underscore the efficiency of one-shot prompting as a practical balance between accuracy and computational cost. | |
| dc.identifier.citation | 6th Technology Innovation Management and Engineering Science International Conference Times Icon 2025 Proceedings (2025) | |
| dc.identifier.doi | 10.1109/TIMES-iCON67125.2025.11488078 | |
| dc.identifier.scopus | 2-s2.0-105040591055 | |
| dc.identifier.uri | https://repository.li.mahidol.ac.th/handle/123456789/117183 | |
| dc.rights.holder | SCOPUS | |
| dc.subject | Energy | |
| dc.subject | Business, Management and Accounting | |
| dc.subject | Computer Science | |
| dc.subject | Medicine | |
| dc.subject | Decision Sciences | |
| dc.title | Comparative Evaluation of Large Language Models for Automated Text Classification and Data Labeling in Drug Recommendation | |
| dc.type | Conference Paper | |
| mu.datasource.scopus | https://www.scopus.com/inward/record.uri?partnerID=HzOxMe3b&scp=105040591055&origin=inward | |
| oaire.citation.title | 6th Technology Innovation Management and Engineering Science International Conference Times Icon 2025 Proceedings | |
| oairecerif.author.affiliation | Mahidol University | |
| oairecerif.author.affiliation | Silpakorn University |
