Comparative Evaluation of Large Language Models for Automated Text Classification and Data Labeling in Drug Recommendation

Rattanachayabun K.; Jamrat S.; Samanchuen T.

Comparative Evaluation of Large Language Models for Automated Text Classification and Data Labeling in Drug Recommendation

dc.contributor.author	Rattanachayabun K.
dc.contributor.author	Jamrat S.
dc.contributor.author	Samanchuen T.
dc.contributor.correspondence	Rattanachayabun K.
dc.contributor.other	Mahidol University
dc.date.accessioned	2026-06-09T18:24:33Z
dc.date.available	2026-06-09T18:24:33Z
dc.date.issued	2025-01-01
dc.description.abstract	Data labeling remains one of the most laborintensive stages in artificial intelligence (AI) development, particularly in natural language processing. This study investigates the potential of large language models (LLMs) to automate text classification and reduce the burden of manual annotation. Four state-of-the-art LLMs-GPT-4o, Llama 3.3, Llama 3.1, and Gemma 2-were evaluated on a drug recommendation dataset using three prompting strategies: zero-shot, one-shot, and few-shot learning. Model performance was assessed using F1-scores across therapeutic consideration, use as directed, and contraindication categories. The results indicate that incorporating even a single labeled example substantially improves classification accuracy compared with zero-shot prompting, while the performance gain from one-shot to few-shot prompting is marginal. Among the tested models, Llama 3.3 achieved the most consistent results, whereas Gemma 2 demonstrated strong zeroshot generalization and GPT-4o provided stable cross-strategy performance. The findings highlight the feasibility of employing LLMs for automated data labeling and underscore the efficiency of one-shot prompting as a practical balance between accuracy and computational cost.
dc.identifier.citation	6th Technology Innovation Management and Engineering Science International Conference Times Icon 2025 Proceedings (2025)
dc.identifier.doi	10.1109/TIMES-iCON67125.2025.11488078
dc.identifier.scopus	2-s2.0-105040591055
dc.identifier.uri	https://repository.li.mahidol.ac.th/handle/123456789/117183
dc.rights.holder	SCOPUS
dc.subject	Energy
dc.subject	Business, Management and Accounting
dc.subject	Computer Science
dc.subject	Medicine
dc.subject	Decision Sciences
dc.title	Comparative Evaluation of Large Language Models for Automated Text Classification and Data Labeling in Drug Recommendation
dc.type	Conference Paper
mu.datasource.scopus	https://www.scopus.com/inward/record.uri?partnerID=HzOxMe3b&scp=105040591055&origin=inward
oaire.citation.title	6th Technology Innovation Management and Engineering Science International Conference Times Icon 2025 Proceedings
oairecerif.author.affiliation	Mahidol University
oairecerif.author.affiliation	Silpakorn University

Collections

Scopus 2025

	Office Hour: Monday-Friday 08.30-12.00 and 13.00-16.30 hrs.
	Phutthamonthon Sai 4 Rd. Salaya, Nakhon Pathom 73170, Thailand
	The office: +66 (2) 800 2680 ext.4306
	thipsuda.van@mahidol.ac.th
	https://repository.li.mahidol.ac.th

Comparative Evaluation of Large Language Models for Automated Text Classification and Data Labeling in Drug Recommendation

Files

Collections