Optimizing Optical Character Recognition Within a Physical - Agentic AI System for Flexible Drug Preparation

Maneechay P.; Warinsiriruk E.; Wang Y.T.

dc.contributor.author	Maneechay P.
dc.contributor.author	Warinsiriruk E.
dc.contributor.author	Wang Y.T.
dc.contributor.correspondence	Maneechay P.
dc.contributor.other	Mahidol University
dc.date.accessioned	2026-06-09T18:28:16Z
dc.date.available	2026-06-09T18:28:16Z
dc.date.issued	2025-01-01
dc.description.abstract	The conventional camera-based prescription-label reading process used in existing automated systems has notable limitations in both accuracy and latency. These issues stem primarily from Optical Character Recognition (OCR) pipelines that were not optimized for real-world label characteristics, such as varying font complexity, size, and image quality, which leads to misread text and delays that fail to meet operational requirements. To address these shortcomings, this study developed an improved processing pipeline by comparing the performance of EasyOCR and PyTesseract at image-downscaling factors ranging from 0.1 to 0.9. In parallel, an integrated N8N-AI Agent workflow was designed to improve both the speed and accuracy of medication-label extraction. The proposed system combines appropriate pre-processing, selective OCR utilization, and the incorporation of reference data directly within the model. This integration yields more stable label-reading performance, enabling the system to correctly identify medication names while reducing overall processing time compared with the previous approach. Experimental results show that PyTesseract processes images approximately 5-10 times faster than EasyOCR, whereas EasyOCR consistently delivers higher recognition accuracy. When combined with reference data, the workflow using a system-prompt approach proved more than ten times faster than the CSV-based lookup method. Matching the OCR engine to image complexity, minimizing node count, and applying in-memory processing collectively improved both the responsiveness and accuracy of the system. As a result, the new pipeline operates in near real time, reduces bottlenecks associated with redundant file operations, and maintains stable performance across diverse medication-label formats, an essential requirement for reliable deployment in medical environments where precision and consistency are critical.
dc.identifier.citation	6th Technology Innovation Management and Engineering Science International Conference Times Icon 2025 Proceedings (2025)
dc.identifier.doi	10.1109/TIMES-iCON67125.2025.11488140
dc.identifier.scopus	2-s2.0-105040605495
dc.identifier.uri	https://repository.li.mahidol.ac.th/handle/123456789/117191
dc.rights.holder	SCOPUS
dc.subject	Energy
dc.subject	Business, Management and Accounting
dc.subject	Computer Science
dc.subject	Medicine
dc.subject	Decision Sciences
dc.title	Optimizing Optical Character Recognition Within a Physical - Agentic AI System for Flexible Drug Preparation
dc.type	Conference Paper
mu.datasource.scopus	https://www.scopus.com/inward/record.uri?partnerID=HzOxMe3b&scp=105040605495&origin=inward
oaire.citation.title	6th Technology Innovation Management and Engineering Science International Conference Times Icon 2025 Proceedings
oairecerif.author.affiliation	Mahidol University
oairecerif.author.affiliation	Tamkang University
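
The comparison described in the abstract (two OCR engines timed and scored at downscaling factors from 0.1 to 0.9) can be illustrated with a small benchmarking harness. This is a minimal sketch and not the authors' implementation: the function names `downscale` and `benchmark`, the nearest-neighbour resize on a plain 2D pixel grid, and the `difflib` similarity score are all assumptions chosen to keep the example self-contained. The OCR engines are passed in as ordinary callables, so nothing here depends on a specific library.

```python
import time
from difflib import SequenceMatcher

# Scale factors 0.1 .. 0.9, the range named in the abstract.
SCALES = [round(0.1 * i, 1) for i in range(1, 10)]


def downscale(pixels, scale):
    """Nearest-neighbour downscale of a 2D grid (list of rows).

    Hypothetical stand-in for a real image resize; kept in memory,
    with no intermediate files written.
    """
    height, width = len(pixels), len(pixels[0])
    new_h = max(1, int(height * scale))
    new_w = max(1, int(width * scale))
    return [
        [
            pixels[min(height - 1, int(r / scale))][min(width - 1, int(c / scale))]
            for c in range(new_w)
        ]
        for r in range(new_h)
    ]


def benchmark(engines, pixels, truth, scales=SCALES):
    """Time each OCR callable at each scale and score its output.

    `engines` maps a label to a callable taking an image and returning
    text. The similarity score is a case-insensitive difflib ratio,
    an assumed metric rather than the one used in the paper.
    """
    rows = []
    for scale in scales:
        small = downscale(pixels, scale)
        for name, ocr in engines.items():
            start = time.perf_counter()
            text = ocr(small)
            elapsed = time.perf_counter() - start
            score = SequenceMatcher(None, truth.lower(), text.lower()).ratio()
            rows.append(
                {"engine": name, "scale": scale, "seconds": elapsed, "similarity": score}
            )
    return rows
```

In a real setup the callables would wrap the actual engines, for example `pytesseract.image_to_string` and the `readtext` method of an `easyocr.Reader`, and the image would typically be a NumPy array resized with an imaging library rather than a nested list. Sorting the resulting rows by `seconds` and `similarity` shows the speed and accuracy trade-off at each scale.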

Collections

Scopus 2025
