Optimizing Optical Character Recognition Within a Physical - Agentic AI System for Flexible Drug Preparation
| dc.contributor.author | Maneechay P. | |
| dc.contributor.author | Warinsiriruk E. | |
| dc.contributor.author | Wang Y.T. | |
| dc.contributor.correspondence | Maneechay P. | |
| dc.contributor.other | Mahidol University | |
| dc.date.accessioned | 2026-06-09T18:28:16Z | |
| dc.date.available | 2026-06-09T18:28:16Z | |
| dc.date.issued | 2025-01-01 | |
| dc.description.abstract | The conventional camera-based prescription-label reading process used in existing automated systems has notable limitations in both accuracy and latency. These issues stem primarily from Optical Character Recognition (OCR) pipelines that were not optimized for real-world label characteristics (such as varying font complexity, size, and image quality), resulting in misread text and delays that fail to meet operational requirements. To address these shortcomings, this study developed an improved processing pipeline by comparing the performance of EasyOCR and PyTesseract under image-downscaling conditions ranging from 0.1 to 0.9. In parallel, an integrated N8N-AI Agent workflow was designed to enhance both the speed and accuracy of medication-label extraction. The proposed system combines appropriate pre-processing, selective OCR utilization, and the incorporation of reference data directly within the model. This integration leads to more stable label-reading performance, enabling the system to correctly identify medication names while reducing overall processing time compared with the previous approach. Experimental results show that PyTesseract processes images approximately 5-10 times faster than EasyOCR, whereas EasyOCR consistently delivers higher recognition accuracy. When combined with reference data, the workflow using a system-prompt approach proved more than ten times faster than the CSV-based lookup method. Optimizing the OCR for image complexity, minimizing node count, and applying in-memory processing collectively improved both the responsiveness and accuracy of the system. As a result, the new pipeline operates near real time, reduces bottlenecks associated with redundant file operations, and maintains stable performance across diverse medication-label formats, an essential requirement for reliable deployment in medical environments where precision and consistency are critical. | |
| dc.identifier.citation | 6th Technology Innovation Management and Engineering Science International Conference Times Icon 2025 Proceedings (2025) | |
| dc.identifier.doi | 10.1109/TIMES-iCON67125.2025.11488140 | |
| dc.identifier.scopus | 2-s2.0-105040605495 | |
| dc.identifier.uri | https://repository.li.mahidol.ac.th/handle/123456789/117191 | |
| dc.rights.holder | SCOPUS | |
| dc.subject | Energy | |
| dc.subject | Business, Management and Accounting | |
| dc.subject | Computer Science | |
| dc.subject | Medicine | |
| dc.subject | Decision Sciences | |
| dc.title | Optimizing Optical Character Recognition Within a Physical - Agentic AI System for Flexible Drug Preparation | |
| dc.type | Conference Paper | |
| mu.datasource.scopus | https://www.scopus.com/inward/record.uri?partnerID=HzOxMe3b&scp=105040605495&origin=inward | |
| oaire.citation.title | 6th Technology Innovation Management and Engineering Science International Conference Times Icon 2025 Proceedings | |
| oairecerif.author.affiliation | Mahidol University | |
| oairecerif.author.affiliation | Tamkang University | |
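
The abstract describes comparing two OCR engines across image-downscaling conditions from 0.1 to 0.9 while keeping processing in memory. The following is a minimal sketch of such a comparison harness, not the authors' implementation. It assumes the 0.1 to 0.9 range means scale factors in steps of 0.1, represents an image as a plain 2D list of pixel values so it runs without imaging libraries, and treats each OCR engine as a hypothetical callable that takes an image and returns text. The names `SCALES`, `downscale`, `similarity`, and `benchmark` are illustrative and do not come from the paper.

```python
import time
from difflib import SequenceMatcher
from typing import Callable, Dict, List

# Assumption: an "image" is a 2D list of pixel values (stand-in for a real array).
Image = List[List[int]]
# Assumption: an OCR engine is any callable mapping an image to recognized text.
OcrEngine = Callable[[Image], str]

# Assumed scale factors 0.1, 0.2, ..., 0.9 (the abstract only gives the range).
SCALES = [round(0.1 * k, 1) for k in range(1, 10)]


def downscale(image: Image, scale: float) -> Image:
    """Nearest-neighbour downscale performed entirely in memory (no temp files)."""
    src_h, src_w = len(image), len(image[0])
    dst_h = max(1, int(src_h * scale))
    dst_w = max(1, int(src_w * scale))
    return [
        [
            image[min(src_h - 1, int(r / scale))][min(src_w - 1, int(c / scale))]
            for c in range(dst_w)
        ]
        for r in range(dst_h)
    ]


def similarity(predicted: str, expected: str) -> float:
    """Case-insensitive similarity in [0, 1] between OCR output and the true label."""
    a = predicted.strip().lower()
    b = expected.strip().lower()
    return SequenceMatcher(None, a, b).ratio()


def benchmark(
    engines: Dict[str, OcrEngine],
    image: Image,
    expected: str,
    scales: List[float] = SCALES,
) -> List[dict]:
    """Time every engine at every scale and score its output against `expected`."""
    rows = []
    for scale in scales:
        small = downscale(image, scale)  # resized once per scale, shared by engines
        for name, engine in engines.items():
            start = time.perf_counter()
            text = engine(small)
            elapsed = time.perf_counter() - start
            rows.append(
                {
                    "engine": name,
                    "scale": scale,
                    "seconds": elapsed,
                    "similarity": similarity(text, expected),
                }
            )
    return rows
```

In a real setup, the callables passed to `benchmark` would presumably wrap `pytesseract.image_to_string` and an `easyocr.Reader(...).readtext` call, and `downscale` would be replaced by an imaging library's resize function. The timing and similarity columns then give the speed-versus-accuracy trade-off per scale that the abstract reports.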
