Integrating Agentic Artificial Intelligence to Automate International Classification of Diseases, Tenth Revision, Medical Coding

Akkhawatthanakun K.; Narupiyakul L.; Wongpatikaseree K.; Hnoohom N.; Termritthikun C.; Muneesawang P.

Integrating Agentic Artificial Intelligence to Automate International Classification of Diseases, Tenth Revision, Medical Coding

dc.contributor.author	Akkhawatthanakun K.
dc.contributor.author	Narupiyakul L.
dc.contributor.author	Wongpatikaseree K.
dc.contributor.author	Hnoohom N.
dc.contributor.author	Termritthikun C.
dc.contributor.author	Muneesawang P.
dc.contributor.correspondence	Akkhawatthanakun K.
dc.contributor.other	Mahidol University
dc.date.accessioned	2026-04-09T18:38:23Z
dc.date.available	2026-04-09T18:38:23Z
dc.date.issued	2026-03-01
dc.description.abstract	Automating ICD-10 coding from discharge summaries remains demanding because coders analyze clinical narratives while justifying decisions. This study compares three automation patterns: PLM-ICD as a standalone deep learning system emitting 15 codes per case, LLM-only generation with full autonomy, and a hybrid approach where PLM-ICD drafts candidates for an agentic LLM audit to accept or reject. All strategies were evaluated on 19,801 MIMIC-IV summaries using four LLMs spanning compact (Qwen2.5-3B-Instruct, Llama-3.2-3B-Instruct, Phi-4-mini-instruct) to large-scale (Sonnet-4.5). Precision guided evaluation because coders still supply any missing diagnoses. PLM-ICD alone reached 55.8% precision while always surfacing 15 suggestions. LLM-only generation lagged severely (1.5–34.6% precision) and produced inconsistent output sizes. The agentic audit delivered the best trade-off: compact LLMs reviewed the 15 candidates, discarded weak evidence, and returned 2–8 high-confidence codes. Llama-3.2-3B-Instruct, for example, improved from 1.5% as a generator to 55.1% as a verifier while trimming false positives by 73%. These results show that positioning LLMs as quality controllers, rather than primary generators, yields reliable support for clinical coding teams, while formal recall/F1 reporting remains future work for fully autonomous implementations.
dc.identifier.citation	Informatics Vol.13 No.3 (2026)
dc.identifier.doi	10.3390/informatics13030039
dc.identifier.eissn	22279709
dc.identifier.scopus	2-s2.0-105033869908
dc.identifier.uri	https://repository.li.mahidol.ac.th/handle/123456789/116029
dc.rights.holder	SCOPUS
dc.subject	Computer Science
dc.subject	Social Sciences
dc.title	Integrating Agentic Artificial Intelligence to Automate International Classification of Diseases, Tenth Revision, Medical Coding
dc.type	Article
mu.datasource.scopus	https://www.scopus.com/inward/record.uri?partnerID=HzOxMe3b&scp=105033869908&origin=inward
oaire.citation.issue	3
oaire.citation.title	Informatics
oaire.citation.volume	13
oairecerif.author.affiliation	Mahidol University
oairecerif.author.affiliation	Naresuan University

Collections

Scopus 2026

	Office Hour: Monday-Friday 08.30-12.00 and 13.00-16.30 hrs.
	Phutthamonthon Sai 4 Rd. Salaya, Nakhon Pathom 73170, Thailand
	The office: +66 (2) 800 2680 ext.4306
	thipsuda.van@mahidol.ac.th
	https://repository.li.mahidol.ac.th

Integrating Agentic Artificial Intelligence to Automate International Classification of Diseases, Tenth Revision, Medical Coding

Files

Collections