Large language model triaging of simulated nephrology patient inbox messages

Pham J.H.; Thongprayoon C.; Miao J.; Suppadungsuk S.; Koirala P.; Craici I.M.; Cheungpasitporn W.

Large language model triaging of simulated nephrology patient inbox messages

dc.contributor.author	Pham J.H.
dc.contributor.author	Thongprayoon C.
dc.contributor.author	Miao J.
dc.contributor.author	Suppadungsuk S.
dc.contributor.author	Koirala P.
dc.contributor.author	Craici I.M.
dc.contributor.author	Cheungpasitporn W.
dc.contributor.correspondence	Pham J.H.
dc.contributor.other	Mahidol University
dc.date.accessioned	2024-09-29T18:37:34Z
dc.date.available	2024-09-29T18:37:34Z
dc.date.issued	2024-01-01
dc.description.abstract	Background: Efficient triage of patient communications is crucial for timely medical attention and improved care. This study evaluates ChatGPT’s accuracy in categorizing nephrology patient inbox messages, assessing its potential in outpatient settings. Methods: One hundred and fifty simulated patient inbox messages were created based on cases typically encountered in everyday practice at a nephrology outpatient clinic. These messages were triaged as non-urgent, urgent, and emergent by two nephrologists. The messages were then submitted to ChatGPT-4 for independent triage into the same categories. The inquiry process was performed twice with a two-week period in between. ChatGPT responses were graded as correct (agreement with physicians), overestimation (higher priority), or underestimation (lower priority). Results: In the first trial, ChatGPT correctly triaged 140 (93%) messages, overestimated the priority of 4 messages (3%), and underestimated the priority of 6 messages (4%). In the second trial, it correctly triaged 140 (93%) messages, overestimated the priority of 9 (6%), and underestimated the priority of 1 (1%). The accuracy did not depend on the urgency level of the message (p = 0.19). The internal agreement of ChatGPT responses was 92% with an intra-rater Kappa score of 0.88. Conclusion: ChatGPT-4 demonstrated high accuracy in triaging nephrology patient messages, highlighting the potential for AI-driven triage systems to enhance operational efficiency and improve patient care in outpatient clinics.
dc.identifier.citation	Frontiers in Artificial Intelligence Vol.7 (2024)
dc.identifier.doi	10.3389/frai.2024.1452469
dc.identifier.eissn	26248212
dc.identifier.scopus	2-s2.0-85204728264
dc.identifier.uri	https://repository.li.mahidol.ac.th/handle/123456789/101418
dc.rights.holder	SCOPUS
dc.subject	Computer Science
dc.title	Large language model triaging of simulated nephrology patient inbox messages
dc.type	Article
mu.datasource.scopus	https://www.scopus.com/inward/record.uri?partnerID=HzOxMe3b&scp=85204728264&origin=inward
oaire.citation.title	Frontiers in Artificial Intelligence
oaire.citation.volume	7
oairecerif.author.affiliation	Mayo Clinic College of Medicine and Science
oairecerif.author.affiliation	Faculty of Medicine Ramathibodi Hospital, Mahidol University
oairecerif.author.affiliation	Mayo Clinic

Collections

Scopus 2024

	Office Hour: Monday-Friday 08.30-12.00 and 13.00-16.30 hrs.
	Phutthamonthon Sai 4 Rd. Salaya, Nakhon Pathom 73170, Thailand
	The office: +66 (2) 800 2680 ext.4306
	thipsuda.van@mahidol.ac.th
	https://repository.li.mahidol.ac.th

Large language model triaging of simulated nephrology patient inbox messages

Files

Collections