The IMPACT framework for evaluating generative AI in critical care: development and multinational consensus validation

dc.contributor.authorYeh Y.C.
dc.contributor.authorShih M.C.
dc.contributor.authorDe Backer D.
dc.contributor.authorCeli L.A.
dc.contributor.authorSee K.C.
dc.contributor.authorFujii T.
dc.contributor.authorLing L.
dc.contributor.authorMongkolpun W.
dc.contributor.authorHu H.W.
dc.contributor.authorChen H.Y.
dc.contributor.authorChen W.C.
dc.contributor.authorCholley B.
dc.contributor.authorFong K.K.
dc.contributor.authorRyu H.G.
dc.contributor.authorNa S.
dc.contributor.authorEgi M.
dc.contributor.authorChan W.S.
dc.contributor.authorChen K.F.
dc.contributor.authorKamaleswaran R.
dc.contributor.authorChuang Y.C.
dc.contributor.authorYang C.J.
dc.contributor.authorHsiao W.L.
dc.contributor.authorLai S.R.
dc.contributor.authorKu D.
dc.contributor.authorJahan A.
dc.contributor.authorMartin G.S.
dc.contributor.correspondenceYeh Y.C.
dc.contributor.otherMahidol University
dc.date.accessioned2026-06-08T18:15:28Z
dc.date.available2026-06-08T18:15:28Z
dc.date.issued2026-01-01
dc.description.abstractBackground Generative artificial intelligence (GenAI) is increasingly used for clinical decision support in critical care, yet standardized methods for evaluating GenAI content in intensive care settings are lacking. Existing metrics assess textual similarity but fail to capture clinical accuracy, reasoning quality, or urgency. Methods We developed and validated the IMPACT framework through a five-phase multinational panel consensus process. Reporting adhered to the ACCORD guideline. A steering committee of eight persons provided clinical and methodological oversight. Panelists were recruited through purposive sampling to ensure geographic and multidisciplinary representation. Content validity was assessed using the Content Validity Ratio (CVR) and Item-level Content Validity Index (I-CVI), with retention thresholds set at 70% agreement and I-CVI ≥0.80. Results A total of 58 panelists from 12 countries and regions participated, with 42 completing formal consensus voting. Participants included intensivists, physicians with AI research expertise, information technology specialists, and other critical care professionals. All six IMPACT domains exceeded validity thresholds (mean agreement 89.3%, CVR = 0.79, I-CVI = 0.92). Of 24 candidate subitems, 21 met retention criteria (mean agreement 85.7%, CVR = 0.71, I-CVI = 0.90). Three subitems were removed due to insufficient consensus and conceptual overlap. The validated framework comprises six domains with 21 subitems. Conclusions The IMPACT framework provides a consensus-validated approach for evaluating GenAI clinical decision support in intensive care, addressing gaps in current evaluation methods.
dc.identifier.citationAnnals of Intensive Care Vol.16 (2026)
dc.identifier.doi10.1016/j.aicoj.2026.100078
dc.identifier.eissn21105820
dc.identifier.scopus2-s2.0-105040547932
dc.identifier.urihttps://repository.li.mahidol.ac.th/handle/123456789/117141
dc.rights.holderSCOPUS
dc.subjectMedicine
dc.titleThe IMPACT framework for evaluating generative AI in critical care: development and multinational consensus validation
dc.typeArticle
mu.datasource.scopushttps://www.scopus.com/inward/record.uri?partnerID=HzOxMe3b&scp=105040547932&origin=inward
oaire.citation.titleAnnals of Intensive Care
oaire.citation.volume16
oairecerif.author.affiliationMassachusetts Institute of Technology
oairecerif.author.affiliationChinese University of Hong Kong
oairecerif.author.affiliationEmory University School of Medicine
oairecerif.author.affiliationUniversité Libre de Bruxelles
oairecerif.author.affiliationNational Tsing Hua University
oairecerif.author.affiliationDuke University School of Medicine
oairecerif.author.affiliationNational Taiwan University Hospital
oairecerif.author.affiliationChang Gung University
oairecerif.author.affiliationSeoul National University Hospital
oairecerif.author.affiliationChina Medical University Hospital
oairecerif.author.affiliationHôpital Européen Georges-Pompidou
oairecerif.author.affiliationSiriraj Hospital
oairecerif.author.affiliationNational University Hospital
oairecerif.author.affiliationSeverance Hospital
oairecerif.author.affiliationMonash Health
oairecerif.author.affiliationKyoto University Hospital
oairecerif.author.affiliationFar Eastern Memorial Hospital
oairecerif.author.affiliationJikei University Hospital
oairecerif.author.affiliationQueen Elizabeth Hospital
oairecerif.author.affiliationBrahmanbaria Medical College
oairecerif.author.affiliationTaiwan Artificial Intelligence Foundation

Files

Collections