EC-RAFT: Automated Generation of Clinical Trial Eligibility Criteria through Retrieval-Augmented Fine-Tuning

dc.contributor.authorLekuthai N.
dc.contributor.authorPewngam N.
dc.contributor.authorSokrai S.
dc.contributor.authorAchakulvisut T.
dc.contributor.correspondenceLekuthai N.
dc.contributor.otherMahidol University
dc.date.accessioned2026-02-06T18:28:18Z
dc.date.available2026-02-06T18:28:18Z
dc.date.issued2025-01-01
dc.description.abstractEligibility criteria (EC) are critical components of clinical trial design, defining the parameters for participant inclusion and exclusion. However, designing EC remains a complex, expertise-intensive process. Traditional approaches to EC generation may fail to produce comprehensive, contextually appropriate criteria. To address these challenges, we introduce EC-RAFT, a method that utilizes Retrieval-Augmented Fine-Tuning (RAFT) to generate structured and cohesive EC directly from clinical trial titles and descriptions. EC-RAFT integrates contextual retrieval, synthesized intermediate reasoning, and fine-tuned language models to produce comprehensive EC sets. To enhance clinical alignment evaluation with referenced criteria, we also propose an LLM-guided evaluation pipeline. Our results demonstrate that our solution, which uses Llama-3.1-8BInstruct as a base model, achieves a BERTScore of 86.23 and an EC-matched LLM-as-a-Judge score of 1.66 out of 3, outperforming zero-shot Llama-3.1 and Gemini-1.5 by 0.41 and 0.11 points, respectively. On top of that, EC-RAFT also outperforms other fine-tuned versions of Llama-3.1. EC-RAFT was trained in a low-cost setup and, therefore, can be used as a practical solution for EC generation while ensuring quality and relevance in clinical trial design. We release our code on GitHub at https://github.com/biodatlab/ec-raft/.
dc.identifier.citationProceedings of the Annual Meeting of the Association for Computational Linguistics (2025) , 9432-9444
dc.identifier.doi10.18653/v1/2025.findings-acl.491
dc.identifier.issn0736587X
dc.identifier.scopus2-s2.0-105028640416
dc.identifier.urihttps://repository.li.mahidol.ac.th/handle/123456789/114700
dc.rights.holderSCOPUS
dc.subjectComputer Science
dc.subjectSocial Sciences
dc.subjectArts and Humanities
dc.titleEC-RAFT: Automated Generation of Clinical Trial Eligibility Criteria through Retrieval-Augmented Fine-Tuning
dc.typeConference Paper
mu.datasource.scopushttps://www.scopus.com/inward/record.uri?partnerID=HzOxMe3b&scp=105028640416&origin=inward
oaire.citation.endPage9444
oaire.citation.startPage9432
oaire.citation.titleProceedings of the Annual Meeting of the Association for Computational Linguistics
oairecerif.author.affiliationMahidol University
oairecerif.author.affiliationFaculty of Medicine Ramathibodi Hospital, Mahidol University
oairecerif.author.affiliationRavis Technology

Files

Collections