EC-RAFT: Automated Generation of Clinical Trial Eligibility Criteria through Retrieval-Augmented Fine-Tuning
| dc.contributor.author | Lekuthai N. | |
| dc.contributor.author | Pewngam N. | |
| dc.contributor.author | Sokrai S. | |
| dc.contributor.author | Achakulvisut T. | |
| dc.contributor.correspondence | Lekuthai N. | |
| dc.contributor.other | Mahidol University | |
| dc.date.accessioned | 2026-02-06T18:28:18Z | |
| dc.date.available | 2026-02-06T18:28:18Z | |
| dc.date.issued | 2025-01-01 | |
| dc.description.abstract | Eligibility criteria (EC) are critical components of clinical trial design, defining the parameters for participant inclusion and exclusion. However, designing EC remains a complex, expertise-intensive process. Traditional approaches to EC generation may fail to produce comprehensive, contextually appropriate criteria. To address these challenges, we introduce EC-RAFT, a method that utilizes Retrieval-Augmented Fine-Tuning (RAFT) to generate structured and cohesive EC directly from clinical trial titles and descriptions. EC-RAFT integrates contextual retrieval, synthesized intermediate reasoning, and fine-tuned language models to produce comprehensive EC sets. To enhance clinical alignment evaluation with referenced criteria, we also propose an LLM-guided evaluation pipeline. Our results demonstrate that our solution, which uses Llama-3.1-8BInstruct as a base model, achieves a BERTScore of 86.23 and an EC-matched LLM-as-a-Judge score of 1.66 out of 3, outperforming zero-shot Llama-3.1 and Gemini-1.5 by 0.41 and 0.11 points, respectively. On top of that, EC-RAFT also outperforms other fine-tuned versions of Llama-3.1. EC-RAFT was trained in a low-cost setup and, therefore, can be used as a practical solution for EC generation while ensuring quality and relevance in clinical trial design. We release our code on GitHub at https://github.com/biodatlab/ec-raft/. | |
| dc.identifier.citation | Proceedings of the Annual Meeting of the Association for Computational Linguistics (2025) , 9432-9444 | |
| dc.identifier.doi | 10.18653/v1/2025.findings-acl.491 | |
| dc.identifier.issn | 0736587X | |
| dc.identifier.scopus | 2-s2.0-105028640416 | |
| dc.identifier.uri | https://repository.li.mahidol.ac.th/handle/123456789/114700 | |
| dc.rights.holder | SCOPUS | |
| dc.subject | Computer Science | |
| dc.subject | Social Sciences | |
| dc.subject | Arts and Humanities | |
| dc.title | EC-RAFT: Automated Generation of Clinical Trial Eligibility Criteria through Retrieval-Augmented Fine-Tuning | |
| dc.type | Conference Paper | |
| mu.datasource.scopus | https://www.scopus.com/inward/record.uri?partnerID=HzOxMe3b&scp=105028640416&origin=inward | |
| oaire.citation.endPage | 9444 | |
| oaire.citation.startPage | 9432 | |
| oaire.citation.title | Proceedings of the Annual Meeting of the Association for Computational Linguistics | |
| oairecerif.author.affiliation | Mahidol University | |
| oairecerif.author.affiliation | Faculty of Medicine Ramathibodi Hospital, Mahidol University | |
| oairecerif.author.affiliation | Ravis Technology |
