PromptOps: Automated Tool for Testing Trustworthiness of LLMs

dc.contributor.authorSontesadisai C.
dc.contributor.authorSae-Ngow C.
dc.contributor.authorRudeerudchanawong J.
dc.contributor.authorDangsungnoen L.
dc.contributor.authorRagkhitwetsagul C.
dc.contributor.authorRacharak T.
dc.contributor.authorSunetnanta T.
dc.contributor.correspondenceSontesadisai C.
dc.contributor.otherMahidol University
dc.date.accessioned2026-04-16T18:45:39Z
dc.date.available2026-04-16T18:45:39Z
dc.date.issued2025-01-01
dc.description.abstractLarge Language Models (LLMs) are increasingly utilized in a wide range of natural language processing tasks. Despite their growing adoption, concerns regarding their trustworthiness, i.e., reliability and validity across diverse applications, still remain. This paper introduces a novel visual-based LLM testing tool called PromptOps using the principles of metamorphic testing to assess LLMs beyond traditional accuracy metrics. The tool evaluates LLMs on critical properties such as robustness, fairness, and logical consistency. The tool enables users to design custom test cases via visual programming, define specific prompts, and automatically generate diverse test scenarios. PromptOps fosters greater transparency for model developers by identifying areas for improvement in both performance and fairness. The video demonstration of the PromptOps tool is available at https://youtu.be/M6TbvPIt9kE, and the tool is available at https://github.com/MUICT-SERU/PromptOps.
dc.identifier.citationProceedings Asia Pacific Software Engineering Conference APSEC (2025) , 1005-1008
dc.identifier.doi10.1109/APSEC66846.2025.00117
dc.identifier.issn15301362
dc.identifier.scopus2-s2.0-105035196380
dc.identifier.urihttps://repository.li.mahidol.ac.th/handle/123456789/116232
dc.rights.holderSCOPUS
dc.subjectComputer Science
dc.titlePromptOps: Automated Tool for Testing Trustworthiness of LLMs
dc.typeConference Paper
mu.datasource.scopushttps://www.scopus.com/inward/record.uri?partnerID=HzOxMe3b&scp=105035196380&origin=inward
oaire.citation.endPage1008
oaire.citation.startPage1005
oaire.citation.titleProceedings Asia Pacific Software Engineering Conference APSEC
oairecerif.author.affiliationTohoku University
oairecerif.author.affiliationMahidol University

Files

Collections