PromptOps: Automated Tool for Testing Trustworthiness of LLMs
| dc.contributor.author | Sontesadisai C. | |
| dc.contributor.author | Sae-Ngow C. | |
| dc.contributor.author | Rudeerudchanawong J. | |
| dc.contributor.author | Dangsungnoen L. | |
| dc.contributor.author | Ragkhitwetsagul C. | |
| dc.contributor.author | Racharak T. | |
| dc.contributor.author | Sunetnanta T. | |
| dc.contributor.correspondence | Sontesadisai C. | |
| dc.contributor.other | Mahidol University | |
| dc.date.accessioned | 2026-04-16T18:45:39Z | |
| dc.date.available | 2026-04-16T18:45:39Z | |
| dc.date.issued | 2025-01-01 | |
| dc.description.abstract | Large Language Models (LLMs) are increasingly used across a wide range of natural language processing tasks. Despite their growing adoption, concerns remain about their trustworthiness, i.e., their reliability and validity across diverse applications. This paper introduces PromptOps, a novel visual LLM testing tool that applies the principles of metamorphic testing to assess LLMs beyond traditional accuracy metrics. The tool evaluates LLMs on critical properties such as robustness, fairness, and logical consistency. It lets users design custom test cases via visual programming, define specific prompts, and automatically generate diverse test scenarios. By identifying areas for improvement in both performance and fairness, PromptOps gives model developers greater transparency. A video demonstration of PromptOps is available at https://youtu.be/M6TbvPIt9kE, and the tool is available at https://github.com/MUICT-SERU/PromptOps. | |
| dc.identifier.citation | Proceedings Asia Pacific Software Engineering Conference APSEC (2025), 1005-1008 | |
| dc.identifier.doi | 10.1109/APSEC66846.2025.00117 | |
| dc.identifier.issn | 15301362 | |
| dc.identifier.scopus | 2-s2.0-105035196380 | |
| dc.identifier.uri | https://repository.li.mahidol.ac.th/handle/123456789/116232 | |
| dc.rights.holder | SCOPUS | |
| dc.subject | Computer Science | |
| dc.title | PromptOps: Automated Tool for Testing Trustworthiness of LLMs | |
| dc.type | Conference Paper | |
| mu.datasource.scopus | https://www.scopus.com/inward/record.uri?partnerID=HzOxMe3b&scp=105035196380&origin=inward | |
| oaire.citation.endPage | 1008 | |
| oaire.citation.startPage | 1005 | |
| oaire.citation.title | Proceedings Asia Pacific Software Engineering Conference APSEC | |
| oairecerif.author.affiliation | Tohoku University | |
| oairecerif.author.affiliation | Mahidol University | |
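
The abstract describes assessing LLMs with metamorphic testing, where a prompt is transformed in a way that should not change the answer and the two outputs are compared. The sketch below illustrates that general idea for a robustness check only. It is not taken from PromptOps: the names `swap_adjacent_chars`, `check_invariance`, and `toy_sentiment_model` are hypothetical, and the toy model is a keyword stub standing in for a real LLM call.

```python
from typing import Callable, Tuple

# Hypothetical type alias: any function that maps a prompt to an answer string.
Model = Callable[[str], str]


def swap_adjacent_chars(text: str, index: int) -> str:
    """Simulate a typo by swapping the characters at index and index + 1."""
    if index < 0 or index >= len(text) - 1:
        return text  # nothing to swap at this position
    chars = list(text)
    chars[index], chars[index + 1] = chars[index + 1], chars[index]
    return "".join(chars)


def check_invariance(
    model: Model, prompt: str, transform: Callable[[str], str]
) -> Tuple[bool, str, str]:
    """Metamorphic relation: the answer should not change under `transform`.

    Returns (relation_holds, original_answer, perturbed_answer).
    """
    original = model(prompt)
    perturbed = model(transform(prompt))
    return original == perturbed, original, perturbed


def toy_sentiment_model(prompt: str) -> str:
    """Stand-in for an LLM (an assumption for illustration): keyword lookup."""
    return "positive" if "good" in prompt.lower() else "negative"


if __name__ == "__main__":
    prompt = "The movie was good"
    # Changing letter case should not alter the predicted sentiment.
    print(check_invariance(toy_sentiment_model, prompt, str.upper))
    # A typo inside the keyword exposes the stub's lack of robustness.
    print(check_invariance(toy_sentiment_model, prompt,
                           lambda t: swap_adjacent_chars(t, 14)))
```

No ground-truth label is needed here: the check only compares the model's answer before and after the transformation, which is what lets this style of testing go beyond accuracy on a labeled set.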
