Synthetic Data for Scam Detection: Leveraging LLMs to Train Deep Learning Models
| dc.contributor.author | Gumphusiri P. | |
| dc.contributor.author | Triyason T. | |
| dc.contributor.correspondence | Gumphusiri P. | |
| dc.contributor.other | Mahidol University | |
| dc.date.accessioned | 2025-06-10T18:18:39Z | |
| dc.date.available | 2025-06-10T18:18:39Z | |
| dc.date.issued | 2024-01-01 | |
| dc.description.abstract | This paper presents a novel approach to training scam detection models using synthetic data generated by Large Language Models (LLMs). We propose single-agent and multi-agent methods for data generation and train six deep learning architectures-LSTM, BiLSTM, GRU, BiGRU, CNN, and BERT-to classify conversations as scam or non-scam. Our experiments demonstrate that models trained on synthetic data achieve high accuracy on both generated test sets and real-world scam conversations. The models perform well even with limited conversation turns and when analyzing only the suspect's messages, indicating potential for early scam detection and privacy-preserving applications. Our findings highlight the efficacy of synthetic data in overcoming real-world dataset limitations for scam detection. We make the dataset and trained models publicly available to facilitate further research and development in this critical area of fraud prevention. | |
| dc.identifier.citation | Proceedings 2024 IEEE Wic International Conference on Web Intelligence and Intelligent Agent Technology Wi Iat 2024 (2024) , 868-873 | |
| dc.identifier.doi | 10.1109/WI-IAT62293.2024.00141 | |
| dc.identifier.scopus | 2-s2.0-105007152224 | |
| dc.identifier.uri | https://repository.li.mahidol.ac.th/handle/123456789/110613 | |
| dc.rights.holder | SCOPUS | |
| dc.subject | Computer Science | |
| dc.title | Synthetic Data for Scam Detection: Leveraging LLMs to Train Deep Learning Models | |
| dc.type | Conference Paper | |
| mu.datasource.scopus | https://www.scopus.com/inward/record.uri?partnerID=HzOxMe3b&scp=105007152224&origin=inward | |
| oaire.citation.endPage | 873 | |
| oaire.citation.startPage | 868 | |
| oaire.citation.title | Proceedings 2024 IEEE Wic International Conference on Web Intelligence and Intelligent Agent Technology Wi Iat 2024 | |
| oairecerif.author.affiliation | Mahidol University | |
| oairecerif.author.affiliation | King Mongkut's University of Technology Thonburi |
