Synthetic Data for Scam Detection: Leveraging LLMs to Train Deep Learning Models

dc.contributor.authorGumphusiri P.
dc.contributor.authorTriyason T.
dc.contributor.correspondenceGumphusiri P.
dc.contributor.otherMahidol University
dc.date.accessioned2025-06-10T18:18:39Z
dc.date.available2025-06-10T18:18:39Z
dc.date.issued2024-01-01
dc.description.abstractThis paper presents a novel approach to training scam detection models using synthetic data generated by Large Language Models (LLMs). We propose single-agent and multi-agent methods for data generation and train six deep learning architectures-LSTM, BiLSTM, GRU, BiGRU, CNN, and BERT-to classify conversations as scam or non-scam. Our experiments demonstrate that models trained on synthetic data achieve high accuracy on both generated test sets and real-world scam conversations. The models perform well even with limited conversation turns and when analyzing only the suspect's messages, indicating potential for early scam detection and privacy-preserving applications. Our findings highlight the efficacy of synthetic data in overcoming real-world dataset limitations for scam detection. We make the dataset and trained models publicly available to facilitate further research and development in this critical area of fraud prevention.
dc.identifier.citationProceedings 2024 IEEE Wic International Conference on Web Intelligence and Intelligent Agent Technology Wi Iat 2024 (2024) , 868-873
dc.identifier.doi10.1109/WI-IAT62293.2024.00141
dc.identifier.scopus2-s2.0-105007152224
dc.identifier.urihttps://repository.li.mahidol.ac.th/handle/123456789/110613
dc.rights.holderSCOPUS
dc.subjectComputer Science
dc.titleSynthetic Data for Scam Detection: Leveraging LLMs to Train Deep Learning Models
dc.typeConference Paper
mu.datasource.scopushttps://www.scopus.com/inward/record.uri?partnerID=HzOxMe3b&scp=105007152224&origin=inward
oaire.citation.endPage873
oaire.citation.startPage868
oaire.citation.titleProceedings 2024 IEEE Wic International Conference on Web Intelligence and Intelligent Agent Technology Wi Iat 2024
oairecerif.author.affiliationMahidol University
oairecerif.author.affiliationKing Mongkut's University of Technology Thonburi

Files

Collections