Thai Privacy Notice Analysis Based On Named-Entity Recognition Technique

dc.contributor.authorWongvises C.
dc.contributor.authorKhurat A.
dc.contributor.authorNoraset T.
dc.contributor.otherMahidol University
dc.date.accessioned2023-06-18T17:02:25Z
dc.date.available2023-06-18T17:02:25Z
dc.date.issued2022-01-01
dc.description.abstractWith the spread of global technology and the rising value of personal data, privacy has become a major problem. Many countries have enacted privacy laws to protect their citizens' privacy. Thailand has a similar law known as the Personal Data Protection Act (PDPA). According to the PDPA, the data controllers must provide data subjects with information about handling of personal data which is generally referred to as privacy notices. A privacy notice is typically a comprehensive document that uses formal language. As a result, many customers find it difficult to comprehend essential information in a short time. One of Natural Language Processing (NLP) methods, Named-Entity Recognition (NER), is a technique that can be used to capture the main contents of a document. The feasibility of NLP techniques for extracting significant privacy practices regarding the PDPA requirements for Thai privacy notices, however, has not yet been explored. Therefore, this study explores this feasibility and proposes a dataset of Thai privacy notices and a privacy annotation scheme based on PDPA requirements. The effectiveness of NLP approaches in extracting Thai privacy practice information is also evaluated in this study. The privacy notices comprehension has been formulated into the problem of Thai privacy information extraction, Thai privacy NER. The results show that the pre-trained transformer model outperforms the traditional method, and the increasing training dataset can affect higher performance value.
dc.identifier.citationICSEC 2022 - International Computer Science and Engineering Conference 2022 (2022) , 257-262
dc.identifier.doi10.1109/ICSEC56337.2022.10049321
dc.identifier.scopus2-s2.0-85149706383
dc.identifier.urihttps://repository.li.mahidol.ac.th/handle/20.500.14594/84302
dc.rights.holderSCOPUS
dc.subjectComputer Science
dc.titleThai Privacy Notice Analysis Based On Named-Entity Recognition Technique
dc.typeConference Paper
mu.datasource.scopushttps://www.scopus.com/inward/record.uri?partnerID=HzOxMe3b&scp=85149706383&origin=inward
oaire.citation.endPage262
oaire.citation.startPage257
oaire.citation.titleICSEC 2022 - International Computer Science and Engineering Conference 2022
oairecerif.author.affiliationMahidol University

Files

Collections