Finetuning Language Model for Person Description Search in Thai
| dc.contributor.author | Yuenyong S. | |
| dc.contributor.other | Mahidol University | |
| dc.date.accessioned | 2023-06-18T17:02:24Z | |
| dc.date.available | 2023-06-18T17:02:24Z | |
| dc.date.issued | 2022-01-01 | |
| dc.description.abstract | Person description search is the task of matching a textual description of a person with an image of the same person. This is a multimodal image-text task, where the model generally has two branches: image and text. The objective is for these two branches to embed their respective inputs into a joint space, where the embeddings should be near each other if the image and text pair is a match, and far apart if it is not. The image branch can simply use pretrained vision models off-the-shelf without any modification, because 'person' is a common class in large image datasets. For the text branch, on the other hand, person descriptions are not part of the datasets commonly used to train large language models (LM). Recent deep learning language models are based on the transformer architecture and are commonly trained on large text corpora with a masked language modeling loss. In this paper we propose finetuning the transformer-based LM in an unsupervised manner on the person description text before supervised training on the actual task. The results show that unsupervised LM finetuning is beneficial for Thai person description search. | |
| dc.identifier.citation | 6th International Conference on Information Technology, InCIT 2022 (2022), 207-210 | |
| dc.identifier.doi | 10.1109/InCIT56086.2022.10067683 | |
| dc.identifier.scopus | 2-s2.0-85151633056 | |
| dc.identifier.uri | https://repository.li.mahidol.ac.th/handle/123456789/84301 | |
| dc.rights.holder | SCOPUS | |
| dc.subject | Computer Science | |
| dc.title | Finetuning Language Model for Person Description Search in Thai | |
| dc.type | Conference Paper | |
| mu.datasource.scopus | https://www.scopus.com/inward/record.uri?partnerID=HzOxMe3b&scp=85151633056&origin=inward | |
| oaire.citation.endPage | 210 | |
| oaire.citation.startPage | 207 | |
| oaire.citation.title | 6th International Conference on Information Technology, InCIT 2022 | |
| oairecerif.author.affiliation | Mahidol University |
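
The joint-space matching objective described in the abstract can be illustrated with a minimal sketch. This is not the paper's implementation: the abstract does not state the similarity measure, the loss, or the embedding size, so cosine similarity, the function names `cosine_similarity` and `rank_images`, and the three-dimensional toy vectors below are all assumptions made for illustration. In the actual setting, the embeddings would come from the finetuned text branch and the pretrained image branch.

```python
import math


def cosine_similarity(a, b):
    """Cosine similarity between two equal-length vectors (assumed measure)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


def rank_images(text_embedding, image_embeddings):
    """Rank candidate images by similarity to a text embedding, best first.

    `image_embeddings` maps a hypothetical image id to its embedding in the
    joint space shared with the text branch.
    """
    scored = [
        (image_id, cosine_similarity(text_embedding, embedding))
        for image_id, embedding in image_embeddings.items()
    ]
    return sorted(scored, key=lambda pair: pair[1], reverse=True)


# Toy, made-up embeddings: a matching pair should lie close together,
# non-matching pairs far apart.
text_embedding = [0.9, 0.1, 0.0]
image_embeddings = {
    "person_a": [1.0, 0.0, 0.0],
    "person_b": [0.0, 1.0, 0.0],
    "person_c": [0.0, 0.0, 1.0],
}

ranking = rank_images(text_embedding, image_embeddings)
best_match = ranking[0][0]
```

Here the description embedding points almost entirely along the same direction as `person_a`, so that image is ranked first. Training the two branches amounts to shaping the embeddings so that this ranking puts the true match on top.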
