Finetuning Language Model for Person Description Search in Thai

dc.contributor.authorYuenyong S.
dc.contributor.otherMahidol University
dc.date.accessioned2023-06-18T17:02:24Z
dc.date.available2023-06-18T17:02:24Z
dc.date.issued2022-01-01
dc.description.abstractPerson description search is matching a textual description of a person with the image of the same person. This is a multimodal image-text task, where the model generally has two branches: image and text. The objective is for these two branches to embed their respective input into a joint space, where the embeddings should be near each other if the image and text pair is a match, and far apart if they are not. The image branch can simply use pretrained vision models off-the-shelf without any modification, because 'person' is a common class in large image datasets. For the text branch on the other hand, person descriptions are not part of the dataset commonly used to train large language models (LM). Recent deep learning language models are based on the transformer architecture, which are commonly trained using large text corpus using masked language model loss. In this paper we propose finetuning the transformer-based LM in an unsupervised manner using the person description text before supervised training on the actual task. The result shows that unsupervised LM finetuning is beneficial for Thai person description search.
dc.identifier.citation6th International Conference on Information Technology, InCIT 2022 (2022) , 207-210
dc.identifier.doi10.1109/InCIT56086.2022.10067683
dc.identifier.scopus2-s2.0-85151633056
dc.identifier.urihttps://repository.li.mahidol.ac.th/handle/123456789/84301
dc.rights.holderSCOPUS
dc.subjectComputer Science
dc.titleFinetuning Language Model for Person Description Search in Thai
dc.typeConference Paper
mu.datasource.scopushttps://www.scopus.com/inward/record.uri?partnerID=HzOxMe3b&scp=85151633056&origin=inward
oaire.citation.endPage210
oaire.citation.startPage207
oaire.citation.title6th International Conference on Information Technology, InCIT 2022
oairecerif.author.affiliationMahidol University

Files

Collections