Automating Manga Character Analysis: A Robust Deep Vision-Transformer Approach to Facial Landmark Detection

dc.contributor.authorVachmanus S.
dc.contributor.authorPhinklao N.
dc.contributor.authorPhongsarnariyakul N.
dc.contributor.authorPlongcharoen T.
dc.contributor.authorHotta S.
dc.contributor.authorTuarob S.
dc.contributor.correspondenceVachmanus S.
dc.contributor.otherMahidol University
dc.date.accessioned2024-09-25T18:12:30Z
dc.date.available2024-09-25T18:12:30Z
dc.date.issued2024-01-01
dc.description.abstractComics, particularly Japanese manga, are a powerful medium that blends images and text to convey ideas and encapsulate a unique cultural heritage. Going beyond mere entertainment, manga merges diverse styles and content deeply rooted in Japanese cultural heritage. This study utilizes computer vision analysis, with a specific focus on facial landmark detection, acknowledging the growing significance of technology in analyzing manga images. Through a comprehensive exploration of various methods, the research identifies the extended version of Bidirectional Encoder Representations from Transformers (BERT), BERT Pre-Training of Image Transformers (BEiT), model as a standout performer due to its efficiency and effectiveness. The BEiT model's success lies in its ability to extract facial features, consequently establishing itself as a go-To solution for landmark detection on manga faces. The outcomes achieved the lowest Failure Rate compared to other landmark detection networks, with a Failure Rate of approximately 9.4% and a Mean Average Error of about 4.6 pixels. Beyond its technical accomplishments, this study carries a cultural significance, contributing to the ongoing narrative of manga in Japan.
dc.identifier.citationIEEE Access (2024)
dc.identifier.doi10.1109/ACCESS.2024.3459419
dc.identifier.eissn21693536
dc.identifier.scopus2-s2.0-85204185759
dc.identifier.urihttps://repository.li.mahidol.ac.th/handle/20.500.14594/101342
dc.rights.holderSCOPUS
dc.subjectMaterials Science
dc.subjectComputer Science
dc.subjectEngineering
dc.titleAutomating Manga Character Analysis: A Robust Deep Vision-Transformer Approach to Facial Landmark Detection
dc.typeArticle
mu.datasource.scopushttps://www.scopus.com/inward/record.uri?partnerID=HzOxMe3b&scp=85204185759&origin=inward
oaire.citation.titleIEEE Access
oairecerif.author.affiliationMahidol University
oairecerif.author.affiliationTokyo University of Agriculture and Technology

Files

Collections