DeepMetaForge: A Deep Vision-Transformer Metadata-Fusion Network for Automatic Skin Lesion Classification

dc.contributor.author: Vachmanus S.
dc.contributor.author: Noraset T.
dc.contributor.author: Piyanonpong W.
dc.contributor.author: Rattananukrom T.
dc.contributor.author: Tuarob S.
dc.contributor.correspondence: Vachmanus S.
dc.contributor.other: Mahidol University
dc.date.accessioned: 2024-02-08T18:08:49Z
dc.date.available: 2024-02-08T18:08:49Z
dc.date.issued: 2023-01-01
dc.description.abstract: Skin cancer is a dangerous form of cancer that develops slowly in skin cells. Delays in diagnosing and treating these malignant skin conditions may have serious repercussions. Conversely, early skin cancer detection has been shown to improve treatment outcomes. This paper proposes DeepMetaForge, a deep-learning framework for skin cancer detection from metadata-accompanied images. The proposed framework utilizes BEiT, a vision transformer pre-trained with a masked image modeling task, as the image-encoding backbone. We further propose merging the encoded metadata with the derived visual characteristics while simultaneously compressing the aggregate information, simulating how photos with metadata are interpreted. Experimental results on four public datasets of dermoscopic and smartphone skin lesion images reveal that the best configuration of our proposed framework achieves a macro-average F1 of 87.1% on average. The empirical scalability analysis further shows that the proposed framework can be implemented in a variety of machine-learning paradigms, including applications on low-resource devices and as services. The findings not only shed light on the possibility of implementing nationwide telemedicine solutions for skin cancer that could benefit those in need of quality healthcare, but also open doors to many intelligent applications in medicine where images and metadata are collected together, such as disease detection from CT-scan images and patients' expression recognition from facial images.
dc.identifier.citation: IEEE Access Vol.11 (2023), 145467-145484
dc.identifier.doi: 10.1109/ACCESS.2023.3345225
dc.identifier.eissn: 21693536
dc.identifier.scopus: 2-s2.0-85181583293
dc.identifier.uri: https://repository.li.mahidol.ac.th/handle/123456789/95597
dc.rights.holder: SCOPUS
dc.subject: Materials Science
dc.subject: Computer Science
dc.subject: Engineering
dc.title: DeepMetaForge: A Deep Vision-Transformer Metadata-Fusion Network for Automatic Skin Lesion Classification
dc.type: Article
mu.datasource.scopus: https://www.scopus.com/inward/record.uri?partnerID=HzOxMe3b&scp=85181583293&origin=inward
oaire.citation.endPage: 145484
oaire.citation.startPage: 145467
oaire.citation.title: IEEE Access
oaire.citation.volume: 11
oairecerif.author.affiliation: Ramathibodi Hospital
oairecerif.author.affiliation: Mahidol University
