Comparison of CNN- and Transformer-based Architectures for Automated Oral Epithelium Segmentation on Whole Slide Images
dc.contributor.author | Srisermphoak N. | |
dc.contributor.author | Amornphimoltham P. | |
dc.contributor.author | Chaisuparat R. | |
dc.contributor.author | Achararit P. | |
dc.contributor.author | Fuangrod T. | |
dc.contributor.other | Mahidol University | |
dc.date.accessioned | 2023-12-20T18:01:24Z | |
dc.date.available | 2023-12-20T18:01:24Z | |
dc.date.issued | 2023-01-01 | |
dc.description.abstract | Oral cancer is one of the most commonly found cancers worldwide. Oral Epithelial Dysplasia (OED) is an Oral Potentially Malignant Disorder (OPMD) that can be characterized for preventive oral cancer screening. The standard for OED histological grading is conducted via the epithelial regions of tissue biopsies. However, this procedure is laborious, time-consuming, and subjective; consequently, it is prone to variability due to fatigue and limited expertise. Therefore, this study aims to explore the potential of using Convolutional Neural Network (CNN) and Transformer models for an automated epithelium segmentation algorithm directly from Whole Slide Images (WSIs). This approach can reduce the manual process and support pathologists in grading activities. Accordingly, candidate architectures based on CNN and Transformer are selected: UNet, ResNet50-UNet, VGG19-UNet, Swin-UNet, and MISSFormer. These models are trained using patch-based segmentation to mitigate the high computational cost caused by processing WSIs. The results indicate that UNet, optimized with the ADAM optimizer, demonstrates the best performance in patch-based segmentation with Intersection over Union (IoU) of 0.82 and Dice-Similarity Coefficient (DSC) of 0.87. Furthermore, this model achieves the highest IoU and DSC for tissue-level prediction, scoring 0.88 and 0.94, respectively. According to the experiment, overlapping and non-overlapping patching strategies perform similarly in most of the selected architectures. The latter approach, hence, is suggested for computational efficiency. These results can support enhancing automated epithelium segmentation to provide a reliable tool for assisting pathologists. | |
dc.identifier.citation | BMEiCON 2023 - 15th Biomedical Engineering International Conference (2023) | |
dc.identifier.doi | 10.1109/BMEiCON60347.2023.10322006 | |
dc.identifier.scopus | 2-s2.0-85179558062 | |
dc.identifier.uri | https://repository.li.mahidol.ac.th/handle/20.500.14594/91556 | |
dc.rights.holder | SCOPUS | |
dc.subject | Computer Science | |
dc.title | Comparison of CNN- and Transformer-based Architectures for Automated Oral Epithelium Segmentation on Whole Slide Images | |
dc.type | Conference Paper | |
mu.datasource.scopus | https://www.scopus.com/inward/record.uri?partnerID=HzOxMe3b&scp=85179558062&origin=inward | |
oaire.citation.title | BMEiCON 2023 - 15th Biomedical Engineering International Conference | |
oairecerif.author.affiliation | Mahidol University, Faculty of Dentistry | |
oairecerif.author.affiliation | Chulalongkorn University | |
oairecerif.author.affiliation | Chulabhorn Royal Academy |