Comparison of CNN- and Transformer-based Architectures for Automated Oral Epithelium Segmentation on Whole Slide Images

Srisermphoak N.; Amornphimoltham P.; Chaisuparat R.; Achararit P.; Fuangrod T.

Comparison of CNN- and Transformer-based Architectures for Automated Oral Epithelium Segmentation on Whole Slide Images

Issued Date

2023-01-01

Resource Type

Conference Paper

DOI

10.1109/BMEiCON60347.2023.10322006

Scopus ID

2-s2.0-85179558062

Journal Title

BMEiCON 2023 - 15th Biomedical Engineering International Conference

Rights Holder(s)

SCOPUS

Bibliographic Citation

BMEiCON 2023 - 15th Biomedical Engineering International Conference (2023)

Suggested Citation

Srisermphoak N., Amornphimoltham P., Chaisuparat R., Achararit P., Fuangrod T. Comparison of CNN- and Transformer-based Architectures for Automated Oral Epithelium Segmentation on Whole Slide Images. BMEiCON 2023 - 15th Biomedical Engineering International Conference (2023). doi:10.1109/BMEiCON60347.2023.10322006 Retrieved from: https://repository.li.mahidol.ac.th/handle/20.500.14594/91556

Title

Comparison of CNN- and Transformer-based Architectures for Automated Oral Epithelium Segmentation on Whole Slide Images

Author(s)

Srisermphoak N.
Amornphimoltham P.
Chaisuparat R.
Achararit P.
Fuangrod T.

Author's Affiliation

Mahidol University, Faculty of Dentistry
Chulalongkorn University
Chulabhorn Royal Academy

Other Contributor(s)

Mahidol University

Abstract

Oral cancer is one of the most commonly found cancers worldwide. Oral Epithelial Dysplasia (OED) is an Oral Potentially Malignant Disorder (OPMD) that can be characterized for preventive oral cancer screening. The standard for OED histological grading is conducted via the epithelial regions of tissue biopsies. However, this procedure is laborious, time-consuming, and subjective; consequently, it is prone to variability due to fatigue and limited expertise. Therefore, this study aims to explore the potential of using Convolutional Neural Network (CNN) and Transformer models for an automated epithelium segmentation algorithm directly from Whole Slide Images (WSIs). This approach can reduce the manual process and support pathologists in grading activities. Accordingly, candidate architectures based on CNN and Transformer are selected: UNet, ResNet50-UNet, VGG19-UNet, Swin-UNet, and MISSFormer. These models are trained using patch-based segmentation to mitigate the high computational cost caused by processing WSIs. The results indicate that UNet, optimized with the ADAM optimizer, demonstrates the best performance in patch-based segmentation with Intersection over Union (IoU) of 0.82 and Dice-Similarity Coefficient (DSC) of 0.87. Furthermore, this model achieves the highest IoU and DSC for tissue-level prediction, scoring 0.88 and 0.94, respectively. According to the experiment, overlapping and non-overlapping patching strategies perform similarly in most of the selected architectures. The latter approach, hence, is suggested for computational efficiency. These results can support enhancing automated epithelium segmentation to provide a reliable tool for assisting pathologists.

Keyword(s)

Computer Science

URI

https://repository.li.mahidol.ac.th/handle/20.500.14594/91556

Collections

Scopus 2023

Full item page

Send Feedback

	Office Hour: Monday-Friday 08.30-12.00 and 13.00-16.30 hrs.
	Phutthamonthon Sai 4 Rd. Salaya, Nakhon Pathom 73170, Thailand
	The office: +66 (2) 800 2680 ext.4306
	thipsuda.van@mahidol.ac.th
	https://repository.li.mahidol.ac.th