Automated tongue segmentation using deep encoder-decoder model
Issued Date
2023-01-01
Resource Type
ISSN
13807501
eISSN
15737721
Scopus ID
2-s2.0-85150375955
Journal Title
Multimedia Tools and Applications
Rights Holder(s)
SCOPUS
Bibliographic Citation
Multimedia Tools and Applications (2023)
Suggested Citation
Kusakunniran W., Borwarnginn P., Imaromkul T., Aukkapinyo K., Thongkanchorn K., Wattanadhirach D., Mongkolluksamee S., Thammasudjarit R., Ritthipravat P., Tuakta P., Benjapornlert P. Automated tongue segmentation using deep encoder-decoder model. Multimedia Tools and Applications (2023). doi:10.1007/s11042-023-15061-1 Retrieved from: https://repository.li.mahidol.ac.th/handle/20.500.14594/81794
Title
Automated tongue segmentation using deep encoder-decoder model
Other Contributor(s)
Abstract
This paper proposes a solution for tongue segmentation in images. The solution relies on a convolutional neural network: a deep U-Net with deep layers of encoder-decoder modules. The model is trained with a starting resolution of 512 x 512 pixels. To enhance the segmentation performance of the trained model across recording environments, three main types of data augmentation are applied during training: additive Gaussian noise, multiply-and-add-to-brightness, and color-temperature change. These augmentations also mitigate the limited number of samples in the available datasets. The proposed method is evaluated with four metrics: Dice coefficient, mean IoU, Jaccard distance, and accuracy. The model is trained on publicly available datasets and then transferred and tested on a self-collected dataset from a real-world environment.
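The three augmentations named in the abstract can be illustrated with a minimal NumPy sketch. This is not the authors' code; the parameter values (noise scale, brightness multiplier, warming factor) are assumptions chosen for illustration, and the input is assumed to be an 8-bit RGB image.

```python
import numpy as np

def additive_gaussian_noise(img, scale=10.0, rng=None):
    # Add zero-mean Gaussian noise per pixel (stddev `scale` is an assumed value).
    rng = rng if rng is not None else np.random.default_rng(0)
    noisy = img.astype(np.float32) + rng.normal(0.0, scale, img.shape)
    return np.clip(noisy, 0, 255).astype(np.uint8)

def multiply_and_add_to_brightness(img, mul=1.2, add=15.0):
    # Scale and shift pixel intensities to vary overall brightness.
    out = img.astype(np.float32) * mul + add
    return np.clip(out, 0, 255).astype(np.uint8)

def change_color_temperature(img, warm=1.1):
    # Shift the red/blue channel balance to mimic a warmer or cooler light source.
    out = img.astype(np.float32)
    out[..., 0] *= warm   # boost red channel
    out[..., 2] /= warm   # attenuate blue channel
    return np.clip(out, 0, 255).astype(np.uint8)
```

Applying such transforms randomly during training exposes the model to varied lighting and sensor conditions, which is the stated goal of improving robustness across recording environments.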