Explainable Deep Learning for Glaucomatous Visual Field Prediction: Artifact Correction Enhances Transformer Models

dc.contributor.author: Sriwatana K.
dc.contributor.author: Puttanawarut C.
dc.contributor.author: Suwan Y.
dc.contributor.author: Achakulvisut T.
dc.contributor.correspondence: Sriwatana K.
dc.contributor.other: Mahidol University
dc.date.accessioned: 2025-02-11T18:20:54Z
dc.date.available: 2025-02-11T18:20:54Z
dc.date.issued: 2025-01-02
dc.description.abstract: Purpose: The purpose of this study was to develop a deep learning approach that restores artifact-laden optical coherence tomography (OCT) scans and predicts functional loss on the 24-2 Humphrey Visual Field (HVF) test. Methods: This cross-sectional, retrospective study used 1674 visual field (VF)-OCT pairs from 951 eyes for training and 429 pairs from 345 eyes for testing. Peripapillary retinal nerve fiber layer (RNFL) thickness map artifacts were corrected using a generative diffusion model. Three convolutional neural networks and two transformer-based models were trained on original and artifact-corrected datasets to estimate the 54 sensitivity thresholds of the 24-2 HVF test. Results: Predictive performance was evaluated using root mean square error (RMSE) and mean absolute error (MAE), with explainability assessed through GradCAM, attention maps, and dimensionality reduction techniques. The Distillation with No Labels (DINO) Vision Transformer (ViT) trained on artifact-corrected datasets achieved the highest accuracy (RMSE = 4.44 decibels [dB], 95% confidence interval [CI] = 4.07-4.82; MAE = 3.46 dB, 95% CI = 3.14-3.79) and the greatest interpretability, showing improvements of 0.15 dB in global RMSE and MAE (P < 0.05) compared to its performance on original maps. Feature maps and visualization tools indicate that artifacts compromise DINO-ViT's predictive ability and that predictions improve with artifact correction. Conclusions: Combining self-supervised ViTs with generative artifact correction enhances the correlation between glaucomatous structure and function. Translational Relevance: Our approach offers a comprehensive tool for glaucoma management, facilitates the exploration of structure-function correlations in research, and underscores the importance of addressing artifacts in the clinical interpretation of OCT.
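The global RMSE and MAE reported in the abstract are computed over the 54 pointwise sensitivity thresholds (in dB) of the 24-2 HVF test. A minimal sketch of these metrics, assuming `pred` and `true` are arrays of shape `(n_eyes, 54)` (the function name and array shapes are illustrative assumptions, not the paper's code):

```python
import numpy as np

def vf_errors(pred: np.ndarray, true: np.ndarray) -> tuple[float, float]:
    """Global RMSE and MAE (dB) between predicted and measured
    24-2 HVF sensitivity thresholds.

    pred, true: arrays of shape (n_eyes, 54); names and shapes are
    illustrative assumptions, not taken from the study's code.
    """
    diff = pred - true
    rmse = float(np.sqrt(np.mean(diff ** 2)))
    mae = float(np.mean(np.abs(diff)))
    return rmse, mae

# Toy example with two eyes and two test points each.
pred = np.array([[28.0, 30.0], [25.0, 27.0]])
true = np.array([[30.0, 30.0], [25.0, 24.0]])
rmse, mae = vf_errors(pred, true)
```

The 95% confidence intervals quoted for RMSE and MAE would typically come from resampling (e.g., bootstrapping over eyes), which is omitted here for brevity.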
dc.identifier.citation: Translational vision science & technology Vol.14 No.1 (2025), 22
dc.identifier.doi: 10.1167/tvst.14.1.22
dc.identifier.eissn: 2164-2591
dc.identifier.pmid: 39847375
dc.identifier.scopus: 2-s2.0-85216608372
dc.identifier.uri: https://repository.li.mahidol.ac.th/handle/20.500.14594/104205
dc.rights.holder: SCOPUS
dc.subject: Medicine
dc.subject: Engineering
dc.title: Explainable Deep Learning for Glaucomatous Visual Field Prediction: Artifact Correction Enhances Transformer Models
dc.type: Article
mu.datasource.scopus: https://www.scopus.com/inward/record.uri?partnerID=HzOxMe3b&scp=85216608372&origin=inward
oaire.citation.issue: 1
oaire.citation.title: Translational vision science & technology
oaire.citation.volume: 14
oairecerif.author.affiliation: Faculty of Medicine Ramathibodi Hospital, Mahidol University
oairecerif.author.affiliation: Mahidol University