Deep learning-based head and neck deformable image registration using spatio-temporal analysis and self attention

dc.contributor.authorLee D.
dc.contributor.authorHu Y.C.
dc.contributor.authorTreeChairusame T.
dc.contributor.authorOh J.H.
dc.contributor.authorLee N.
dc.contributor.authorAristophanous M.
dc.contributor.authorCerviño L.
dc.contributor.authorZhang P.
dc.contributor.correspondenceLee D.
dc.contributor.otherMahidol University
dc.date.accessioned2026-04-10T18:38:37Z
dc.date.available2026-04-10T18:38:37Z
dc.date.issued2026-01-01
dc.description.abstractBackground and purposeSignificant anatomical changes during head and neck cancer (HNC) radiotherapy challenge accurate dose delivery. Deformable image registration (DIR) is essential for adaptive radiotherapy (ART), yet conventional methods are too slow for online clinical use. This study proposed a novel deep learning-based DIR algorithm for longitudinal HNC imaging.Materials & methodsWe used sixty HNC patient datasets, each containing a planning CT (pCT) and six weekly cone-beam CTs (CBCTs). Fifty datasets were used for training with cross-validation, and the remaining ten were reserved for testing. The proposed DIR algorithm is a patch-based model that integrates 3D convolutional neural networks, self-attention, and a convolutional Long Short Term Memory to model temporal deformations. The model predicted bidirectional deformation vector fields and was trained with a composite loss function combining image similarity, DVF smoothness, and inverse consistency. Performance was benchmarked against the large deformation diffeomorphic metric mapping (LDDMM) algorithm using Dice similarity coefficient (DSC), Hausdorff distance, and Jacobian analysis.ResultsThe proposed method achieved significantly faster inference, performing bidirectional DIR between the pCT and all six weekly CBCTs in under 3 min and averaging about 30 s per patient, while matching or exceeding LDDMM’s accuracy. DSC remained above 0.8 for all key structures, and the method demonstrated improved DVF consistency with lower mean and 95th percentile Hausdorff distances. Unlike LDDMM, it required no manual parameter tuning, providing consistent results.ConclusionThe proposed DIR algorithm enabled rapid, accurate, and consistent image registration, supporting real-time ART workflows and retrospective dose accumulation in personalized radiotherapy.
dc.identifier.citationPhysics and Imaging in Radiation Oncology Vol.37 (2026)
dc.identifier.doi10.1016/j.phro.2026.100928
dc.identifier.eissn24056316
dc.identifier.scopus2-s2.0-105034505363
dc.identifier.urihttps://repository.li.mahidol.ac.th/handle/123456789/116103
dc.rights.holderSCOPUS
dc.subjectMedicine
dc.subjectPhysics and Astronomy
dc.titleDeep learning-based head and neck deformable image registration using spatio-temporal analysis and self attention
dc.typeArticle
mu.datasource.scopushttps://www.scopus.com/inward/record.uri?partnerID=HzOxMe3b&scp=105034505363&origin=inward
oaire.citation.titlePhysics and Imaging in Radiation Oncology
oaire.citation.volume37
oairecerif.author.affiliationMemorial Sloan-Kettering Cancer Center
oairecerif.author.affiliationSiriraj Hospital

Files

Collections