Deep learning-based head and neck deformable image registration using spatio-temporal analysis and self attention
| dc.contributor.author | Lee D. | |
| dc.contributor.author | Hu Y.C. | |
| dc.contributor.author | TreeChairusame T. | |
| dc.contributor.author | Oh J.H. | |
| dc.contributor.author | Lee N. | |
| dc.contributor.author | Aristophanous M. | |
| dc.contributor.author | Cerviño L. | |
| dc.contributor.author | Zhang P. | |
| dc.contributor.correspondence | Lee D. | |
| dc.contributor.other | Mahidol University | |
| dc.date.accessioned | 2026-04-10T18:38:37Z | |
| dc.date.available | 2026-04-10T18:38:37Z | |
| dc.date.issued | 2026-01-01 | |
| dc.description.abstract | Background and purposeSignificant anatomical changes during head and neck cancer (HNC) radiotherapy challenge accurate dose delivery. Deformable image registration (DIR) is essential for adaptive radiotherapy (ART), yet conventional methods are too slow for online clinical use. This study proposed a novel deep learning-based DIR algorithm for longitudinal HNC imaging.Materials & methodsWe used sixty HNC patient datasets, each containing a planning CT (pCT) and six weekly cone-beam CTs (CBCTs). Fifty datasets were used for training with cross-validation, and the remaining ten were reserved for testing. The proposed DIR algorithm is a patch-based model that integrates 3D convolutional neural networks, self-attention, and a convolutional Long Short Term Memory to model temporal deformations. The model predicted bidirectional deformation vector fields and was trained with a composite loss function combining image similarity, DVF smoothness, and inverse consistency. Performance was benchmarked against the large deformation diffeomorphic metric mapping (LDDMM) algorithm using Dice similarity coefficient (DSC), Hausdorff distance, and Jacobian analysis.ResultsThe proposed method achieved significantly faster inference, performing bidirectional DIR between the pCT and all six weekly CBCTs in under 3 min and averaging about 30 s per patient, while matching or exceeding LDDMM’s accuracy. DSC remained above 0.8 for all key structures, and the method demonstrated improved DVF consistency with lower mean and 95th percentile Hausdorff distances. Unlike LDDMM, it required no manual parameter tuning, providing consistent results.ConclusionThe proposed DIR algorithm enabled rapid, accurate, and consistent image registration, supporting real-time ART workflows and retrospective dose accumulation in personalized radiotherapy. | |
| dc.identifier.citation | Physics and Imaging in Radiation Oncology Vol.37 (2026) | |
| dc.identifier.doi | 10.1016/j.phro.2026.100928 | |
| dc.identifier.eissn | 24056316 | |
| dc.identifier.scopus | 2-s2.0-105034505363 | |
| dc.identifier.uri | https://repository.li.mahidol.ac.th/handle/123456789/116103 | |
| dc.rights.holder | SCOPUS | |
| dc.subject | Medicine | |
| dc.subject | Physics and Astronomy | |
| dc.title | Deep learning-based head and neck deformable image registration using spatio-temporal analysis and self attention | |
| dc.type | Article | |
| mu.datasource.scopus | https://www.scopus.com/inward/record.uri?partnerID=HzOxMe3b&scp=105034505363&origin=inward | |
| oaire.citation.title | Physics and Imaging in Radiation Oncology | |
| oaire.citation.volume | 37 | |
| oairecerif.author.affiliation | Memorial Sloan-Kettering Cancer Center | |
| oairecerif.author.affiliation | Siriraj Hospital |
