Deep-learning-based head pose estimation from a single RGB image and its application to medical CROM measurement

dc.contributor.author: Ritthipravat P.
dc.contributor.author: Chotikkakamthorn K.
dc.contributor.author: Lie W.N.
dc.contributor.author: Kusakunniran W.
dc.contributor.author: Tuakta P.
dc.contributor.author: Benjapornlert P.
dc.contributor.correspondence: Ritthipravat P.
dc.contributor.other: Mahidol University
dc.date.accessioned: 2024-02-28T18:23:12Z
dc.date.available: 2024-02-28T18:23:12Z
dc.date.issued: 2024-01-01
dc.description.abstract: In humans, neck movement degrades with aging, trauma, musculoskeletal disorders, or degenerative diseases. Cervical range of motion (CROM) measurement is one of the most popular quantitative neck examinations. Although radiography is considered the gold standard, it is invasive, exposes patients to radiation, and is expensive. Recently, vision-based methods have been applied to CROM measurement, but they produce large errors and require a depth camera. Deep neural networks, on the other hand, perform well on head pose estimation (HPE) from a single image and are thus promising for medical CROM measurement. We propose using CNNs to extract pyramidal (multi-level) image features, which are passed to cross-level attention modules for feature fusion, then to a modified atrous spatial pyramid pooling (ASPP) module for spatial-channel attention, and finally to a multi-bin classification/regression module for Euler angle conversion and prediction. The proposed technique was evaluated on the public datasets 300W_LP, AFLW2000, and BIWI, where it outperformed state-of-the-art methods (mean MAE = 3.50°, 3.40°, and 2.31° under different experimental protocols). Our pre-trained model was also evaluated for CROM measurement on a dataset we collected at a hospital. It achieved the lowest MAE among the compared methods (4.58°) and met the medical standard of 5° for the yaw (MAE = 3.60°) and roll (MAE = 4.44°) angles, but not for the pitch angle (MAE = 5.70°). Overall, HPE is feasible for CROM measurement and offers the advantages of speed, non-invasiveness, freedom from anatomical landmarks, and low operating cost.
dc.identifier.citation: Multimedia Tools and Applications (2024)
dc.identifier.doi: 10.1007/s11042-024-18612-2
dc.identifier.eissn: 15737721
dc.identifier.issn: 13807501
dc.identifier.scopus: 2-s2.0-85185472105
dc.identifier.uri: https://repository.li.mahidol.ac.th/handle/20.500.14594/97381
dc.rights.holder: SCOPUS
dc.subject: Computer Science
dc.subject: Engineering
dc.title: Deep-learning-based head pose estimation from a single RGB image and its application to medical CROM measurement
dc.type: Article
mu.datasource.scopus: https://www.scopus.com/inward/record.uri?partnerID=HzOxMe3b&scp=85185472105&origin=inward
oaire.citation.title: Multimedia Tools and Applications
oairecerif.author.affiliation: Ramathibodi Hospital
oairecerif.author.affiliation: Mahidol University
oairecerif.author.affiliation: National Chung Cheng University
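
The hospital-dataset figures in the abstract can be cross-checked with a few lines of arithmetic. The sketch below is illustrative only and is not the authors' evaluation code. It assumes that the overall MAE is the unweighted mean of the three per-angle MAEs, an assumption that happens to reproduce the reported 4.58°. The names `reported_mae`, `THRESHOLD_DEG`, `mean_mae`, and `angles_within` are hypothetical.

```python
# Per-angle MAE values (degrees) as reported in the abstract for the hospital dataset.
reported_mae = {"yaw": 3.60, "pitch": 5.70, "roll": 4.44}

# Medical standard cited in the abstract (degrees).
THRESHOLD_DEG = 5.0


def mean_mae(per_angle):
    """Unweighted mean of the per-angle MAEs (assumed aggregation)."""
    return sum(per_angle.values()) / len(per_angle)


def angles_within(per_angle, threshold):
    """Names of the angles whose MAE does not exceed the threshold, sorted."""
    return sorted(name for name, mae in per_angle.items() if mae <= threshold)


overall = mean_mae(reported_mae)
within = angles_within(reported_mae, THRESHOLD_DEG)

print(f"mean MAE = {overall:.2f} deg")
print("angles within threshold:", within)
```

Under that assumption the mean works out to 4.58°, and only the pitch angle exceeds the 5° threshold, which agrees with the abstract.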
