Hip prosthesis failure prediction through radiological deep sequence learning

被引:0
|
作者
Masciulli, Francesco [1 ]
Corti, Anna [1 ]
Lindemann, Alessia [1 ]
Chiappetta, Katia [2 ]
Loppini, Mattia [2 ,3 ,4 ]
Corino, Valentina D. A. [1 ,5 ]
机构
[1] Politecn Milan, Dept Elect Informat & Bioengn, Via Golgi 39, I-20131 Milan, MI, Italy
[2] IRCCS Humanitas Res Hosp, Via Alessandro Manzoni 56, I-20089 Rozzano, MI, Italy
[3] Humanitas Univ, Dept Biomed Sci, Via R Levi Montalcini 4, I-20090 Pieve Emanuele, MI, Italy
[4] Fdn Livio Sciutto Ric Biomed Ortopedia ONLUS, Via A Magliotto 2, I-17100 Savona, SV, Italy
[5] Ctr Cardiol Monzino IRCCS, Cardio Tech Lab, Via Carlo Parea 4, I-20138 Milan, Italy
关键词
Artificial intelligence; Hip replacement; Image classification; Temporal dependency; ARTHROPLASTY; COMPONENT;
D O I
10.1016/j.ijmedinf.2025.105802
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Background: Existing deep learning studies for the automated detection of hip prosthesis failure only consider the last available radiographic image. However, using longitudinal data is thought to improve the prediction, by combining temporal and spatial components. The aim of this study is to develop artificial intelligence models for predicting hip implant failure from multiple subsequent plain radiographs. Methods: A cohort of 224 patients was considered for model development and a balanced cohort of 14 patients was used for external validation. A sequence of two or three anteroposterior radiographic images per patient was considered to track the prosthesis over time. A combination of a convolutional neural network (CNN) and a recurrent section was used. For the CNN, a pretrained autoencoder, a pretrained RadImageNet DenseNet and a pretrained custom DenseNet were considered. The recurrent section was implemented using either a single Gated Recurrent Unit (GRU) layer or a Long Short-Term Memory block. Results: Considering 3 images as input provided a positive predictive value (PPV) of 0.966 and an f1 score of 0.933 on the validation set. Regarding the 2-image models, using the postoperative and the last image resulted in PPV of 0.933 and f1 score of 0.918, whereas using the second-to-last image with the post-operative one reached a PPV of 0.882 and f1 score of 0.923. On the external validation set, the 3-image model reached an accuracy of 0.786. Conclusion: This study demonstrated the potential of the developed models, based on a series of plain radiographs, to predict hip prosthesis failure.
引用
收藏
页数:8
相关论文
empty
未找到相关数据