Automated detection of steps in videos of strabismus surgery using deep learning

Authors
Zheng, Ce [1]
Li, Wen [2]
Wang, Siying [2]
Ye, Haiyun [2]
Xu, Kai [2]
Fang, Wangyi [2]
Dong, Yanli [2]
Wang, Zilei [2]
Qiao, Tong [2]
Affiliations
[1] Shanghai Jiao Tong Univ, Xinhua Hosp, Sch Med, Dept Ophthalmol, Shanghai, Peoples R China
[2] Shanghai Jiao Tong Univ, Shanghai Childrens Hosp, Sch Med, Dept Ophthalmol, Lu Ding Rd 355, Shanghai 200000, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
Deep learning; Strabismus surgery; Automated detection; Surgical videos; IMPACT;
DOI
10.1186/s12886-024-03504-8
Chinese Library Classification
R77 [Ophthalmology];
Discipline classification code
100212;
Abstract
Background Learning to perform strabismus surgery is an essential aspect of ophthalmologists' surgical training. An automated strategy for classifying surgical steps can improve the effectiveness of training curricula and make the evaluation of residents' performance more efficient. To this end, we aimed to develop and validate a deep learning (DL) model for automatically detecting the steps of strabismus surgery in videos.
Methods In this study, we gathered 479 strabismus surgery videos from Shanghai Children's Hospital, affiliated to Shanghai Jiao Tong University School of Medicine, spanning July 2017 to October 2021. The videos were manually cut into 3345 clips of the eight strabismus surgical steps based on the International Council of Ophthalmology's Ophthalmology Surgical Competency Assessment Rubrics (ICO-OSCAR: strabismus). The video dataset was randomly split at the eye level into training (60%), validation (20%), and testing (20%) datasets. We evaluated two hybrid DL algorithms: a Recurrent Neural Network (RNN)-based model and a Transformer-based model. The evaluation metrics were accuracy, area under the receiver operating characteristic curve (AUC), precision, recall, and F1-score.
Results In identifying the steps in video clips of strabismus surgery, the DL models achieved a macro-average AUC of 1.00 (95% CI 1.00-1.00) with the Transformer-based model and 0.98 (95% CI 0.97-1.00) with the RNN-based model. The Transformer-based model yielded a higher accuracy than the RNN-based model (0.96 vs. 0.83, p < 0.001). In detecting the individual steps of strabismus surgery, the predictive ability of the Transformer-based model was better than that of the RNN-based model. Precision ranged from 0.90 to 1 for the Transformer-based model and from 0.75 to 0.94 for the RNN-based model. The F1-score ranged from 0.93 to 1 for the Transformer-based model and from 0.78 to 0.92 for the RNN-based model.
Conclusion The DL models can automatically identify the steps of strabismus surgery in videos with high accuracy, and Transformer-based algorithms show excellent performance when modeling the spatiotemporal features of video frames.
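The eye-level split and the per-step metrics described in the abstract can be illustrated with a minimal sketch. This is not the authors' code: the function names (`split_by_eye`, `per_class_scores`, `macro_f1`), the eye identifiers, and the fixed seed are all hypothetical, and the abstract does not say how the split or the metrics were actually implemented. The sketch only shows the general idea that all clips from one eye go to the same partition, and that precision, recall, and F1-score are computed per step and then averaged.

```python
import random


def split_by_eye(clip_eye_ids, fractions=(0.6, 0.2, 0.2), seed=0):
    """Assign each clip to 'train', 'val', or 'test' so that all clips
    from the same eye land in the same partition (hypothetical helper)."""
    eyes = sorted(set(clip_eye_ids))
    rng = random.Random(seed)
    rng.shuffle(eyes)
    n_train = round(fractions[0] * len(eyes))
    n_val = round(fractions[1] * len(eyes))
    part_of_eye = {}
    for i, eye in enumerate(eyes):
        if i < n_train:
            part_of_eye[eye] = "train"
        elif i < n_train + n_val:
            part_of_eye[eye] = "val"
        else:
            part_of_eye[eye] = "test"
    return [part_of_eye[eye] for eye in clip_eye_ids]


def per_class_scores(y_true, y_pred):
    """Return {label: (precision, recall, f1)} for every label seen."""
    labels = sorted(set(y_true) | set(y_pred))
    pairs = list(zip(y_true, y_pred))
    scores = {}
    for label in labels:
        tp = sum(1 for t, p in pairs if t == label and p == label)
        fp = sum(1 for t, p in pairs if t != label and p == label)
        fn = sum(1 for t, p in pairs if t == label and p != label)
        precision = tp / (tp + fp) if tp + fp else 0.0
        recall = tp / (tp + fn) if tp + fn else 0.0
        denom = precision + recall
        f1 = 2 * precision * recall / denom if denom else 0.0
        scores[label] = (precision, recall, f1)
    return scores


def macro_f1(scores):
    """Unweighted mean of the per-class F1 values."""
    return sum(f1 for _, _, f1 in scores.values()) / len(scores)


if __name__ == "__main__":
    # Toy data: 10 hypothetical eyes with 3 clips each.
    eye_ids = ["eye%d" % (i // 3) for i in range(30)]
    partitions = split_by_eye(eye_ids)
    print(partitions[:6])
    # Toy step labels (0 and 1) rather than the eight ICO-OSCAR steps.
    print(per_class_scores([0, 0, 1, 1], [0, 1, 1, 1]))
```

Splitting by eye rather than by clip keeps clips of the same operation out of both the training and the testing sets, which would otherwise tend to inflate the reported test performance.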
Pages: 8