SINet: A hybrid deep CNN model for real-time detection and segmentation of surgical instruments

被引:2
|
作者
Liu, Zhenzhong
Zhou, Yifan
Zheng, Laiwang
Zhang, Guobin
机构
[1] Tianjin Univ Technol, Sch Mech Engn, Tianjin Key Lab Adv Mechatron Syst Design & Intel, Tianjin 300384, Peoples R China
[2] Tianjin Univ Technol, Natl Demonstrat Ctr Expt Mech & Elect Engn Educ, Tianjin, Peoples R China
关键词
Deep learning; Object detection; Surgical instruments; Semantic segmentation; MINIMALLY INVASIVE SURGERY; NEURAL-NETWORKS; LOCALIZATION; SYSTEM; TOOLS;
D O I
10.1016/j.bspc.2023.105670
中图分类号
R318 [生物医学工程];
学科分类号
0831 ;
摘要
Objective: Detection and segmentation of surgical instruments is an indispensable technology in robot-assisted surgery that enables doctors to obtain more comprehensive visual information and further improve the safety of surgery. However, the results of a detection are more easily interfered by environmental factors, such as instrument shaking, incomplete displays and insufficient light. To overcome those issues, we designed a hybrid deep-CNN model (SINet) for real-time surgical instrument detection and segmentation. Methods: The framework employs YOLOv5 as the object detection model and introduces a GAM attention mechanism to improve its feature extraction abilities. During training, the SiLU activation function is adopted to avoid gradient explosions and unstable training situations. Specifically, the vector angle relationship between the ground truth boxes and the prediction boxes was applied in the SIoU loss function to reduce the degree of freedom of the regression and accelerate the network convergence. Finally, a semantic segmentation head is used to implement detections of the surgical instruments by paralleling the detection and segmentation. Results: The proposed method is evaluated on the m2cai16-tool-locations public dataset and achieved a significant 97.9% mean average precision (mAP), 133 frames per second (FPS), 85.7% mean intersection over union (MIoU) and 86.6% Dice. Experiment based on simulated surgery platform also shows satisfactory detection performance. Conclusion: Experimental results demonstrated that the SINet can effectively detect the pose of surgical instruments and achieves a better performance than most of the current algorithms. The method has the potential to help perform a series of surgical operations efficiently and safely.
引用
收藏
页数:11
相关论文
共 50 条
  • [31] A Hybrid Approach to Real-Time Robotic Visual Navigation: Integrating Detection and Scene Segmentation
    Hu, Lingxiang
    Zhu, Xingfei
    Li, Dun
    Zhang, Fukai
    Zhang, Chengqiu
    2024 WRC SYMPOSIUM ON ADVANCED ROBOTICS AND AUTOMATION, WRC SARA, 2024, : 280 - 285
  • [32] Real-Time Segmentation of Non-rigid Surgical Tools Based on Deep Learning and Tracking
    Garcia-Peraza-Herrera, Luis C.
    Li, Wenqi
    Gruijthuijsen, Caspar
    Devreker, Alain
    Attilakos, George
    Deprest, Jan
    Vander Poorten, Emmanuel
    Stoyanov, Danail
    Vercauteren, Tom
    Ourselin, Sebastien
    COMPUTER-ASSISTED AND ROBOTIC ENDOSCOPY, 2017, 10170 : 84 - 95
  • [33] Real-time detection of road manhole covers with a deep learning model
    Pang, Dangfeng
    Guan, Zhiwei
    Luo, Tao
    Su, Wei
    Dou, Ruzhen
    SCIENTIFIC REPORTS, 2023, 13 (01)
  • [34] Real-time detection of road manhole covers with a deep learning model
    Dangfeng Pang
    Zhiwei Guan
    Tao Luo
    Wei Su
    Ruzhen Dou
    Scientific Reports, 13
  • [35] A REAL-TIME DEEP TRANSFER LEARNING MODEL FOR FACIAL MASK DETECTION
    Zhang, Edward
    2021 INTEGRATED COMMUNICATIONS NAVIGATION AND SURVEILLANCE CONFERENCE (ICNS), 2021,
  • [36] TinySegformer: A lightweight visual segmentation model for real-time agricultural pest detection
    Zhang, Yan
    Lv, Chunli
    COMPUTERS AND ELECTRONICS IN AGRICULTURE, 2024, 218
  • [37] Real-time segmentation of surgical instruments inside the abdominal cavity using a joint hue saturation color feature
    Doignon, C
    Graebling, P
    de Mathelin, M
    REAL-TIME IMAGING, 2005, 11 (5-6) : 429 - 442
  • [38] Real-time rubber quality model based on CNN-LSTM deep learning theory
    Han, Shanling
    Dong, Wenzheng
    Sun, He
    Xiao, Peng
    Zhang, Shoudong
    Chen, Long
    Li, Yong
    MATERIALS TODAY COMMUNICATIONS, 2023, 35
  • [39] Deep learning model for real-time semantic segmentation during intraoperative robotic prostatectomy
    In, Il P.
    Gon, Sung P.
    Won, Ji K.
    Jung, Min K.
    Donghyun, L.
    Tae, Sung C.
    Goo, Young L.
    Pak, S.
    EUROPEAN UROLOGY, 2024, 85 : S2026 - S2026
  • [40] Real-Time Traffic Sign Detection and Recognition using CNN
    Santos, D.
    Silva, F.
    Pereira, D.
    Almeida, L.
    Artero, A.
    Piteri, M.
    de Albuquerque, V
    IEEE LATIN AMERICA TRANSACTIONS, 2020, 18 (03) : 522 - 529