SINet: A hybrid deep CNN model for real-time detection and segmentation of surgical instruments

被引:2
|
作者
Liu, Zhenzhong
Zhou, Yifan
Zheng, Laiwang
Zhang, Guobin
机构
[1] Tianjin Univ Technol, Sch Mech Engn, Tianjin Key Lab Adv Mechatron Syst Design & Intel, Tianjin 300384, Peoples R China
[2] Tianjin Univ Technol, Natl Demonstrat Ctr Expt Mech & Elect Engn Educ, Tianjin, Peoples R China
关键词
Deep learning; Object detection; Surgical instruments; Semantic segmentation; MINIMALLY INVASIVE SURGERY; NEURAL-NETWORKS; LOCALIZATION; SYSTEM; TOOLS;
D O I
10.1016/j.bspc.2023.105670
中图分类号
R318 [生物医学工程];
学科分类号
0831 ;
摘要
Objective: Detection and segmentation of surgical instruments is an indispensable technology in robot-assisted surgery that enables doctors to obtain more comprehensive visual information and further improve the safety of surgery. However, the results of a detection are more easily interfered by environmental factors, such as instrument shaking, incomplete displays and insufficient light. To overcome those issues, we designed a hybrid deep-CNN model (SINet) for real-time surgical instrument detection and segmentation. Methods: The framework employs YOLOv5 as the object detection model and introduces a GAM attention mechanism to improve its feature extraction abilities. During training, the SiLU activation function is adopted to avoid gradient explosions and unstable training situations. Specifically, the vector angle relationship between the ground truth boxes and the prediction boxes was applied in the SIoU loss function to reduce the degree of freedom of the regression and accelerate the network convergence. Finally, a semantic segmentation head is used to implement detections of the surgical instruments by paralleling the detection and segmentation. Results: The proposed method is evaluated on the m2cai16-tool-locations public dataset and achieved a significant 97.9% mean average precision (mAP), 133 frames per second (FPS), 85.7% mean intersection over union (MIoU) and 86.6% Dice. Experiment based on simulated surgery platform also shows satisfactory detection performance. Conclusion: Experimental results demonstrated that the SINet can effectively detect the pose of surgical instruments and achieves a better performance than most of the current algorithms. The method has the potential to help perform a series of surgical operations efficiently and safely.
引用
收藏
页数:11
相关论文
共 50 条
  • [41] Real-time smoke detection with Faster R-CNN
    Li, Lei
    Liu, Fenggang
    Ding, Yidan
    PROCEEDINGS OF 2021 2ND INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND INFORMATION SYSTEMS (ICAIIS '21), 2021,
  • [42] Real-Time Anomaly Detection of Network Traffic Based on CNN
    Liu, Haitao
    Wang, Haifeng
    SYMMETRY-BASEL, 2023, 15 (06):
  • [43] Real-time Obstacle Detection by Road Plane Segmentation
    Santhanam, S.
    Balisavira, V.
    Pandey, V. K.
    2013 IEEE 9TH INTERNATIONAL COLLOQUIUM ON SIGNAL PROCESSING AND ITS APPLICATIONS (CSPA), 2013, : 151 - 154
  • [44] REAL-TIME EDGE-DETECTION AND IMAGE SEGMENTATION
    CHONG, CP
    SALAMA, CAT
    SMITH, KC
    ANALOG INTEGRATED CIRCUITS AND SIGNAL PROCESSING, 1992, 2 (02) : 117 - 130
  • [45] Weakly supervised segmentation for real-time surgical tool tracking
    Lee, Eung-Joo
    Plishker, William
    Liu, Xinyang
    Bhattacharyya, Shuvra S.
    Shekhar, Raj
    HEALTHCARE TECHNOLOGY LETTERS, 2019, 6 (06) : 231 - 236
  • [46] AUTOMATIC REAL-TIME CNN-BASED NEONATAL BRAIN VENTRICLES SEGMENTATION
    Wang, Puyang
    Cuccolo, Nick. G.
    Tyagi, Rachana
    Hacihaliloglu, Ilker
    Patel, Vishal M.
    2018 IEEE 15TH INTERNATIONAL SYMPOSIUM ON BIOMEDICAL IMAGING (ISBI 2018), 2018, : 716 - 719
  • [47] Cascaded CNN for Real-time Tongue Segmentation Based on Key Points Localization
    Yuan, Wei
    Liu, Changsong
    2019 4TH IEEE INTERNATIONAL CONFERENCE ON BIG DATA ANALYTICS (ICBDA 2019), 2019, : 303 - 307
  • [48] In-vivo real-time tracking of surgical instruments in endoscopic video
    Bouarfa, Loubna
    Akman, Oytun
    Schneider, Armin
    Jonker, Pieter P.
    Dankelman, Jenny
    MINIMALLY INVASIVE THERAPY & ALLIED TECHNOLOGIES, 2012, 21 (03) : 129 - 134
  • [49] CNN Implementation of a Moving Object Segmentation Approach for Real-Time Video Surveillance
    Rodriguez-Fernandez, D.
    Vilarino, D. L.
    Pardo, X. M.
    2008 11TH INTERNATIONAL WORKSHOP ON CELLULAR NEURAL NETWORKS AND THEIR APPLICATIONS, 2008, : 129 - 134
  • [50] An Interpretable CNN for the Segmentation of the Left Ventricle in Cardiac MRI by Real-Time Visualization
    Liu, Jun
    Yuan, Geng
    Yang, Changdi
    Song, Houbing
    Luo, Liang
    CMES-COMPUTER MODELING IN ENGINEERING & SCIENCES, 2023, 135 (02): : 1571 - 1587