SINet: A hybrid deep CNN model for real-time detection and segmentation of surgical instruments

被引:2
|
作者
Liu, Zhenzhong
Zhou, Yifan
Zheng, Laiwang
Zhang, Guobin
机构
[1] Tianjin Univ Technol, Sch Mech Engn, Tianjin Key Lab Adv Mechatron Syst Design & Intel, Tianjin 300384, Peoples R China
[2] Tianjin Univ Technol, Natl Demonstrat Ctr Expt Mech & Elect Engn Educ, Tianjin, Peoples R China
关键词
Deep learning; Object detection; Surgical instruments; Semantic segmentation; MINIMALLY INVASIVE SURGERY; NEURAL-NETWORKS; LOCALIZATION; SYSTEM; TOOLS;
D O I
10.1016/j.bspc.2023.105670
中图分类号
R318 [生物医学工程];
学科分类号
0831 ;
摘要
Objective: Detection and segmentation of surgical instruments is an indispensable technology in robot-assisted surgery that enables doctors to obtain more comprehensive visual information and further improve the safety of surgery. However, the results of a detection are more easily interfered by environmental factors, such as instrument shaking, incomplete displays and insufficient light. To overcome those issues, we designed a hybrid deep-CNN model (SINet) for real-time surgical instrument detection and segmentation. Methods: The framework employs YOLOv5 as the object detection model and introduces a GAM attention mechanism to improve its feature extraction abilities. During training, the SiLU activation function is adopted to avoid gradient explosions and unstable training situations. Specifically, the vector angle relationship between the ground truth boxes and the prediction boxes was applied in the SIoU loss function to reduce the degree of freedom of the regression and accelerate the network convergence. Finally, a semantic segmentation head is used to implement detections of the surgical instruments by paralleling the detection and segmentation. Results: The proposed method is evaluated on the m2cai16-tool-locations public dataset and achieved a significant 97.9% mean average precision (mAP), 133 frames per second (FPS), 85.7% mean intersection over union (MIoU) and 86.6% Dice. Experiment based on simulated surgery platform also shows satisfactory detection performance. Conclusion: Experimental results demonstrated that the SINet can effectively detect the pose of surgical instruments and achieves a better performance than most of the current algorithms. The method has the potential to help perform a series of surgical operations efficiently and safely.
引用
收藏
页数:11
相关论文
共 50 条
  • [1] InstrumentNet: An integrated model for real-time segmentation of intracranial surgical instruments
    Liu, Zhenzhong
    Zheng, Laiwang
    Gu, Lin
    Yang, Shubin
    Zhong, Zichen
    Zhang, Guobin
    COMPUTERS IN BIOLOGY AND MEDICINE, 2023, 166
  • [2] Real-time traffic incident detection based on a hybrid deep learning model
    Li, Linchao
    Lin, Yi
    Du, Bowen
    Yang, Fan
    Ran, Bin
    TRANSPORTMETRICA A-TRANSPORT SCIENCE, 2022, 18 (01) : 78 - 98
  • [3] A Real-time Semantic Segmentation Model for Lane Detection
    Ma, Chen-Xu
    Li, Jing-Ang
    Han, Yong-Hua
    Wang, Yu-Meng
    Mu, Hai-Bo
    Jiang, Lu-Rong
    Journal of Network Intelligence, 2024, 9 (04): : 2234 - 2257
  • [4] A hybrid deformable model for real-time surgical simulation
    Zhu, Bo
    Gu, Lixu
    COMPUTERIZED MEDICAL IMAGING AND GRAPHICS, 2012, 36 (05) : 356 - 365
  • [5] Attention-Guided Lightweight Network for Real-Time Segmentation of Robotic Surgical Instruments
    Ni, Zhen-Liang
    Bian, Gui-Bin
    Hou, Zeng-Guang
    Zhou, Xiao-Hu
    Xie, Xiao-Liang
    Li, Zhen
    2020 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2020, : 9939 - 9945
  • [6] SegTransConv: Transformer and CNN Hybrid Method for Real-Time Semantic Segmentation of Autonomous Vehicles
    Fan, Jiaqi
    Gao, Bingzhao
    Ge, Quanbo
    Ran, Yabing
    Zhang, Jia
    Chu, Hongqing
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2024, 25 (02) : 1586 - 1601
  • [7] SwinURNet: Hybrid Transformer-CNN Architecture for Real-Time Unstructured Road Segmentation
    Wang, Zhangyu
    Liao, Zhihao
    Zhou, Bin
    Yu, Guizhen
    Luo, Wenwen
    IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2024, 73
  • [8] Deep CNN Approach with Visual Features for Real-Time Pavement Crack Detection
    Kulambayev, Bakhytzhan
    Astaubayeva, Gulnar
    Tleuberdiyeva, Gulnara
    Alimkulova, Janna
    Nussupbekova, Gulzhan
    Kisseleva, Olga
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2024, 15 (03) : 319 - 328
  • [9] ECSNet: An Accelerated Real-Time Image Segmentation CNN Architecture for Pavement Crack Detection
    Zhang, Tianjie
    Wang, Donglei
    Lu, Yang
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2023, 24 (12) : 15105 - 15112
  • [10] An efficient lightweight CNN model for real-time fire smoke detection
    Sun, Bangyong
    Wang, Yu
    Wu, Siyuan
    JOURNAL OF REAL-TIME IMAGE PROCESSING, 2023, 20 (04)