Detection and segmentation method of surgical instruments based on improved YOLOv5s

被引:0
|
作者
Meng Xiao-liang [1 ]
Zhao Ji-kang [1 ]
Wang Xiao-yu [1 ]
Zhang Li-ye [1 ]
Song Zheng [1 ]
机构
[1] Shandong Univ Technol, Sch Comp Sci & Technol, Zibo 255000, Peoples R China
基金
中国国家自然科学基金;
关键词
surgical instruments; target detection; semantic segmentation; attention mechanism; SEMANTIC SEGMENTATION; NEURAL-NETWORK; LIGHTWEIGHT;
D O I
10.37188/CJLCD.2023-0025
中图分类号
O7 [晶体学];
学科分类号
0702 ; 070205 ; 0703 ; 080501 ;
摘要
In the process of endoscopic surgery,surgeons need to know the position information of surgical instruments in real time. The existing target detection algorithms are affected by factors such as reflection and shadow,and there is still optimization space for the accuracy and missed detection rate. This paper proposes a detection and segmentation method of surgical instruments based on improved YOLOv5s. Firstly,the brightness and contrast of images are corrected by Gamma correction algorithm to solve the problems of reflection and shadow occlusion of surgical instruments. Secondly, convolutional block attention module(CBAM) and dynamic convolution module are designed to increase the weight of important feature information,which further improves the accuracy of target detection and reduces the missed detection rate of the model. At the same time,the spatial pyramid pooling module is optimized to expand the receptive field, so as to better identify multi-scale targets. Finally, the feature pyramid networks(FPN)semantic segmentation head is designed to realize the semantic segmentation. Experimental results on endoscopic surgery dataset show that the mAP@ 0. 5 of target detection in this paper is 98. 2%, and the mIoU of semantic segmentation is 94. 0%. The proposed method can assist surgeons to quickly grasp the position and type of surgical instruments,and improve the efficiency of surgery.
引用
收藏
页码:1698 / 1706
页数:9
相关论文
共 23 条
  • [1] Sigmoid-weighted linear units for neural network function approximation in reinforcement learning
    Elfwing, Stefan
    Uchibe, Eiji
    Doya, Kenji
    [J]. NEURAL NETWORKS, 2018, 107 : 3 - 11
  • [2] Elhassan MAM, 2023, Arxiv, DOI arXiv:2204.01278
  • [3] Lightweight and high-precision object detection algorithm based on YOLO framework
    Fan Xin-chuan
    Chen Chun-mei
    [J]. CHINESE JOURNAL OF LIQUID CRYSTALS AND DISPLAYS, 2023, 38 (07) : 945 - 954
  • [4] The Future of Endoscopic Navigation: A Review of Advanced Endoscopic Vision Technology
    Fu, Zuoming
    Jin, Ziyi
    Zhang, Chongan
    He, Zhongyu
    Zha, Zhenzhou
    Hu, Chunyong
    Gan, Tianyuan
    Yan, Qinglai
    Wang, Peng
    Ye, Xuesong
    [J]. IEEE ACCESS, 2021, 9 : 41144 - 41167
  • [5] DoubleU-Net: A Deep Convolutional Neural Network for Medical Image Segmentation
    Jha, Debesh
    Riegler, Michael A.
    Johansen, Dag
    Halvorsen, Pal
    Johansen, Havard D.
    [J]. 2020 IEEE 33RD INTERNATIONAL SYMPOSIUM ON COMPUTER-BASED MEDICAL SYSTEMS(CBMS 2020), 2020, : 558 - 564
  • [6] Tool Detection and Operative Skill Assessment in Surgical Videos Using Region-Based Convolutional Neural Networks
    Jin, Amy
    Yeung, Serena
    Jopling, Jeffrey
    Krause, Jonathan
    Azagury, Dan
    Milstein, Arnold
    Li Fei-Fei
    [J]. 2018 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2018), 2018, : 691 - 699
  • [7] Li Chunyuan, 2021, PROC INT C LEARN REP
  • [8] An Anchor-Free Convolutional Neural Network for Real-Time Surgical Tool Detection in Robot-Assisted Surgery
    Liu, Yuying
    Zhao, Zijian
    Chang, Faliang
    Hu, Sanyuan
    [J]. IEEE ACCESS, 2020, 8 : 78193 - 78201
  • [9] DSRD-Net: Dual-stream residual dense network for semantic segmentation of instruments in robot-assisted surgery
    Mahmood, Tahir
    Cho, Se Woon
    Park, Kang Ryoung
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2022, 202
  • [10] SurgiNet: Pyramid Attention Aggregation and Class-wise Self-Distillation for Surgical Instrument Segmentation
    Ni, Zhen-Liang
    Zhou, Xiao-Hu
    Wang, Guan-An
    Yue, Wen-Qian
    Li, Zhen
    Bian, Gui-Bin
    Hou, Zeng-Guang
    [J]. MEDICAL IMAGE ANALYSIS, 2022, 76