Object and anatomical feature recognition in surgical video images based on a convolutional neural network

被引:13
|
作者
Bamba, Yoshiko [1 ]
Ogawa, Shimpei [1 ]
Itabashi, Michio [1 ]
Shindo, Hironari [2 ]
Kameoka, Shingo [3 ]
Okamoto, Takahiro [4 ]
Yamamoto, Masakazu [1 ]
机构
[1] Tokyo Womens Med Univ, Inst Gastroenterol, Dept Surg, Shinjuku Ku, 8-1 Kawadacho, Tokyo 1628666, Japan
[2] Otsuki Municipal Cent Hosp, Yamanashi, Japan
[3] Ushiku Aiwa Hosp, Ibaraki, Japan
[4] Tokyo Womens Med Univ, Dept Breast Endocrinol Surg, Tokyo, Japan
关键词
Image-guided navigation technology; Surgical education; Convolutional neural network; Computer vision; Object detection; GASTRIC-CANCER; SURGERY;
D O I
10.1007/s11548-021-02434-w
中图分类号
R318 [生物医学工程];
学科分类号
0831 ;
摘要
Purpose Artificial intelligence-enabled techniques can process large amounts of surgical data and may be utilized for clinical decision support to recognize or forecast adverse events in an actual intraoperative scenario. To develop an image-guided navigation technology that will help in surgical education, we explored the performance of a convolutional neural network (CNN)-based computer vision system in detecting intraoperative objects. Methods The surgical videos used for annotation were recorded during surgeries conducted in the Department of Surgery of Tokyo Women's Medical University from 2019 to 2020. Abdominal endoscopic images were cut out from manually captured surgical videos. An open-source programming framework for CNN was used to design a model that could recognize and segment objects in real time through IBM Visual Insights. The model was used to detect the GI tract, blood, vessels, uterus, forceps, ports, gauze and clips in the surgical images. Results The accuracy, precision and recall of the model were 83%, 80% and 92%, respectively. The mean average precision (mAP), the calculated mean of the precision for each object, was 91%. Among surgical tools, the highest recall and precision of 96.3% and 97.9%, respectively, were achieved for forceps. Among the anatomical structures, the highest recall and precision of 92.9% and 91.3%, respectively, were achieved for the GI tract. Conclusion The proposed model could detect objects in operative images with high accuracy, highlighting the possibility of using AI-based object recognition techniques for intraoperative navigation. Real-time object recognition will play a major role in navigation surgery and surgical education.
引用
收藏
页码:2045 / 2054
页数:10
相关论文
共 50 条
  • [21] Hand Frame Extraction in Surgical Video Images Using Convolutional Neural Network
    Sakib, Shadman
    Hossain, Belayat
    Hiranaka, Takafumi
    Kobashi, Syoji
    2020 JOINT 11TH INTERNATIONAL CONFERENCE ON SOFT COMPUTING AND INTELLIGENT SYSTEMS AND 21ST INTERNATIONAL SYMPOSIUM ON ADVANCED INTELLIGENT SYSTEMS (SCIS-ISIS), 2020, : 350 - 354
  • [22] Recurrent Convolutional Neural Network for Object Recognition
    Liang, Ming
    Hu, Xiaolin
    2015 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2015, : 3367 - 3375
  • [23] Convolutional Neural Network Based Automatic Object Detection on Aerial Images
    Sevo, Igor
    Avramovic, Aleksej
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2016, 13 (05) : 740 - 744
  • [24] Remote Sensing Image Object Recognition Based on Convolutional Neural Network
    Zhen, Yumei
    Liu, Huanyu
    Li, Junbao
    Hu, Cong
    Pan, Jeng-Shyang
    PROCEEDINGS FIRST INTERNATIONAL CONFERENCE ON ELECTRONICS INSTRUMENTATION & INFORMATION SYSTEMS (EIIS 2017), 2017, : 814 - 817
  • [25] Face Recognition Based On Gabor Local Feature and Convolutional Neural Network
    Qin, Weimeng
    Wang, Lie
    Luo, Wen
    PROCEEDINGS OF THE 2ND INTERNATIONAL CONFERENCE ON COMPUTER ENGINEERING, INFORMATION SCIENCE & APPLICATION TECHNOLOGY (ICCIA 2017), 2017, 74 : 571 - 576
  • [26] Convolutional Neural Network Implementation for Eye Movement Recognition based on Video
    Cheng, Bing
    Zhang, Chao
    Ding, Xiaojuan
    Wu, Xiaopei
    PROCEEDINGS 2018 33RD YOUTH ACADEMIC ANNUAL CONFERENCE OF CHINESE ASSOCIATION OF AUTOMATION (YAC), 2018, : 179 - 184
  • [27] Video Vehicle Detection and Recognition Based on MapReduce and Convolutional Neural Network
    Chen, Mingsong
    Wang, Weiguang
    Dong, Shi
    Zhou, Xinling
    ADVANCES IN SWARM INTELLIGENCE, ICSI 2018, PT II, 2018, 10942 : 552 - 562
  • [28] Video-based face recognition based on deep convolutional neural network
    Zhai, Yilong
    He, Dongzhi
    PROCEEDINGS OF 2019 INTERNATIONAL CONFERENCE ON IMAGE, VIDEO AND SIGNAL PROCESSING (IVSP 2019), 2019, : 23 - 27
  • [29] Inception recurrent convolutional neural network for object recognition
    Alom, Md Zahangir
    Hasan, Mahmudul
    Yakopcic, Chris
    Taha, Tarek M.
    Asari, Vijayan K.
    MACHINE VISION AND APPLICATIONS, 2021, 32 (01)
  • [30] Inception recurrent convolutional neural network for object recognition
    Md Zahangir Alom
    Mahmudul Hasan
    Chris Yakopcic
    Tarek M. Taha
    Vijayan K. Asari
    Machine Vision and Applications, 2021, 32