Convolutional-de-convolutional neural networks for recognition of surgical workflow

被引:0
|
作者
Chen, Yu-Wen [1 ]
Zhang, Ju [1 ]
Wang, Peng [2 ]
Hu, Zheng-Yu [1 ]
Zhong, Kun-Hua [1 ]
机构
[1] Chongqing Institute of Green and Intelligent Technology, Chinese Academy of Sciences, Chongqing, China
[2] Southwest Hospital, Third Military Medical University, Chongqing, China
关键词
Convolutional neural networks - Deep neural networks - Knowledge management - Learning systems - Semantics - Transfer learning;
D O I
暂无
中图分类号
学科分类号
摘要
Computer-assisted surgery (CAS) has occupied an important position in modern surgery, further stimulating the progress of methodology and technology. In recent years, a large number of computer vision-based methods have been widely used in surgical workflow recognition tasks. For training the models, a lot of annotated data are necessary. However, the annotation of surgical data requires expert knowledge and thus becomes difficult and time-consuming. In this paper, we focus on the problem of data deficiency and propose a knowledge transfer learning method based on artificial neural network to compensate a small amount of labeled training data. To solve this problem, we propose an unsupervised method for pre-training a Convolutional-De-Convolutional (CDC) neural network for sequencing surgical workflow frames, which performs neural convolution in space (for semantic abstraction) and neural de-convolution in time (for frame level resolution) simultaneously. Specifically, through neural convolution transfer learning, we only fine-tuned the CDC neural network to classify the surgical phase. We performed some experiments for validating the model, and it showed that the proposed model can effectively extract the surgical feature and determine the surgical phase. The accuracy (Acc), recall, precision (Pres) of our model reached 91.4, 78.9, and 82.5%, respectively. Copyright © 2022 Chen, Zhang, Wang, Hu and Zhong.
引用
收藏
相关论文
共 50 条
  • [31] Driving Posture Recognition by Convolutional Neural Networks
    Yan, Chao
    Zhang, Bailing
    Coenen, Frans
    2015 11TH INTERNATIONAL CONFERENCE ON NATURAL COMPUTATION (ICNC), 2015, : 680 - 685
  • [32] Personality Recognition Using Convolutional Neural Networks
    Gimenez, Maite
    Paredes, Roberto
    Rosso, Paolo
    COMPUTATIONAL LINGUISTICS AND INTELLIGENT TEXT PROCESSING, CICLING 2017, PT II, 2018, 10762 : 313 - 323
  • [33] Facial Expression Recognition with Convolutional Neural Networks
    Singh, Shekhar
    Nasoz, Fatma
    2020 10TH ANNUAL COMPUTING AND COMMUNICATION WORKSHOP AND CONFERENCE (CCWC), 2020, : 324 - 328
  • [34] Convolutional Neural Networks for the Recognition of Malayalam Characters
    Anil, R.
    Manjusha, K.
    Kumar, S. Sachin
    Soman, K. P.
    PROCEEDINGS OF THE 3RD INTERNATIONAL CONFERENCE ON FRONTIERS OF INTELLIGENT COMPUTING: THEORY AND APPLICATIONS (FICTA) 2014, VOL 2, 2015, 328 : 493 - 500
  • [35] Evaluation of convolutional neural networks for visual recognition
    Neubauer, C
    IEEE TRANSACTIONS ON NEURAL NETWORKS, 1998, 9 (04): : 685 - 696
  • [36] AN ANALYSIS OF CONVOLUTIONAL NEURAL NETWORKS FOR SPEECH RECOGNITION
    Huang, Jui-Ting
    Li, Jinyu
    Gong, Yifan
    2015 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP), 2015, : 4989 - 4993
  • [37] Convolutional Neural Networks for Traffic Sign Recognition
    Wei, Zhonghua
    Gu, Heng
    Zhang, Ran
    Peng, Jingxuan
    Qui, Shi
    CICTP 2021: ADVANCED TRANSPORTATION, ENHANCED CONNECTION, 2021, : 399 - 409
  • [38] CONVOLUTIONAL NEURAL NETWORKS FOR NOISE SIGNAL RECOGNITION
    Portsev, Ruslan J.
    Makarenko, Andrey V.
    2018 IEEE 28TH INTERNATIONAL WORKSHOP ON MACHINE LEARNING FOR SIGNAL PROCESSING (MLSP), 2018,
  • [39] Ear Recognition In The Wild with Convolutional Neural Networks
    Ramos-Cooper, Solange
    Camara-Chavez, Guillermo
    2021 XLVII LATIN AMERICAN COMPUTING CONFERENCE (CLEI 2021), 2021,
  • [40] Speech Recognition Based on Convolutional Neural Networks
    Du Guiming
    Wang Xia
    Wang Guangyan
    Zhang Yan
    Li Dan
    2016 IEEE INTERNATIONAL CONFERENCE ON SIGNAL AND IMAGE PROCESSING (ICSIP), 2016, : 708 - 711