LRTD: long-range temporal dependency based active learning for surgical workflow recognition

Cited by: 19
Authors
Shi, Xueying [1]
Jin, Yueming [1]
Dou, Qi [1]
Heng, Pheng-Ann [1]
Affiliations
[1] Chinese Univ Hong Kong, Dept Comp Sci & Engn, Hong Kong, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Surgical workflow recognition; Active learning; Long-range temporal dependency; Intra-clip dependency; Segmentation; Tasks
DOI
10.1007/s11548-020-02198-9
Chinese Library Classification
R318 [Biomedical Engineering]
Discipline classification code
0831
Abstract
Purpose: Automatic surgical workflow recognition in video is a fundamental yet challenging problem in developing computer-assisted and robotic-assisted surgery. Existing deep learning approaches have achieved remarkable performance in surgical video analysis, but they rely heavily on large-scale labelled datasets. Such annotations are rarely available in abundance because they require the domain knowledge of surgeons. Even for experts, producing a sufficient number of annotations is tedious and time-consuming.
Methods: We propose a novel active learning method for cost-effective surgical video analysis. Specifically, we propose a non-local recurrent convolutional network, which introduces a non-local block to capture the long-range temporal dependency (LRTD) among continuous frames. We then formulate an intra-clip dependency score to represent the overall dependency within a clip. By ranking these scores across the clips in the unlabelled data pool, we select the clips with weak dependencies for annotation, since these are the most informative ones for network training.
Results: We validate our approach on a large surgical video dataset (Cholec80) by performing the surgical workflow recognition task. With our LRTD-based selection strategy, we outperform other state-of-the-art active learning methods, which consider only neighbor-frame information. Using at most 50% of the samples, our approach exceeds the performance of full-data training.
Conclusion: By modeling the intra-clip dependency, our LRTD-based strategy shows a stronger capability to select informative video clips for annotation than other active learning methods, as evaluated on a popular public surgical dataset. The results also show the promising potential of our framework for reducing annotation workload in clinical practice.
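The selection step described in the abstract (score each unlabelled clip, rank the clips, annotate those with the weakest dependency) can be sketched as follows. This is only an illustration under stated assumptions: the abstract does not give the formula for the intra-clip dependency score, so the mean pairwise cosine similarity used here is a hypothetical stand-in, not the authors' definition. The names `intra_clip_dependency`, `select_clips`, `pool`, and `budget` are likewise invented for this example, and the frame features are toy vectors rather than outputs of the non-local recurrent convolutional network.

```python
import math
from typing import Dict, List, Sequence

Vector = Sequence[float]


def cosine(u: Vector, v: Vector) -> float:
    """Cosine similarity between two frame feature vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm > 0.0 else 0.0


def intra_clip_dependency(frames: Sequence[Vector]) -> float:
    """Hypothetical score (an assumption, not the paper's formula):
    mean similarity over all distinct frame pairs in the clip,
    including pairs far apart in time, not only neighbouring frames."""
    n = len(frames)
    if n < 2:
        return 0.0
    total = 0.0
    for i in range(n):
        for j in range(i + 1, n):
            total += cosine(frames[i], frames[j])
    return total / (n * (n - 1) / 2)


def select_clips(pool: Dict[str, Sequence[Vector]], budget: int) -> List[str]:
    """Return the ids of the `budget` clips with the weakest dependency."""
    ranked = sorted(pool, key=lambda cid: intra_clip_dependency(pool[cid]))
    return ranked[:budget]


# Toy unlabelled pool: clip id -> per-frame feature vectors (made-up values).
pool = {
    "clip_a": [[1.0, 0.0], [1.0, 0.0], [1.0, 0.0]],  # identical frames
    "clip_b": [[1.0, 0.0], [0.0, 1.0], [1.0, 0.0]],  # middle frame differs
    "clip_c": [[1.0, 0.0], [1.0, 1.0], [0.0, 1.0]],  # gradual change
}
selected = select_clips(pool, budget=2)
print(selected)
```

In this toy pool, the clip whose frames are all identical receives the highest score and is left unselected, which matches the intuition in the abstract that clips with strong internal dependency add little new information when annotated.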
Pages: 1573-1584
Number of pages: 12