LRTD: long-range temporal dependency based active learning for surgical workflow recognition

被引:19
|
作者
Shi, Xueying [1 ]
Jin, Yueming [1 ]
Dou, Qi [1 ]
Heng, Pheng-Ann [1 ]
机构
[1] Chinese Univ Hong Kong, Dept Comp Sci & Engn, Hong Kong, Peoples R China
基金
中国国家自然科学基金;
关键词
Surgical workflow recognition; Active learning; Long-range temporal dependency; Intra-clip dependency; SEGMENTATION; TASKS;
D O I
10.1007/s11548-020-02198-9
中图分类号
R318 [生物医学工程];
学科分类号
0831 ;
摘要
Purpose Automatic surgical workflow recognition in video is an essentially fundamental yet challenging problem for developing computer-assisted and robotic-assisted surgery. Existing approaches with deep learning have achieved remarkable performance on analysis of surgical videos, however, heavily relying on large-scale labelled datasets. Unfortunately, the annotation is not often available in abundance, because it requires the domain knowledge of surgeons. Even for experts, it is very tedious and time-consuming to do a sufficient amount of annotations. Methods In this paper, we propose a novel active learning method for cost-effective surgical video analysis. Specifically, we propose a non-local recurrent convolutional network, which introduces non-local block to capture the long-range temporal dependency (LRTD) among continuous frames. We then formulate an intra-clip dependency score to represent the overall dependency within this clip. By ranking scores among clips in unlabelled data pool, we select the clips with weak dependencies to annotate, which indicates the most informative ones to better benefit network training. Results We validate our approach on a large surgical video dataset (Cholec80) by performing surgical workflow recognition task. By using our LRTD based selection strategy, we can outperform other state-of-the-art active learning methods who only consider neighbor-frame information. Using only up to 50% of samples, our approach can exceed the performance of full-data training. Conclusion By modeling the intra-clip dependency, our LRTD based strategy shows stronger capability to select informative video clips for annotation compared with other active learning methods, through the evaluation on a popular public surgical dataset. The results also show the promising potential of our framework for reducing annotation workload in the clinical practice.
引用
收藏
页码:1573 / 1584
页数:12
相关论文
共 50 条
  • [21] Surgical workflow recognition with temporal convolution and transformer for action segmentation
    Bokai Zhang
    Bharti Goel
    Mohammad Hasan Sarhan
    Varun Kejriwal Goel
    Rami Abukhalil
    Bindu Kalesan
    Natalie Stottler
    Svetlana Petculescu
    International Journal of Computer Assisted Radiology and Surgery, 2023, 18 : 785 - 794
  • [22] Temporal Dependency Rule Learning Based Group Activity Recognition in Smart Spaces
    Bourbia, Amine Lotfi
    Son, Heesuk
    Shin, Byoungheon
    Kim, Taehun
    Lee, Dongman
    Hyun, Soon J.
    PROCEEDINGS 2016 IEEE 40TH ANNUAL COMPUTER SOFTWARE AND APPLICATIONS CONFERENCE WORKSHOPS, VOL 1, 2016, : 658 - 663
  • [23] Temporal Network Embedding Enhanced With Long-Range Dynamics and Self-Supervised Learning
    Wang, Zhizheng
    Sun, Yuanyuan
    Yang, Zhihao
    Yang, Liang
    Lin, Hongfei
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, : 1 - 12
  • [24] A knowledge-based recognition algorithm for long-range infrared bridge images
    Cao, ZG
    Sun, Q
    Zhang, TX
    AUTOMATIC TARGET RECOGNITION XI, 2001, 4379 : 168 - 175
  • [25] Theory of critical phenomena with long-range temporal interaction
    Zeng, Shaolong
    Zhong, Fan
    PHYSICA SCRIPTA, 2023, 98 (07)
  • [26] A recognition algorithm based on knowledge framework for long-range infrared bridge images
    Sun, Q.
    Cao, Z.
    Zhang, T.
    Huazhong Ligong Daxue Xuebao/Journal Huazhong (Central China) University of Science and Technology, 2001, 29 (04): : 1 - 3
  • [27] Spectral long-range interaction of temporal incoherent solitons
    Xu, Gang
    Garnier, Josselin
    Picozzi, Antonio
    OPTICS LETTERS, 2014, 39 (03) : 590 - 593
  • [28] Emergence in kinetic roughening with long-range temporal correlations
    Wang, Shuting
    Xia, Hui
    PHYSICAL REVIEW E, 2025, 111 (02)
  • [29] Long-range temporal correlations of ocean surface currents
    Ashkenazy, Yosef
    Gildor, Hezi
    JOURNAL OF GEOPHYSICAL RESEARCH-OCEANS, 2009, 114
  • [30] Learning-based Long-range Axon Tracing in Dense Scenes
    Hernandez, Mark
    Brewster, Adam
    Thul, Larry
    Telfer, Brian A.
    Majumdar, Arjun
    Choi, Heejin
    Ku, Taeyun
    Chung, Kwanghun
    Brattain, Laura J.
    2018 IEEE 15TH INTERNATIONAL SYMPOSIUM ON BIOMEDICAL IMAGING (ISBI 2018), 2018, : 1578 - 1582