LRTD: long-range temporal dependency based active learning for surgical workflow recognition

Cited by: 19
Authors
Shi, Xueying [1 ]
Jin, Yueming [1 ]
Dou, Qi [1 ]
Heng, Pheng-Ann [1 ]
Affiliations
[1] Chinese Univ Hong Kong, Dept Comp Sci & Engn, Hong Kong, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
Surgical workflow recognition; Active learning; Long-range temporal dependency; Intra-clip dependency; SEGMENTATION; TASKS;
DOI
10.1007/s11548-020-02198-9
Chinese Library Classification
R318 [Biomedical Engineering];
Discipline classification code
0831 ;
Abstract
Purpose: Automatic surgical workflow recognition in video is a fundamental yet challenging problem in developing computer-assisted and robot-assisted surgery. Existing deep learning approaches have achieved remarkable performance on surgical video analysis, but they rely heavily on large-scale labelled datasets. Unfortunately, annotations are rarely available in abundance, because producing them requires the domain knowledge of surgeons. Even for experts, annotating a sufficient amount of data is tedious and time-consuming.
Methods: In this paper, we propose a novel active learning method for cost-effective surgical video analysis. Specifically, we propose a non-local recurrent convolutional network, which introduces a non-local block to capture the long-range temporal dependency (LRTD) among continuous frames. We then formulate an intra-clip dependency score to represent the overall dependency within a clip. By ranking these scores among the clips in the unlabelled data pool, we select the clips with weak dependencies for annotation, as these are the most informative ones for network training.
Results: We validate our approach on a large surgical video dataset (Cholec80) on the surgical workflow recognition task. With our LRTD-based selection strategy, we outperform other state-of-the-art active learning methods that consider only neighbor-frame information. Using only up to 50% of the samples, our approach can exceed the performance of full-data training.
Conclusion: By modeling the intra-clip dependency, our LRTD-based strategy shows a stronger capability to select informative video clips for annotation than other active learning methods, as evaluated on a popular public surgical dataset. The results also show the promising potential of our framework for reducing annotation workload in clinical practice.
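The selection step described in the abstract (score each unlabelled clip, rank the scores, annotate the clips with the weakest dependency) can be sketched as follows. This is an illustration under stated assumptions, not the paper's implementation: the abstract does not give the scoring formula, so mean pairwise cosine similarity between frame features stands in for the intra-clip dependency score, and the names `intra_clip_dependency`, `select_clips`, `pool`, and `budget` are hypothetical.

```python
from typing import Dict, List

import numpy as np


def intra_clip_dependency(features: np.ndarray) -> float:
    """Stand-in dependency score for one clip (an assumption, not the paper's formula).

    features: array of shape (T, D), one D-dimensional feature vector per frame.
    Returns the mean cosine similarity over all pairs of distinct frames.
    """
    t = features.shape[0]
    if t < 2:
        raise ValueError("a clip needs at least two frames")
    # Normalise each frame vector so the dot product is a cosine similarity.
    norms = np.linalg.norm(features, axis=1, keepdims=True)
    unit = features / np.clip(norms, 1e-12, None)
    sim = unit @ unit.T
    # Average over off-diagonal entries only (a frame compared with itself is ignored).
    off_diagonal_sum = sim.sum() - np.trace(sim)
    return float(off_diagonal_sum / (t * (t - 1)))


def select_clips(pool: Dict[str, np.ndarray], budget: int) -> List[str]:
    """Return the ids of the `budget` clips with the weakest intra-clip dependency."""
    scores = {clip_id: intra_clip_dependency(f) for clip_id, f in pool.items()}
    # Ascending sort: the lowest-scoring (weakest-dependency) clips come first.
    return sorted(scores, key=scores.get)[:budget]
```

In an active learning loop, the selected clip ids would be sent for annotation, moved from the unlabelled pool to the training set, and the network retrained before the next round of scoring.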
Pages: 1573-1584
Page count: 12