Exploring Segment-Level Semantics for Online Phase Recognition From Surgical Videos

被引:22
|
作者
Ding, Xinpeng [1 ]
Li, Xiaomeng [1 ,2 ]
机构
[1] Hong Kong Univ Sci & Technol, Dept Elect & Comp Engn, Hong Kong, Peoples R China
[2] Hong Kong Univ Sci & Technol, Shenzhen Res Inst, Shenzhen 518057, Peoples R China
关键词
Surgery; Videos; Feature extraction; Semantics; Hidden Markov models; Task analysis; Convolution; Surgical video analysis; surgical phase recognition; REAL-TIME SEGMENTATION; WORKFLOW RECOGNITION; TASKS;
D O I
10.1109/TMI.2022.3182995
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Automatic surgical phase recognition plays a vital role in robot-assisted surgeries. Existing methods ignored a pivotal problem that surgical phases should be classified by learning segment-level semantics instead of solely relying on frame-wise information. This paper presents a segment-attentive hierarchical consistency network (SAHC) for surgical phase recognition from videos. The key idea is to extract hierarchical high-level semantic-consistent segments and use them to refine the erroneous predictions caused by ambiguous frames. To achieve it, we design a temporal hierarchical network to generate hierarchical high-level segments. Then, we introduce a hierarchical segment-frame attention module to capture relations between the low-level frames and high-level segments. By regularizing the predictions of frames and their corresponding segments via a consistency loss, the network can generate semantic-consistent segments and then rectify the misclassified predictions caused by ambiguous low-level frames. We validate SAHC on two public surgical video datasets, i.e., the M2CAI16 challenge dataset and the Cholec80 dataset. Experimental results show that our method outperforms previous state-of-the-arts and ablation studies prove the effectiveness of our proposed modules. Our code has been released at: https://github.com/xmed-lab/SAHC.
引用
收藏
页码:3309 / 3319
页数:11
相关论文
共 50 条
  • [1] Segment-Level Sentiment Classification for Online Comments of Legal Cases
    Yang, Peng
    Wu, Yong
    Cheng, Tenglang
    Lyu, Xinyuan
    Wang, Zhuang
    2020 IEEE INTL CONF ON DEPENDABLE, AUTONOMIC AND SECURE COMPUTING, INTL CONF ON PERVASIVE INTELLIGENCE AND COMPUTING, INTL CONF ON CLOUD AND BIG DATA COMPUTING, INTL CONF ON CYBER SCIENCE AND TECHNOLOGY CONGRESS (DASC/PICOM/CBDCOM/CYBERSCITECH), 2020, : 366 - 370
  • [2] Online signature verification using segment-level fuzzy modelling
    Ansari, Abdul Quaiyum
    Hanmandlu, Madasu
    Kour, Jaspreet
    Singh, Abhineet Kumar
    IET BIOMETRICS, 2014, 3 (03) : 113 - 127
  • [3] Action Duration Prediction for Segment-Level Alignment of Weakly-Labeled Videos
    Ghoddoosian, Reza
    Sayed, Saif
    Athitsos, Vassilis
    2021 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION WACV 2021, 2021, : 2052 - 2061
  • [4] Segment-Level Joint Topic-Sentiment Model for Online Review Analysis
    Yang, Qinjuan
    Rao, Yanghui
    Xie, Haoran
    Wang, Jiahai
    Wang, Fu Lee
    Chan, Wai Hong
    IEEE INTELLIGENT SYSTEMS, 2019, 34 (01) : 43 - 50
  • [5] RECOGNIZING MICRO ACTIONS IN VIDEOS: LEARNING MOTION DETAILS VIA SEGMENT-LEVEL TEMPORAL PYRAMID
    Mi, Yang
    Wang, Song
    2019 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), 2019, : 1036 - 1041
  • [6] Task-Level vs. Segment-Level Quantitative Metrics for Surgical Skill Assessment
    Vedula, S. Swaroop
    Malpani, Anand
    Ahmidi, Narges
    Khudanpur, Sanjeev
    Hager, Gregory
    Chen, Chi Chiung Grace
    JOURNAL OF SURGICAL EDUCATION, 2016, 73 (03) : 482 - 489
  • [7] Music emotion recognition based on segment-level two-stage learning
    Na He
    Sam Ferguson
    International Journal of Multimedia Information Retrieval, 2022, 11 : 383 - 394
  • [8] Music emotion recognition based on segment-level two-stage learning
    He, Na
    Ferguson, Sam
    INTERNATIONAL JOURNAL OF MULTIMEDIA INFORMATION RETRIEVAL, 2022, 11 (03) : 383 - 394
  • [9] A study of crowdsourced segment-level surgical skill assessment using pairwise rankings
    Anand Malpani
    S. Swaroop Vedula
    Chi Chiung Grace Chen
    Gregory D. Hager
    International Journal of Computer Assisted Radiology and Surgery, 2015, 10 : 1435 - 1447
  • [10] A study of crowdsourced segment-level surgical skill assessment using pairwise rankings
    Malpani, Anand
    Vedula, S. Swaroop
    Chen, Chi Chiung Grace
    Hager, Gregory D.
    INTERNATIONAL JOURNAL OF COMPUTER ASSISTED RADIOLOGY AND SURGERY, 2015, 10 (09) : 1435 - 1447