Data-centric multi-task surgical phase estimation with sparse scene segmentation

被引:7
|
作者
Sanchez-Matilla, Ricardo [1 ]
Robu, Maria [1 ]
Grammatikopoulou, Maria [1 ]
Luengo, Imanol [1 ]
Stoyanov, Danail [1 ,2 ]
机构
[1] Digital Surg, London, England
[2] UCL, Wellcome EPSRC Ctr Intervent & Surg Sci, London, England
关键词
Surgical phases; Scene segmentation; Surgical data science; Multi-task;
D O I
10.1007/s11548-022-02616-0
中图分类号
R318 [生物医学工程];
学科分类号
0831 ;
摘要
Purpose Surgical workflow estimation techniques aim to divide a surgical video into temporal segments based on predefined surgical actions or objectives, which can be of different granularity such as steps or phases. Potential applications range from real-time intra-operative feedback to automatic post-operative reports and analysis. A common approach in the literature for performing automatic surgical phase estimation is to decouple the problem into two stages: feature extraction from a single frame and temporal feature fusion. This approach is performed in two stages due to computational restrictions when processing large spatio-temporal sequences. Methods The majority of existing works focus on pushing the performance solely through temporal model development. Differently, we follow a data-centric approach and propose a training pipeline that enables models to maximise the usage of existing datasets, which are generally used in isolation. Specifically, we use dense phase annotations available in Cholec80, and sparse scene (i.e., instrument and anatomy) segmentation annotation available in CholecSeg8k in less than 5% of the overlapping frames. We propose a simple multi-task encoder that effectively fuses both streams, when available, based on their importance and jointly optimise them for performing accurate phase prediction. Results and conclusion We show that with a small fraction of scene segmentation annotations, a relatively simple model can obtain comparable results than previous state-of-the-art and more complex architectures when evaluated in similar settings. We hope that this data-centric approach can encourage new research directions where data, and how to use it, plays an important role along with model development.
引用
收藏
页码:953 / 960
页数:8
相关论文
共 50 条
  • [1] Data-centric multi-task surgical phase estimation with sparse scene segmentation
    Ricardo Sanchez-Matilla
    Maria Robu
    Maria Grammatikopoulou
    Imanol Luengo
    Danail Stoyanov
    [J]. International Journal of Computer Assisted Radiology and Surgery, 2022, 17 : 953 - 960
  • [2] Scene Text Segmentation via Multi-Task Cascade Transformer With Paired Data Synthesis
    Dang, Quang-Vinh
    Lee, Guee-Sang
    [J]. IEEE ACCESS, 2023, 11 : 67791 - 67805
  • [3] A Data-Centric Approach for Image Scene Localization
    Alfarrarjeh, Abdullah
    Kim, Seon Ho
    Rajan, Shivnesh
    Deshmukh, Akshay
    Shahabi, Cyrus
    [J]. 2018 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2018, : 594 - 603
  • [4] Multi-Task Learning Framework for Motion Estimation and Dynamic Scene Deblurring
    Jung, Hyungjoo
    Kim, Youngjung
    Jang, Hyunsung
    Ha, Namkoo
    Sohn, Kwanghoon
    [J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2021, 30 (30) : 8170 - 8183
  • [5] A LiDAR-Based Dynamic Driving Scene Multi-task Segmentation Network
    Wang, Hai
    Li, Jianguo
    Cai, Yingfeng
    Chen, Long
    [J]. Qiche Gongcheng/Automotive Engineering, 2024, 46 (09): : 1608 - 1616
  • [6] Multi-task Sparse Gaussian Processes with Improved Multi-task Sparsity Regularization
    Zhu, Jiang
    Sun, Shiliang
    [J]. PATTERN RECOGNITION (CCPR 2014), PT I, 2014, 483 : 54 - 62
  • [7] Data-Centric Task Scheduling for Battleship Island Monitoring
    Kuroki, Kotomi
    Kodera, Shiho
    Kurata, Narito
    Hamamoto, Takuji
    Saruwatari, Shunsuke
    [J]. SenSys'15: Proceedings of the 13th ACM Conference on Embedded Networked Sensor Systems, 2015, : 417 - 418
  • [8] Sparse Multi-Task Reinforcement Learning
    Calandriello, Daniele
    Lazaric, Alessandro
    Restelli, Marcello
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 27 (NIPS 2014), 2014, 27
  • [9] Sparse multi-task reinforcement learning
    Calandriello, Daniele
    Lazaric, Alessandro
    Restelli, Marcello
    [J]. INTELLIGENZA ARTIFICIALE, 2015, 9 (01) : 5 - 20
  • [10] Global-Reasoned Multi-Task Learning Model for Surgical Scene Understanding
    Seenivasan, Lalithkumar
    Mitheran, Sai
    Islam, Mobarakol
    Ren, Hongliang
    [J]. IEEE ROBOTICS AND AUTOMATION LETTERS, 2022, 7 (02) : 3858 - 3865