A spatio-temporal network for video semantic segmentation in surgical videos

被引:2
|
作者
Grammatikopoulou, Maria [1 ]
Sanchez-Matilla, Ricardo [1 ]
Bragman, Felix [1 ]
Owen, David [1 ]
Culshaw, Lucy [1 ]
Kerr, Karen [1 ]
Stoyanov, Danail [1 ,2 ]
Luengo, Imanol [1 ]
机构
[1] Medtronic Plc, London, England
[2] UCL, Wellcome EPSRC Ctr Intervent & Surg Sci, London, England
关键词
Video segmentation; Semantic segmentation;
D O I
10.1007/s11548-023-02971-6
中图分类号
R318 [生物医学工程];
学科分类号
0831 ;
摘要
PurposeSemantic segmentation in surgical videos has applications in intra-operative guidance, post-operative analytics and surgical education. Models need to provide accurate predictions since temporally inconsistent identification of anatomy can hinder patient safety. We propose a novel architecture for modelling temporal relationships in videos to address these issues.MethodsWe developed a temporal segmentation model that includes a static encoder and a spatio-temporal decoder. The encoder processes individual frames whilst the decoder learns spatio-temporal relationships from frame sequences. The decoder can be used with any suitable encoder to improve temporal consistency.ResultsModel performance was evaluated on the CholecSeg8k dataset and a private dataset of robotic Partial Nephrectomy procedures. Mean Intersection over Union improved by 1.30% and 4.27% respectively for each dataset when the temporal decoder was applied. Our model also displayed improvements in temporal consistency up to 7.23%.ConclusionsThis work demonstrates an advance in video segmentation of surgical scenes with potential applications in surgery with a view to improve patient outcomes. The proposed decoder can extend state-of-the-art static models, and it is shown that it can improve per-frame segmentation output and video temporal consistency.
引用
收藏
页码:375 / 382
页数:8
相关论文
共 50 条
  • [31] Learning Feature Semantic Matching for Spatio-Temporal Video Grounding
    Zhang, Tong
    Fang, Hao
    Zhang, Hao
    Gao, Jialin
    Lu, Xiankai
    Nie, Xiushan
    Yin, Yilong
    [J]. IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 26 : 9268 - 9279
  • [32] Proposals from binary tree and spatio-temporal tunnel for temporal segmentation of rough videos
    ZHANG Yunzuo
    GUO Kaina
    [J]. Optoelectronics Letters, 2022, 18 (12) : 763 - 768
  • [33] Proposals from binary tree and spatio-temporal tunnel for temporal segmentation of rough videos
    Zhang, Yunzuo
    Guo, Kaina
    [J]. OPTOELECTRONICS LETTERS, 2022, 18 (12) : 763 - 768
  • [34] Proposals from binary tree and spatio-temporal tunnel for temporal segmentation of rough videos
    Yunzuo Zhang
    Kaina Guo
    [J]. Optoelectronics Letters, 2022, 18 : 763 - 768
  • [35] Spatio-temporal Human Body Segmentation from Video Stream
    Al Harbi, Nouf
    Gotoh, Yoshihiko
    [J]. COMPUTER ANALYSIS OF IMAGES AND PATTERNS, PT I, 2013, 8047 : 78 - 85
  • [36] Interactive object extraction using spatio-temporal video segmentation
    [J]. Okubo, Hidehiko, 1600, Inst. of Image Information and Television Engineers (68):
  • [37] Spatio-Temporal Video Segmentation With Shape Growth or Shrinkage Constraint
    Tarabalka, Yuliya
    Charpiat, Guillaume
    Brucker, Ludovic
    Menze, Bjoern H.
    [J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2014, 23 (09) : 3829 - 3840
  • [38] Deep Spatio-Temporal Random Fields for Efficient Video Segmentation
    Chandra, Siddhartha
    Couprie, Camille
    Kokkinos, Iasonas
    [J]. 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 8915 - 8924
  • [39] Spatio-Temporal Video Segmentation of Static Scenes and Its Applications
    Jiang, Hanqing
    Zhang, Guofeng
    Wang, Huiyan
    Bao, Hujun
    [J]. IEEE TRANSACTIONS ON MULTIMEDIA, 2015, 17 (01) : 3 - 15
  • [40] Spatio-temporal video segmentation using a joint similarity measure
    Choi, JG
    Lee, SW
    Kim, SD
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 1997, 7 (02) : 279 - 286