A spatio-temporal network for video semantic segmentation in surgical videos

Cited by: 4
Authors
Grammatikopoulou, Maria [1]
Sanchez-Matilla, Ricardo [1]
Bragman, Felix [1]
Owen, David [1]
Culshaw, Lucy [1]
Kerr, Karen [1]
Stoyanov, Danail [1, 2]
Luengo, Imanol [1]
Affiliations
[1] Medtronic Plc, London, England
[2] UCL, Wellcome EPSRC Ctr Intervent & Surg Sci, London, England
Keywords
Video segmentation; Semantic segmentation
DOI
10.1007/s11548-023-02971-6
Chinese Library Classification (CLC) number
R318 [Biomedical engineering]
Discipline classification code
0831
Abstract
Purpose: Semantic segmentation in surgical videos has applications in intra-operative guidance, post-operative analytics and surgical education. Models need to provide accurate predictions since temporally inconsistent identification of anatomy can hinder patient safety. We propose a novel architecture for modelling temporal relationships in videos to address these issues.
Methods: We developed a temporal segmentation model that includes a static encoder and a spatio-temporal decoder. The encoder processes individual frames whilst the decoder learns spatio-temporal relationships from frame sequences. The decoder can be used with any suitable encoder to improve temporal consistency.
Results: Model performance was evaluated on the CholecSeg8k dataset and a private dataset of robotic Partial Nephrectomy procedures. Mean Intersection over Union improved by 1.30% and 4.27% on the two datasets, respectively, when the temporal decoder was applied. Our model also displayed improvements in temporal consistency of up to 7.23%.
Conclusions: This work demonstrates an advance in video segmentation of surgical scenes, with potential applications in surgery aimed at improving patient outcomes. The proposed decoder can extend state-of-the-art static models, and it is shown to improve both per-frame segmentation output and video temporal consistency.
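The abstract reports results as Mean Intersection over Union (mIoU). As a rough illustration of that metric only, the sketch below computes mIoU for one pair of flattened label maps. The function name `mean_iou`, the choice to skip classes absent from both maps, and the flat-list input format are assumptions made for this example; the abstract does not state the authors' exact evaluation protocol.

```python
from typing import Sequence


def mean_iou(pred: Sequence[int], target: Sequence[int], num_classes: int) -> float:
    """Mean IoU over the classes present in the prediction or the ground truth.

    `pred` and `target` are flattened label maps (one class index per pixel).
    Skipping classes absent from both maps is an assumption of this sketch.
    """
    if len(pred) != len(target):
        raise ValueError("pred and target must have the same number of pixels")

    intersection = [0] * num_classes
    union = [0] * num_classes
    for p, t in zip(pred, target):
        if p == t:
            # Pixel counts once towards both intersection and union of its class.
            intersection[p] += 1
            union[p] += 1
        else:
            # Pixel belongs to the union of both the predicted and the true class.
            union[p] += 1
            union[t] += 1

    ious = [i / u for i, u in zip(intersection, union) if u > 0]
    if not ious:
        raise ValueError("no class present in either label map")
    return sum(ious) / len(ious)


if __name__ == "__main__":
    # Toy 4-pixel example with two classes (hypothetical data).
    print(mean_iou([0, 0, 1, 1], [0, 1, 1, 1], num_classes=2))
```

In the toy example, class 0 has IoU 1/2 and class 1 has IoU 2/3, so the mean is 7/12. A video-level score would additionally need a rule for aggregating over frames, which is not specified here.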
Pages: 375-382
Number of pages: 8