A spatio-temporal network for video semantic segmentation in surgical videos

Cited by: 2

Authors:

Grammatikopoulou, Maria [1]

Sanchez-Matilla, Ricardo [1]

Bragman, Felix [1]

Owen, David [1]

Culshaw, Lucy [1]

Kerr, Karen [1]

Stoyanov, Danail [1,2]

Luengo, Imanol [1]

Affiliations:

[1] Medtronic Plc, London, England

[2] UCL, Wellcome EPSRC Ctr Intervent & Surg Sci, London, England

Source:

INTERNATIONAL JOURNAL OF COMPUTER ASSISTED RADIOLOGY AND SURGERY | 2023 / Volume 19 / Issue 2

Keywords:

Video segmentation; Semantic segmentation;

DOI:

10.1007/s11548-023-02971-6

Chinese Library Classification (CLC) number:

R318 [Biomedical Engineering];

Discipline classification code:

0831 ;

Abstract:

Purpose: Semantic segmentation in surgical videos has applications in intra-operative guidance, post-operative analytics and surgical education. Models need to provide accurate predictions since temporally inconsistent identification of anatomy can hinder patient safety. We propose a novel architecture for modelling temporal relationships in videos to address these issues.

Methods: We developed a temporal segmentation model that includes a static encoder and a spatio-temporal decoder. The encoder processes individual frames whilst the decoder learns spatio-temporal relationships from frame sequences. The decoder can be used with any suitable encoder to improve temporal consistency.

Results: Model performance was evaluated on the CholecSeg8k dataset and a private dataset of robotic Partial Nephrectomy procedures. Mean Intersection over Union improved by 1.30% and 4.27% respectively for each dataset when the temporal decoder was applied. Our model also displayed improvements in temporal consistency up to 7.23%.

Conclusions: This work demonstrates an advance in video segmentation of surgical scenes with potential applications in surgery with a view to improve patient outcomes. The proposed decoder can extend state-of-the-art static models, and it is shown that it can improve per-frame segmentation output and video temporal consistency.
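For readers unfamiliar with the headline metric, the following is a minimal sketch of how mean Intersection over Union is commonly computed for a single frame. It is not the authors' evaluation code: the function name `mean_iou`, the flat-list label representation, and the choice to skip classes absent from both prediction and ground truth are all assumptions made for illustration.

```python
def mean_iou(pred, target, num_classes):
    """Mean IoU over the classes that occur in pred or target.

    pred and target are flat sequences of integer class labels of equal
    length (hypothetical representation of a flattened segmentation mask).
    """
    if len(pred) != len(target):
        raise ValueError("pred and target must have the same length")

    ious = []
    for c in range(num_classes):
        # Pixels labelled c in both masks.
        inter = sum(1 for p, t in zip(pred, target) if p == c and t == c)
        # Pixels labelled c in at least one mask.
        union = sum(1 for p, t in zip(pred, target) if p == c or t == c)
        if union > 0:  # assumption: classes absent from both masks are skipped
            ious.append(inter / union)

    return sum(ious) / len(ious) if ious else float("nan")


if __name__ == "__main__":
    # Toy 4-pixel frame with two classes.
    print(mean_iou([0, 0, 1, 1], [0, 1, 1, 1], num_classes=2))
```

In the toy call, class 0 has IoU 1/2 and class 1 has IoU 2/3, so the mean is 7/12. The abstract does not define how temporal consistency was measured, so no sketch of that metric is given here.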


Pages: 375 - 382

Number of pages: 8
