A spatio-temporal network for video semantic segmentation in surgical videos

被引：4

作者：

Grammatikopoulou, Maria ^{[1
]}

Sanchez-Matilla, Ricardo ^{[1
]}

Bragman, Felix ^{[1
]}

Owen, David ^{[1
]}

Culshaw, Lucy ^{[1
]}

Kerr, Karen ^{[1
]}

Stoyanov, Danail ^{[1
,2
]}

Luengo, Imanol ^{[1
]}

机构：

[1] Medtronic Plc, London, England

[2] UCL, Wellcome EPSRC Ctr Intervent & Surg Sci, London, England

来源：

INTERNATIONAL JOURNAL OF COMPUTER ASSISTED RADIOLOGY AND SURGERY | 2023年 / 19卷 / 2期

关键词：

Video segmentation; Semantic segmentation;

D O I：

10.1007/s11548-023-02971-6

中图分类号：

R318 [生物医学工程];

学科分类号：

0831 ;

摘要：

PurposeSemantic segmentation in surgical videos has applications in intra-operative guidance, post-operative analytics and surgical education. Models need to provide accurate predictions since temporally inconsistent identification of anatomy can hinder patient safety. We propose a novel architecture for modelling temporal relationships in videos to address these issues.MethodsWe developed a temporal segmentation model that includes a static encoder and a spatio-temporal decoder. The encoder processes individual frames whilst the decoder learns spatio-temporal relationships from frame sequences. The decoder can be used with any suitable encoder to improve temporal consistency.ResultsModel performance was evaluated on the CholecSeg8k dataset and a private dataset of robotic Partial Nephrectomy procedures. Mean Intersection over Union improved by 1.30% and 4.27% respectively for each dataset when the temporal decoder was applied. Our model also displayed improvements in temporal consistency up to 7.23%.ConclusionsThis work demonstrates an advance in video segmentation of surgical scenes with potential applications in surgery with a view to improve patient outcomes. The proposed decoder can extend state-of-the-art static models, and it is shown that it can improve per-frame segmentation output and video temporal consistency.

引用

页码：375 / 382

页数：8

共 50 条

[21] A Novel Spatio-Temporal Video Object Segmentation Algorithm
Zhu, Shiping
Xia, Xi
Zhang, Qingrong
Belloulata, Kamel
2008 IEEE INTERNATIONAL CONFERENCE ON INDUSTRIAL TECHNOLOGY, VOLS 1-5, 2008, : 1916 - +
[22] Efficient probabilistic spatio-temporal video object segmentation
Ahmed, Rakib
Karmakar, Gour C.
Dooley, Laurence S.
6TH IEEE/ACIS INTERNATIONAL CONFERENCE ON COMPUTER AND INFORMATION SCIENCE, PROCEEDINGS, 2007, : 807 - +
[23] Spatio-temporal segmentation using laserscanner and video sequences
Kaempchen, N
Zocholl, M
Dietmayer, KCJ
PATTERN RECOGNITION, 2004, 3175 : 367 - 374
[24] Morphological spatio-temporal simplification for video image segmentation
Wang, DM
Labit, C
SIGNAL PROCESSING-IMAGE COMMUNICATION, 1997, 11 (02) : 161 - 170
[25] A spatio-temporal video analysis system for object segmentation
Xia, JH
Wang, YL
ISPA 2003: PROCEEDINGS OF THE 3RD INTERNATIONAL SYMPOSIUM ON IMAGE AND SIGNAL PROCESSING AND ANALYSIS, PTS 1 AND 2, 2003, : 812 - 815
[26] A Spatio-Temporal Encoding Neural Network for Semantic Segmentation of Satellite Image Time Series
Zhang, Feifei
Wang, Yong
Du, Yawen
Zhu, Yijia
APPLIED SCIENCES-BASEL, 2023, 13 (23):
[27] STFCN: Spatio-Temporal Fully Convolutional Neural Network for Semantic Segmentation of Street Scenes
Fayyaz, Mohsen
Saffar, Mohammad Hajizadeh
Sabokrou, Mohammad
Fathy, Mahmood
Huang, Fay
Klette, Reinhard
COMPUTER VISION - ACCV 2016 WORKSHOPS, PT I, 2017, 10116 : 493 - 509
[28] Spatio-Temporal Transformer Network for Video Restoration
Kim, Tae Hyun
Sajjadi, Mehdi S. M.
Hirsch, Michael
Schoelkopf, Bernhard
COMPUTER VISION - ECCV 2018, PT III, 2018, 11207 : 111 - 127
[29] Spatio-temporal segmentation
Swain, C
Puri, A
VISUAL COMMUNICATIONS AND IMAGE PROCESSING '99, PARTS 1-2, 1998, 3653 : 1233 - 1236
[30] Spatio-Temporal Self-Attention Network for Fire Detection and Segmentation in Video Surveillance
Shahid, Mohammad
Virtusio, John Jethro
Wu, Yu-Hsien
Chen, Yung-Yao
Tanveer, M.
Muhammad, Khan
Hua, Kai-Lung
IEEE ACCESS, 2022, 10 : 1259 - 1275

← 1 2 3 4 5 →