Dynamic Scene Graph Representation for Surgical Video

Cited by: 4

Authors:

Holm, Felix [1]

Ghazaei, Ghazal [2]

Czempiel, Tobias [1]

Oezsoy, Ege [1]

Saur, Stefan [3]

Navab, Nassir [1]

Affiliations:

[1] Tech Univ Munich, Chair Comp Aided Med Procedures, Munich, Germany

[2] Carl Zeiss, Oberkochen, Germany

[3] Carl Zeiss Meditec AG, Jena, Germany

Source:

2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS, ICCVW | 2023

DOI:

10.1109/ICCVW60793.2023.00015

Chinese Library Classification number:

TP18 [Artificial intelligence theory]

Discipline classification codes:

081104; 0812; 0835; 1405

Abstract:

Surgical videos captured from microscopic or endoscopic imaging devices are rich but complex sources of information, depicting different tools and anatomical structures utilized during an extended amount of time. Despite containing crucial workflow information and being commonly recorded in many procedures, usage of surgical videos for automated surgical workflow understanding is still limited. In this work, we exploit scene graphs as a more holistic, semantically meaningful and human-readable way to represent surgical videos while encoding all anatomical structures, tools, and their interactions. To properly evaluate the impact of our solutions, we create a scene graph dataset from semantic segmentations from the CaDIS and CATARACTS datasets. We demonstrate that scene graphs can be leveraged through the use of graph convolutional networks (GCNs) to tackle surgical downstream tasks such as surgical workflow recognition with competitive performance. Moreover, we demonstrate the benefits of surgical scene graphs regarding the explainability and robustness of model decisions, which are crucial in the clinical setting.
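As a rough illustration of the idea in the abstract, the sketch below derives a toy scene graph from a single semantic segmentation mask. It is not the authors' pipeline: the edge rule (two classes are connected when their pixels touch), the function name `scene_graph_from_mask`, and the class labels are all assumptions chosen for this example.

```python
from itertools import product


def scene_graph_from_mask(mask, class_names):
    """Build a toy scene graph from a 2D label mask (list of lists of ints).

    Nodes: the classes present in the mask.
    Edges: pairs of classes that share at least one 4-adjacent pixel pair.
    This adjacency rule is an assumption made for illustration only.
    """
    height, width = len(mask), len(mask[0])
    nodes = set()
    edges = set()
    for r, c in product(range(height), range(width)):
        a = mask[r][c]
        nodes.add(a)
        # Check right and bottom neighbours only, so each pixel pair is seen once.
        for dr, dc in ((0, 1), (1, 0)):
            rr, cc = r + dr, c + dc
            if rr < height and cc < width:
                b = mask[rr][cc]
                if a != b:
                    edges.add((min(a, b), max(a, b)))
    return (
        sorted(class_names[n] for n in nodes),
        sorted((class_names[a], class_names[b]) for a, b in edges),
    )


# Hypothetical labels and a hand-made 4x4 mask, not taken from CaDIS or CATARACTS.
CLASS_NAMES = {0: "cornea", 1: "pupil", 2: "forceps"}
MASK = [
    [0, 0, 0, 0],
    [0, 1, 1, 2],
    [0, 1, 1, 2],
    [0, 0, 0, 0],
]

nodes, edges = scene_graph_from_mask(MASK, CLASS_NAMES)
print(nodes)
print(edges)
```

Applying such a function to every frame would give a sequence of graphs over time, which is the kind of input a graph convolutional network could consume for a downstream task such as workflow recognition.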

Pages: 81-87

Page count: 7
