Dynamic Scene Graph Representation for Surgical Video

被引：4

作者：

Holm, Felix ^{[1
]}

Ghazaei, Ghazal ^{[2
]}

Czempiel, Tobias ^{[1
]}

Oezsoy, Ege ^{[1
]}

Saur, Stefan ^{[3
]}

Navab, Nassir ^{[1
]}

机构：

[1] Tech Univ Munich, Chair Comp Aided Med Procedures, Munich, Germany

[2] Carl Zeiss, Oberkochen, Germany

[3] Carl Zeiss Meditec AG, Jena, Germany

来源：

2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS, ICCVW | 2023年

关键词：

D O I：

10.1109/ICCVW60793.2023.00015

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Surgical videos captured from microscopic or endoscopic imaging devices are rich but complex sources of information, depicting different tools and anatomical structures utilized during an extended amount of time. Despite containing crucial workflow information and being commonly recorded in many procedures, usage of surgical videos for automated surgical workflow understanding is still limited. In this work, we exploit scene graphs as a more holistic, semantically meaningful and human-readable way to represent surgical videos while encoding all anatomical structures, tools, and their interactions. To properly evaluate the impact of our solutions, we create a scene graph dataset from semantic segmentations from the CaDIS and CATARACTS datasets. We demonstrate that scene graphs can be leveraged through the use of graph convolutional networks (GCNs) to tackle surgical downstream tasks such as surgical workflow recognition with competitive performance. Moreover, we demonstrate the benefits of surgical scene graphs regarding the explainability and robustness of model decisions, which are crucial in the clinical setting.

引用

页码：81 / 87

页数：7

共 50 条

[41] Enhancing Unsupervised Video Representation Learning by Decoupling the Scene and the Motion
Wang, Jinpeng
Gao, Yuting
Li, Ke
Hu, Jianguo
Jiang, Xinyang
Guo, Xiaowei
Ji, Rongrong
Sun, Xing
THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 10129 - 10137
[42] Motion-based video representation for scene change detection
Ngo, CW
Pong, TC
Zhang, HJ
INTERNATIONAL JOURNAL OF COMPUTER VISION, 2002, 50 (02) : 127 - 142
[43] Variational Graph Convolutional Networks for Dynamic Graph Representation Learning
Mir, Aabid A.
Zuhairi, Megat F.
Musa, Shahrulniza
Alanazi, Meshari H.
Namoun, Abdallah
IEEE ACCESS, 2024, 12 : 161697 - 161717
[44] A dynamic graph representation learning based on temporal graph transformer
Zhong, Ying
Huang, Chenze
ALEXANDRIA ENGINEERING JOURNAL, 2023, 63 : 359 - 369
[45] A dynamic graph representation learning based on temporal graph transformer
Zhong, Ying
Huang, Chenze
ALEXANDRIA ENGINEERING JOURNAL, 2023, 63 : 359 - 369
[46] Semantic Single Video Segmentation with Robust Graph Representation
Zhao, Handong
Fu, Yun
PROCEEDINGS OF THE TWENTY-FOURTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE (IJCAI), 2015, : 2219 - 2225
[47] A conceptual graph approach for video data representation and retrieval
Fatemi, N
Mulhem, P
ADVANCES IN INTELLIGENT DATA ANALYSIS, PROCEEDINGS, 1999, 1642 : 525 - 536
[48] VStreamDRLS: Dynamic Graph Representation Learning with Self-Attention for Enterprise Distributed Video Streaming Solutions
Antaris, Stefanos
Rafailidis, Dimitrios
2020 IEEE/ACM INTERNATIONAL CONFERENCE ON ADVANCES IN SOCIAL NETWORKS ANALYSIS AND MINING (ASONAM), 2020, : 486 - 493
[49] A Surgical Scene Replay System for Learning Gastroenterological Endoscopic Surgery Skill by Multiple Synchronized-Video and Gaze Representation
Matsuda A.
Okuzono T.
Nakamura H.
Kuzuoka H.
Rekimoto J.
1600, Association for Computing Machinery (05):
[50] HIGH DYNAMIC RANGE IMAGING FOR STEREOSCOPIC SCENE REPRESENTATION
Lin, Huei-Yung
Chang, Wei-Zhe
2009 16TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, VOLS 1-6, 2009, : 4305 - 4308

← 1 2 3 4 5 →