Dynamic Scene Graph Representation for Surgical Video

Cited by: 4
Authors
Holm, Felix [1]
Ghazaei, Ghazal [2]
Czempiel, Tobias [1]
Oezsoy, Ege [1]
Saur, Stefan [3]
Navab, Nassir [1]
Affiliations
[1] Tech Univ Munich, Chair Comp Aided Med Procedures, Munich, Germany
[2] Carl Zeiss, Oberkochen, Germany
[3] Carl Zeiss Meditec AG, Jena, Germany
DOI
10.1109/ICCVW60793.2023.00015
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
Surgical videos captured from microscopic or endoscopic imaging devices are rich but complex sources of information, depicting the different tools and anatomical structures used over an extended period of time. Although they contain crucial workflow information and are commonly recorded in many procedures, surgical videos are still rarely used for automated surgical workflow understanding. In this work, we exploit scene graphs as a more holistic, semantically meaningful, and human-readable way to represent surgical videos while encoding all anatomical structures, tools, and their interactions. To properly evaluate the impact of our solutions, we create a scene graph dataset from the semantic segmentations of the CaDIS and CATARACTS datasets. We demonstrate that scene graphs can be leveraged with graph convolutional networks (GCNs) to tackle surgical downstream tasks such as surgical workflow recognition with competitive performance. Moreover, we demonstrate the benefits of surgical scene graphs for the explainability and robustness of model decisions, which are crucial in the clinical setting.
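As a rough illustration of the idea summarized in the abstract, the sketch below turns a single semantic segmentation mask into a small scene graph and passes it through one graph convolution. It is not the authors' implementation: the touching-pixels rule for edges, the toy mask, the class meanings, and the function names `build_scene_graph` and `gcn_layer` are all assumptions made for this example, and the layer uses the widely known symmetric-normalized GCN propagation rule rather than anything stated in this record.

```python
import numpy as np


def build_scene_graph(mask):
    """Return (class_ids, adjacency) for a 2-D integer label mask.

    Assumption for this sketch: two classes are connected when any of
    their pixels are 4-neighbours in the mask.
    """
    class_ids = np.unique(mask)
    index = {int(c): i for i, c in enumerate(class_ids)}
    adj = np.zeros((len(class_ids), len(class_ids)), dtype=float)
    # Compare each pixel with its right neighbour and its lower neighbour.
    pairs = [(mask[:, :-1], mask[:, 1:]), (mask[:-1, :], mask[1:, :])]
    for a, b in pairs:
        touching = a != b
        for u, v in zip(a[touching], b[touching]):
            i, j = index[int(u)], index[int(v)]
            adj[i, j] = adj[j, i] = 1.0
    return class_ids, adj


def gcn_layer(adj, features, weights):
    """One graph convolution: ReLU(D^-1/2 (A + I) D^-1/2 X W)."""
    a_hat = adj + np.eye(adj.shape[0])          # add self-loops
    deg_inv_sqrt = 1.0 / np.sqrt(a_hat.sum(axis=1))
    a_norm = a_hat * deg_inv_sqrt[:, None] * deg_inv_sqrt[None, :]
    return np.maximum(a_norm @ features @ weights, 0.0)


# Toy 3x3 mask with hypothetical classes: 0 = anatomy, 1 and 2 = tools.
mask = np.array([[0, 0, 1],
                 [0, 2, 1],
                 [0, 2, 2]])
class_ids, adj = build_scene_graph(mask)

rng = np.random.default_rng(0)
features = np.eye(len(class_ids))               # one-hot node features
weights = rng.normal(size=(len(class_ids), 4))  # untrained, illustrative only
node_embeddings = gcn_layer(adj, features, weights)
print(class_ids, adj, node_embeddings.shape, sep="\n")
```

For a video, one such graph could be built per frame and the resulting node embeddings pooled and fed to a classifier over workflow phases; how the paper actually constructs and connects graphs over time is not described in this record.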
Pages: 81-87
Number of pages: 7