Dynamic Scene Graph Representation for Surgical Video

被引:4
|
作者
Holm, Felix [1 ]
Ghazaei, Ghazal [2 ]
Czempiel, Tobias [1 ]
Oezsoy, Ege [1 ]
Saur, Stefan [3 ]
Navab, Nassir [1 ]
机构
[1] Tech Univ Munich, Chair Comp Aided Med Procedures, Munich, Germany
[2] Carl Zeiss, Oberkochen, Germany
[3] Carl Zeiss Meditec AG, Jena, Germany
关键词
D O I
10.1109/ICCVW60793.2023.00015
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Surgical videos captured from microscopic or endoscopic imaging devices are rich but complex sources of information, depicting different tools and anatomical structures utilized during an extended amount of time. Despite containing crucial workflow information and being commonly recorded in many procedures, usage of surgical videos for automated surgical workflow understanding is still limited. In this work, we exploit scene graphs as a more holistic, semantically meaningful and human-readable way to represent surgical videos while encoding all anatomical structures, tools, and their interactions. To properly evaluate the impact of our solutions, we create a scene graph dataset from semantic segmentations from the CaDIS and CATARACTS datasets. We demonstrate that scene graphs can be leveraged through the use of graph convolutional networks (GCNs) to tackle surgical downstream tasks such as surgical workflow recognition with competitive performance. Moreover, we demonstrate the benefits of surgical scene graphs regarding the explainability and robustness of model decisions, which are crucial in the clinical setting.
引用
收藏
页码:81 / 87
页数:7
相关论文
共 50 条
  • [31] Target Adaptive Context Aggregation for Video Scene Graph Generation
    Teng, Yao
    Wang, Limin
    Li, Zhifeng
    Wu, Gangshan
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 13668 - 13677
  • [32] Video scene detection using graph-based representations
    Sakarya, Ufuk
    Telatar, Ziya
    SIGNAL PROCESSING-IMAGE COMMUNICATION, 2010, 25 (10) : 774 - 783
  • [33] Video Scene Graph Generation with Spatial-Temporal Knowledge
    Pu, Tao
    PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023, 2023, : 9340 - 9344
  • [34] Seamless Video Scene Transition Using Hierarchical Graph Cuts
    Hirai, Tatsunori
    SIGGRAPH ASIA 2017 POSTERS (SA'17), 2017,
  • [35] Robust Graph-Cut Scene Segmentation and Reconstruction for Free-Viewpoint Video of Complex Dynamic Scenes
    Guillemaut, Jean-Yves
    Kilner, Joe
    Hilton, Adrian
    2009 IEEE 12TH INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2009, : 809 - 816
  • [36] Motion-Based Video Representation for Scene Change Detection
    Chong-Wah Ngo
    Ting-Chuen Pong
    Hong-Jiang Zhang
    International Journal of Computer Vision, 2002, 50 : 127 - 142
  • [37] Video scene segmentation and semantic representation using a novel scheme
    Songhao Zhu
    Yuncai Liu
    Multimedia Tools and Applications, 2009, 42 : 183 - 205
  • [38] Video scene segmentation and semantic representation using a novel scheme
    Zhu, Songhao
    Liu, Yuncai
    MULTIMEDIA TOOLS AND APPLICATIONS, 2009, 42 (02) : 183 - 205
  • [39] Object-Centric Representation Learning for Video Scene Understanding
    Zhou, Yi
    Zhang, Hui
    Park, Seung-In
    Yoo, ByungIn
    Qi, Xiaojuan
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2024, 46 (12) : 8410 - 8423
  • [40] Motion-based video representation for scene change detection
    Ngo, CW
    Pong, TC
    Zhang, HJ
    Chin, RT
    15TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOL 1, PROCEEDINGS: COMPUTER VISION AND IMAGE ANALYSIS, 2000, : 827 - 830