Spatially Conditioned Graphs for Detecting Human-Object Interactions

Cited by: 44
Authors
Zhang, Frederic Z. [1, 3]
Campbell, Dylan [2, 3]
Gould, Stephen [1, 3]
Affiliations
[1] Australian Natl Univ, Canberra, ACT, Australia
[2] Univ Oxford, Oxford, England
[3] Australian Ctr Robot Vis, Canberra, ACT, Australia
DOI
10.1109/ICCV48922.2021.01307
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
We address the problem of detecting human-object interactions in images using graphical neural networks. Unlike conventional methods, where nodes send scaled but otherwise identical messages to each of their neighbours, we propose to condition messages between pairs of nodes on their spatial relationships, resulting in different messages going to neighbours of the same node. To this end, we explore various ways of applying spatial conditioning under a multi-branch structure. Through extensive experimentation we demonstrate the advantages of spatial conditioning for the computation of the adjacency structure, messages and the refined graph features. In particular, we empirically show that as the quality of the bounding boxes increases, their coarse appearance features contribute relatively less to the disambiguation of interactions compared to the spatial information. Our method achieves an mAP of 31.33% on HICO-DET and 54.2% on V-COCO, significantly outperforming state-of-the-art on fine-tuned detections.
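The contrast the abstract draws between conventional and spatially conditioned messages can be illustrated with a minimal sketch. This is not the authors' implementation: the function names `box_pair_encoding` and `conditioned_message`, the three toy spatial features, and the sigmoid gate are all assumptions made for illustration only.

```python
import math

def box_pair_encoding(box_a, box_b):
    """Toy pairwise spatial encoding for two boxes given as (x1, y1, x2, y2).

    Hypothetical features: centre offset normalised by the size of box_a,
    and the log ratio of the two box areas.
    """
    ax, ay = (box_a[0] + box_a[2]) / 2, (box_a[1] + box_a[3]) / 2
    bx, by = (box_b[0] + box_b[2]) / 2, (box_b[1] + box_b[3]) / 2
    aw, ah = box_a[2] - box_a[0], box_a[3] - box_a[1]
    bw, bh = box_b[2] - box_b[0], box_b[3] - box_b[1]
    return [(bx - ax) / aw, (by - ay) / ah, math.log((bw * bh) / (aw * ah))]

def conditioned_message(sender_feat, spatial, weight):
    """Scale the sender's features by an adjacency weight and gate them
    elementwise with the pair's spatial encoding (sigmoid gate is an assumption)."""
    gate = [1.0 / (1.0 + math.exp(-s)) for s in spatial]
    return [weight * f * g for f, g in zip(sender_feat, gate)]

# One sender node and two receiver nodes at different locations.
sender_box, sender_feat = (40, 40, 60, 60), [1.0, 2.0, 3.0]
receiver_boxes = [(0, 0, 20, 40), (50, 30, 70, 70)]
weights = [0.5, 0.5]

# Conventional scheme: every neighbour gets the same message up to a scale.
conventional = [[w * f for f in sender_feat] for w in weights]

# Conditioned scheme: each message depends on the sender-receiver geometry.
conditioned = [
    conditioned_message(sender_feat, box_pair_encoding(sender_box, rb), w)
    for rb, w in zip(receiver_boxes, weights)
]
```

With equal adjacency weights, the two conventional messages are identical, while the two conditioned messages differ because the receivers sit at different positions relative to the sender.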
Pages: 13299-13307
Number of pages: 9