Spatially Conditioned Graphs for Detecting Human-Object Interactions

被引:44
|
作者
Zhang, Frederic Z. [1 ,3 ]
Campbell, Dylan [2 ,3 ]
Gould, Stephen [1 ,3 ]
机构
[1] Australian Natl Univ, Canberra, ACT, Australia
[2] Univ Oxford, Oxford, England
[3] Australian Ctr Robot Vis, Canberra, ACT, Australia
关键词
D O I
10.1109/ICCV48922.2021.01307
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We address the problem of detecting human-object interactions in images using graphical neural networks. Unlike conventional methods, where nodes send scaled but otherwise identical messages to each of their neighbours, we propose to condition messages between pairs of nodes on their spatial relationships, resulting in different messages going to neighbours of the same node. To this end, we explore various ways of applying spatial conditioning under a multi-branch structure. Through extensive experimentation we demonstrate the advantages of spatial conditioning for the computation of the adjacency structure, messages and the refined graph features. In particular, we empirically show that as the quality of the bounding boxes increases, their coarse appearance features contribute relatively less to the disambiguation of interactions compared to the spatial information. Our method achieves an mAP of 31.33% on HICO-DET and 54.2% on V-COCO, significantly outperforming state-of-the-art on fine-tuned detections.
引用
收藏
页码:13299 / 13307
页数:9
相关论文
共 50 条
  • [41] Spatial Audio for Human-Object Interactions in Small AR Workspaces
    Yang, Jing
    Soros, Gabor
    [J]. MOBISYS'18: PROCEEDINGS OF THE 16TH ACM INTERNATIONAL CONFERENCE ON MOBILE SYSTEMS, APPLICATIONS, AND SERVICES, 2018, : 518 - 518
  • [42] A Method for Detecting Human-object Interaction based on Motion Distribution around Hand
    Tsukamoto, Tatsuhiro
    Abe, Toru
    Suganuma, Takuo
    [J]. PROCEEDINGS OF THE 15TH INTERNATIONAL JOINT CONFERENCE ON COMPUTER VISION, IMAGING AND COMPUTER GRAPHICS THEORY AND APPLICATIONS, VOL 5: VISAPP, 2020, : 462 - 469
  • [43] Detecting human-object interaction with multi-level pairwise feature network
    Liu, Hanchao
    Mu, Tai-Jiang
    Huang, Xiaolei
    [J]. COMPUTATIONAL VISUAL MEDIA, 2021, 7 (02) : 229 - 239
  • [44] Reasoning About Human-Object Interactions Through Dual Attention Networks
    Xiao, Tete
    Fan, Quanfu
    Gutfreund, Dan
    Monfort, Mathew
    Oliva, Aude
    Zhou, Bolei
    [J]. 2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 3918 - 3927
  • [45] GID-Net: Detecting human-object interaction with global and instance dependency
    Yang, Dongming
    Zou, YueXian
    Zhang, Jian
    Li, Ge
    [J]. NEUROCOMPUTING, 2021, 444 : 366 - 377
  • [46] Turbo Learning Framework for Human-Object Interactions Recognition and Human Pose Estimation
    Feng, Wei
    Liu, Wentao
    Li, Tong
    Peng, Jing
    Qian, Chen
    Hu, Xiaolin
    [J]. THIRTY-THIRD AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FIRST INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / NINTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2019, : 898 - 905
  • [47] Hierarchical Video Prediction using Relational Layouts for Human-Object Interactions
    Bodla, Navaneeth
    Shrivastava, Gaurav
    Chellappa, Rama
    Shrivastava, Abhinav
    [J]. 2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 12141 - 12150
  • [48] Spatio-Temporal Human-Object Interactions for Action Recognition in Videos
    Escorcia, Victor
    Carlos Niebles, Juan
    [J]. 2013 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW), 2013, : 508 - 514
  • [49] Observing Human-Object Interactions: Using Spatial and Functional Compatibility for Recognition
    Gupta, Abhinav
    Kembhavi, Aniruddha
    Davis, Larry S.
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2009, 31 (10) : 1775 - 1789
  • [50] Recognition and Prediction of Human-Object Interactions with a Self-Organizing Architecture
    Mici, Luiza
    Parisi, German, I
    Wermter, Stefan
    [J]. 2018 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2018,