Spatially Conditioned Graphs for Detecting Human-Object Interactions

Cited by: 44
Authors
Zhang, Frederic Z. [1, 3]
Campbell, Dylan [2, 3]
Gould, Stephen [1, 3]
Affiliations
[1] Australian National University, Canberra, ACT, Australia
[2] University of Oxford, Oxford, England
[3] Australian Centre for Robotic Vision, Canberra, ACT, Australia
DOI
10.1109/ICCV48922.2021.01307
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
We address the problem of detecting human-object interactions in images using graphical neural networks. Unlike conventional methods, where nodes send scaled but otherwise identical messages to each of their neighbours, we propose to condition messages between pairs of nodes on their spatial relationships, resulting in different messages going to neighbours of the same node. To this end, we explore various ways of applying spatial conditioning under a multi-branch structure. Through extensive experimentation we demonstrate the advantages of spatial conditioning for the computation of the adjacency structure, messages and the refined graph features. In particular, we empirically show that as the quality of the bounding boxes increases, their coarse appearance features contribute relatively less to the disambiguation of interactions compared to the spatial information. Our method achieves an mAP of 31.33% on HICO-DET and 54.2% on V-COCO, significantly outperforming state-of-the-art on fine-tuned detections.
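The contrast the abstract draws, between scaled-but-identical messages and spatially conditioned ones, can be illustrated with a minimal sketch. Everything named here is an assumption made for illustration and is not taken from the paper: the three-number box-pair encoding, the sigmoid gate, the fixed `GATE_WEIGHTS` (which would be learned in practice), and the toy boxes and features. It is not the authors' multi-branch module, only one simple way to make a message depend on the spatial relationship between sender and receiver.

```python
import math

def pair_spatial_features(box_a, box_b):
    """Encode the layout of box_b relative to box_a; boxes are (x1, y1, x2, y2)."""
    aw, ah = box_a[2] - box_a[0], box_a[3] - box_a[1]
    bw, bh = box_b[2] - box_b[0], box_b[3] - box_b[1]
    # Centre offset of b from a, normalised by the size of a.
    dx = ((box_b[0] + box_b[2]) - (box_a[0] + box_a[2])) / (2 * aw)
    dy = ((box_b[1] + box_b[3]) - (box_a[1] + box_a[3])) / (2 * ah)
    # Intersection over union of the two boxes.
    iw = max(0.0, min(box_a[2], box_b[2]) - max(box_a[0], box_b[0]))
    ih = max(0.0, min(box_a[3], box_b[3]) - max(box_a[1], box_b[1]))
    inter = iw * ih
    iou = inter / (aw * ah + bw * bh - inter)
    return [dx, dy, iou]

def scaled_message(sender_feat, adjacency_weight):
    """Conventional message: the same vector for every neighbour, scaled by a scalar."""
    return [adjacency_weight * f for f in sender_feat]

def conditioned_message(sender_feat, spatial, gate_weights):
    """Illustrative conditioning: a per-channel sigmoid gate computed from spatial features."""
    message = []
    for f, w_row in zip(sender_feat, gate_weights):
        logit = sum(w * s for w, s in zip(w_row, spatial))
        message.append(f / (1.0 + math.exp(-logit)))
    return message

def cross_2d(u, v):
    """Zero when two 2-D vectors are parallel, i.e. differ only by a scale factor."""
    return u[0] * v[1] - u[1] * v[0]

# Hypothetical toy scene: one human node and two object nodes.
human_box = (0, 0, 2, 4)
object_a_box = (1, 1, 3, 3)   # overlaps the human
object_b_box = (4, 0, 6, 2)   # off to the right, no overlap
human_feat = [1.0, 2.0]       # stand-in for an appearance feature

# Hypothetical gate parameters (one row per feature channel).
GATE_WEIGHTS = [[1.0, 0.0, 0.0], [0.0, 0.0, -5.0]]

spatial_a = pair_spatial_features(human_box, object_a_box)
spatial_b = pair_spatial_features(human_box, object_b_box)

scaled_a = scaled_message(human_feat, 0.7)
scaled_b = scaled_message(human_feat, 0.3)
cond_a = conditioned_message(human_feat, spatial_a, GATE_WEIGHTS)
cond_b = conditioned_message(human_feat, spatial_b, GATE_WEIGHTS)

print("scaled messages parallel:", abs(cross_2d(scaled_a, scaled_b)) < 1e-9)
print("conditioned messages parallel:", abs(cross_2d(cond_a, cond_b)) < 1e-9)
```

In this sketch the two scaled messages always point in the same direction and differ only in magnitude, whereas the gated messages change direction with the box layout, so the two object nodes receive different content from the same human node.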
Pages: 13299-13307
Number of pages: 9
Related papers
50 in total
  • [1] Spatially Conditioned Graphs for Detecting Human-Object Interactions
    Zhang, Frederic Z.
    Campbell, Dylan
    Gould, Stephen
    [J]. Proceedings of the IEEE International Conference on Computer Vision, 2021: 13299-13307
  • [2] Detecting and Recognizing Human-Object Interactions
    Gkioxari, Georgia
    Girshick, Ross
    Dollar, Piotr
    He, Kaiming
    [J]. 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018: 8359-8367
  • [3] Detecting Subtle Human-Object Interactions Using Kinect
    Ubalde, Sebastian
    Liu, Zicheng
    Mejail, Marta
    [J]. PROGRESS IN PATTERN RECOGNITION IMAGE ANALYSIS, COMPUTER VISION, AND APPLICATIONS, CIARP 2014, 2014, 8827: 770-777
  • [4] Detecting Human-Object Interactions via Functional Generalization
    Bansal, Ankan
    Rambhatla, Sai Saketh
    Shrivastava, Abhinav
    Chellappa, Rama
    [J]. THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34: 10460-10469
  • [5] Exploring Predicate Visual Context in Detecting Human-Object Interactions
    Zhang, Frederic Z.
    Yuan, Yuhui
    Campbell, Dylan
    Zhong, Zhuoyao
    Gould, Stephen
    [J]. 2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023: 10377-10387
  • [6] Detecting Human-Object Contact in Images
    Chen, Yixin
    Dwivedi, Sai Kumar
    Black, Michael J.
    Tzionas, Dimitrios
    [J]. 2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023: 17100-17110
  • [7] Detecting human-object interactions in videos by modeling the trajectory of objects and human skeleton
    Li, Qiyue
    Xie, Xuemei
    Zhang, Chen
    Zhang, Jin
    Shi, Guangming
    [J]. NEUROCOMPUTING, 2022, 509: 234-243
  • [8] Detecting Human-Object Relationships in Videos
    Ji, Jingwei
    Desai, Rishi
    Niebles, Juan Carlos
    [J]. 2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021: 8086-8096
  • [9] Detecting Human-Object Interactions with Object-Guided Cross-Modal Calibrated Semantics
    Yuan, Hangjie
    Wang, Mang
    Ni, Dong
    Xu, Liangpeng
    [J]. THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / THE TWELVETH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022: 3206-3214
  • [10] Reconstructing Action-Conditioned Human-Object Interactions Using Commonsense Knowledge Priors
    Wang, Xi
    Li, Gen
    Kuo, Yen-Ling
    Kocabas, Muhammed
    Aksan, Emre
    Hilliges, Otmar
    [J]. 2022 INTERNATIONAL CONFERENCE ON 3D VISION, 3DV, 2022: 353-362