Learning to Detect Human-Object Interactions

被引:272
|
作者
Chao, Yu-Wei [1 ]
Liu, Yunfan [1 ]
Liu, Xieyang [1 ]
Zeng, Huayi [1 ,2 ]
Deng, Jia [1 ]
机构
[1] Univ Michigan, Ann Arbor, MI 48109 USA
[2] Washington Univ, St Louis, MO 63130 USA
关键词
D O I
10.1109/WACV.2018.00048
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We study the problem of detecting human-object interactions (HOI) in static images, defined as predicting a human and an object bounding box with an interaction class label that connects them. HOI detection is a fundamental problem in computer vision as it provides semantic information about the interactions among the detected objects. We introduce HICO-DET, a new large benchmark for HOI detection, by augmenting the current HICO classification benchmark with instance annotations. To solve the task, we propose Human-Object Region-based Convolutional Neural Networks (HO-RCNN). At the core of our HO-RCNN is the Interaction Pattern, a novel DNN input that characterizes the spatial relations between two bounding boxes. Experiments on HICO-DET demonstrate that our HO-RCNN, by exploiting human-object spatial relations through Interaction Patterns, significantly improves the performance of HOI detection over baseline approaches.
引用
收藏
页码:381 / 389
页数:9
相关论文
共 50 条
  • [1] Learning to Detect Human-Object Interactions with Knowledge
    Xu, Bingjie
    Wong, Yongkang
    Li, Junnan
    Zhao, Qi
    Kankanhalli, Mohan S.
    [J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 2019 - 2028
  • [2] Learning Human-Object Interactions by Attention Aggregation
    Gu, Dongzhou
    Cai, Shuang
    Ma, Shiwei
    [J]. SIXTH INTERNATIONAL WORKSHOP ON PATTERN RECOGNITION, 2021, 11913
  • [3] Language for Learning Complex Human-Object Interactions
    Patel, Mitesh
    Ek, Carl Henrik
    Kyriazis, Nikolaos
    Argyros, Antonis
    Miro, Jaime Valls
    Kragic, Danica
    [J]. 2013 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2013, : 4997 - 5002
  • [4] Learning Human-Object Interactions by Graph Parsing Neural Networks
    Qi, Siyuan
    Wang, Wenguan
    Jia, Baoxiong
    Shen, Jianbing
    Zhu, Song-Chun
    [J]. COMPUTER VISION - ECCV 2018, PT IX, 2018, 11213 : 407 - 423
  • [5] Detecting and Recognizing Human-Object Interactions
    Gkioxari, Georgia
    Girshick, Ross
    Dollar, Piotr
    He, Kaiming
    [J]. 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 8359 - 8367
  • [6] Turbo Learning Framework for Human-Object Interactions Recognition and Human Pose Estimation
    Feng, Wei
    Liu, Wentao
    Li, Tong
    Peng, Jing
    Qian, Chen
    Hu, Xiaolin
    [J]. THIRTY-THIRD AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FIRST INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / NINTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2019, : 898 - 905
  • [7] Audio Augmented Reality for Human-Object Interactions
    Yang, Jing
    Mattern, Friedemann
    [J]. UBICOMP/ISWC'19 ADJUNCT: PROCEEDINGS OF THE 2019 ACM INTERNATIONAL JOINT CONFERENCE ON PERVASIVE AND UBIQUITOUS COMPUTING AND PROCEEDINGS OF THE 2019 ACM INTERNATIONAL SYMPOSIUM ON WEARABLE COMPUTERS, 2019, : 408 - 412
  • [8] Predicting Human-Object Interactions in Egocentric Videos
    Benavent-Lledo, Manuel
    Oprea, Sergiu
    Alejandro Castro-Vargas, John
    Mulero-Perez, David
    Garcia-Rodriguez, Jose
    [J]. 2022 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2022,
  • [9] A self-organizing neural network architecture for learning human-object interactions
    Mici, Luiza
    Parisi, German I.
    Wermter, Stefan
    [J]. NEUROCOMPUTING, 2018, 307 : 14 - 24
  • [10] Explicit Modeling of Human-Object Interactions in Realistic Videos
    Prest, Alessandro
    Ferrari, Vittorio
    Schmid, Cordelia
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2013, 35 (04) : 835 - 848