Learning Human-Object Interaction Detection using Interaction Points

被引:148
|
作者
Wang, Tiancai [1 ]
Yang, Tong [1 ]
Danelljan, Martin [2 ]
Khan, Fahad Shahbaz [3 ,4 ]
Zhang, Xiangyu [1 ]
Sun, Jian [1 ]
机构
[1] MEGVII Technol, Beijing, Peoples R China
[2] Swiss Fed Inst Technol, Zurich, Switzerland
[3] IIAI, Abu Dhabi, U Arab Emirates
[4] Linkoping Univ, Linkoping, Sweden
关键词
D O I
10.1109/CVPR42600.2020.00417
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Understanding interactions between humans and objects is one of the fundamental problems in visual classification and an essential step towards detailed scene understanding. Human-object interaction (HOI) detection strives to localize both the human and an object as well as the identification of complex interactions between them. Most existing HOI detection approaches are instance-centric where interactions between all possible human-object pairs are predicted based on appearance features and coarse spatial information. We argue that appearance features alone are insufficient to capture complex human-object interactions. In this paper, we therefore propose a novel fully-convolutional approach that directly detects the interactions between human-object pairs. Our network predicts interaction points, which directly localize and classify the interaction. Paired with the densely predicted interaction vectors, the interactions are associated with human and object detections to obtain final predictions. To the best of our knowledge, we are the first to propose an approach where HOI detection is posed as a keypoint detection and grouping problem. Experiments are performed on two popular benchmarks: V-COCO and HICO-DET. Our approach sets a new state-of-the-art on both datasets. Code is available at https: //github.com/vaes1/IP-Net.
引用
收藏
页码:4115 / 4124
页数:10
相关论文
共 50 条
  • [41] Multi-stream Network for Human-object Interaction Detection
    Wang, Chang
    Sun, Jinyu
    Ma, Shiwei
    Lu, Yuqiu
    Liu, Wang
    INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2021, 35 (08)
  • [42] Polysemy Deciphering Network for Robust Human-Object Interaction Detection
    Zhong, Xubin
    Ding, Changxing
    Qu, Xian
    Tao, Dacheng
    INTERNATIONAL JOURNAL OF COMPUTER VISION, 2021, 129 (06) : 1910 - 1929
  • [43] Detecting Human-Object Interaction via Fabricated Compositional Learning
    Hou, Zhi
    Yu, Baosheng
    Qiao, Yu
    Peng, Xiaojiang
    Tao, Dacheng
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 14641 - 14650
  • [44] Human-object interaction detection via interactive visual-semantic graph learning
    Tongtong WU
    Fuqing DUAN
    Liang CHANG
    Ke LU
    Science China(Information Sciences), 2022, 65 (06) : 81 - 82
  • [45] Pose graph parsing network for human-object interaction detection
    Su, Zhan
    Wang, Yuting
    Xie, Qing
    Yu, Ruiyun
    NEUROCOMPUTING, 2022, 476 : 53 - 62
  • [46] Human-object interaction detection via interactive visual-semantic graph learning
    Wu, Tongtong
    Duan, Fuqing
    Chang, Liang
    Lu, Ke
    SCIENCE CHINA-INFORMATION SCIENCES, 2022, 65 (06)
  • [47] ConsNet: Learning Consistency Graph for Zero-Shot Human-Object Interaction Detection
    Liu, Ye
    Yuan, Junsong
    Chen, Chang Wen
    MM '20: PROCEEDINGS OF THE 28TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, 2020, : 4235 - 4243
  • [48] Cascaded Human-Object Interaction Recognition
    Zhou, Tianfei
    Wang, Wenguan
    Qi, Siyuan
    Ling, Haibin
    Shen, Jianbing
    2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, : 4262 - 4271
  • [49] Segmenting Key Clues to Induce Human-Object Interaction Detection
    Xue, Mingliang
    Wang, Siwei
    Fu, Bing
    Zhao, Zhengyang
    Liu, Tao
    Lai, Lingfeng
    PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2023, PT I, 2024, 14425 : 60 - 71
  • [50] Rethinking vision transformer through human-object interaction detection
    Cheng, Yamin
    Zhao, Zitian
    Wang, Zhi
    Duan, Hancong
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2023, 122