Learning Human-Object Interaction Detection using Interaction Points

被引:148
|
作者
Wang, Tiancai [1 ]
Yang, Tong [1 ]
Danelljan, Martin [2 ]
Khan, Fahad Shahbaz [3 ,4 ]
Zhang, Xiangyu [1 ]
Sun, Jian [1 ]
机构
[1] MEGVII Technol, Beijing, Peoples R China
[2] Swiss Fed Inst Technol, Zurich, Switzerland
[3] IIAI, Abu Dhabi, U Arab Emirates
[4] Linkoping Univ, Linkoping, Sweden
关键词
D O I
10.1109/CVPR42600.2020.00417
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Understanding interactions between humans and objects is one of the fundamental problems in visual classification and an essential step towards detailed scene understanding. Human-object interaction (HOI) detection strives to localize both the human and an object as well as the identification of complex interactions between them. Most existing HOI detection approaches are instance-centric where interactions between all possible human-object pairs are predicted based on appearance features and coarse spatial information. We argue that appearance features alone are insufficient to capture complex human-object interactions. In this paper, we therefore propose a novel fully-convolutional approach that directly detects the interactions between human-object pairs. Our network predicts interaction points, which directly localize and classify the interaction. Paired with the densely predicted interaction vectors, the interactions are associated with human and object detections to obtain final predictions. To the best of our knowledge, we are the first to propose an approach where HOI detection is posed as a keypoint detection and grouping problem. Experiments are performed on two popular benchmarks: V-COCO and HICO-DET. Our approach sets a new state-of-the-art on both datasets. Code is available at https: //github.com/vaes1/IP-Net.
引用
收藏
页码:4115 / 4124
页数:10
相关论文
共 50 条
  • [31] Improving Human-Object Interaction Detection via Phrase Learning and Label Composition
    Li, Zhimin
    Zou, Cheng
    Zhao, Yu
    Li, Boxun
    Zhong, Sheng
    THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / THE TWELVETH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022, : 1509 - 1517
  • [32] Human-Object Interaction Detection with Ratio-Transformer
    Wang, Tianlang
    Lu, Tao
    Fang, Wenhua
    Zhang, Yanduo
    SYMMETRY-BASEL, 2022, 14 (08):
  • [33] Semantic Inference Network for Human-Object Interaction Detection
    Liu, Hongyi
    Mo, Lisha
    Ma, Huimin
    IMAGE AND GRAPHICS, ICIG 2019, PT I, 2019, 11901 : 518 - 529
  • [34] Geometric Features Enhanced Human-Object Interaction Detection
    Zhu, Manli
    Ho, Edmond S. L.
    Chen, Shuang
    Yang, Longzhi
    Shum, Hubert P. H.
    IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2024, 73 : 1 - 1
  • [35] Transferable Interactiveness Knowledge for Human-Object Interaction Detection
    Li, Yong-Lu
    Liu, Xinpeng
    Wu, Xiaoqian
    Huang, Xijie
    Xu, Liang
    Lu, Cewu
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2022, 44 (07) : 3870 - 3882
  • [36] Exploiting Scene Graphs for Human-Object Interaction Detection
    He, Tao
    Gao, Lianli
    Song, Jingkuan
    Li, Yuan-Fang
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 15964 - 15973
  • [37] Weakly-supervised Human-object Interaction Detection
    Sugimoto, Masaki
    Furuta, Ryosuke
    Taniguchi, Yukinobu
    VISAPP: PROCEEDINGS OF THE 16TH INTERNATIONAL JOINT CONFERENCE ON COMPUTER VISION, IMAGING AND COMPUTER GRAPHICS THEORY AND APPLICATIONS - VOL. 5: VISAPP, 2021, : 293 - 300
  • [38] Hierarchical Reasoning Network for Human-Object Interaction Detection
    Gao, Yiming
    Kuang, Zhanghui
    Li, Guanbin
    Zhang, Wayne
    Lin, Liang
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2021, 30 : 8306 - 8317
  • [39] Highlighting Object Category Immunity for the Generalization of Human-Object Interaction Detection
    Liu, Xinpeng
    Li, Yong-Lu
    Lu, Cewu
    THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / THE TWELVETH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022, : 1819 - 1827
  • [40] ERNet: An Efficient and Reliable Human-Object Interaction Detection Network
    Lim, JunYi
    Baskaran, Vishnu Monn
    Lim, Joanne Mun-Yee
    Wong, KokSheik
    See, John
    Tistarelli, Massimo
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2023, 32 : 964 - 979