Relational Context Learning for Human-Object Interaction Detection

被引:7
|
作者
Kim, Sanghyun [1 ]
Jung, Deunsol [1 ]
Cho, Minsu [1 ]
机构
[1] Pohang Univ Sci & Technol POSTECH, Pohang, South Korea
关键词
D O I
10.1109/CVPR52729.2023.00286
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Recent state-of-the-art methods for HOI detection typically build on transformer architectures with two decoder branches, one for human-object pair detection and the other for interaction classification. Such disentangled transformers, however, may suffer from insufficient context exchange between the branches and lead to a lack of context information for relational reasoning, which is critical in discovering HOI instances. In this work, we propose the multiplex relation network (MUREN) that performs rich context exchange between three decoder branches using unary, pairwise, and ternary relations of human, object, and interaction tokens. The proposed method learns comprehensive relational contexts for discovering HOI instances, achieving state-of-the-art performance on two standard benchmarks for HOI detection, HICO-DET and V-COCO.
引用
收藏
页码:2925 / 2934
页数:10
相关论文
共 50 条
  • [1] Lifelong Learning for Human-Object Interaction Detection
    Sun, Bo
    Lu, Sixu
    He, Jun
    Yu, Lejun
    [J]. 2022 IEEE 10TH INTERNATIONAL CONFERENCE ON INFORMATION, COMMUNICATION AND NETWORKS (ICICN 2022), 2022, : 582 - 587
  • [2] Learning Human-Object Interaction Detection using Interaction Points
    Wang, Tiancai
    Yang, Tong
    Danelljan, Martin
    Khan, Fahad Shahbaz
    Zhang, Xiangyu
    Sun, Jian
    [J]. 2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, : 4115 - 4124
  • [3] Affordance Transfer Learning for Human-Object Interaction Detection
    Hou, Zhi
    Yu, Baosheng
    Qiao, Yu
    Peng, Xiaojiang
    Tao, Dacheng
    [J]. 2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 495 - 504
  • [4] Learning Self- and Cross-Triplet Context Clues for Human-Object Interaction Detection
    Ren, Weihong
    Luo, Jinguo
    Jiang, Weibo
    Qu, Liangqiong
    Han, Zhi
    Tian, Jiandong
    Liu, Honghai
    [J]. IEEE Transactions on Circuits and Systems for Video Technology, 2024, 34 (10) : 9760 - 9773
  • [5] Learning Human-Object Interaction Detection via Deformable Transformer
    Cai, Shuang
    Ma, Shiwei
    Gu, Dongzhou
    [J]. 2021 INTERNATIONAL CONFERENCE ON IMAGE, VIDEO PROCESSING, AND ARTIFICIAL INTELLIGENCE, 2021, 12076
  • [6] A Survey of Human-Object Interaction Detection
    Gong X.
    Zhang Z.
    Liu L.
    Ma B.
    Wu K.
    [J]. Xinan Jiaotong Daxue Xuebao/Journal of Southwest Jiaotong University, 2022, 57 (04): : 693 - 704
  • [7] Human-Object Interaction Detection: An Overview
    Wang, Jia
    Shuai, Hong-Han
    Li, Yung-Hui
    Cheng, Wen-Huang
    [J]. IEEE Consumer Electronics Magazine, 2024, 13 (06) : 56 - 72
  • [8] Compositional Learning in Transformer-Based Human-Object Interaction Detection
    Zhuang, Zikun
    Qian, Ruihao
    Xie, Chi
    Liang, Shuang
    [J]. 2023 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, ICME, 2023, : 1038 - 1043
  • [9] From detection to understanding: A survey on representation learning for human-object interaction
    Luo, Tianlun
    Guan, Steven
    Yang, Rui
    Smith, Jeremy
    [J]. NEUROCOMPUTING, 2023, 543
  • [10] Improving Human-Object Interaction Detection via Virtual Image Learning
    Fang, Shuman
    Liu, Shuai
    Li, Jie
    Jiang, Guannan
    Lin, Xianming
    Ji, Rongrong
    [J]. PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023, 2023, : 5455 - 5463