Learning Self- and Cross-Triplet Context Clues for Human-Object Interaction Detection

被引:0
|
作者
Ren, Weihong [1 ,2 ]
Luo, Jinguo [1 ]
Jiang, Weibo [1 ]
Qu, Liangqiong [3 ]
Han, Zhi [2 ]
Tian, Jiandong [2 ]
Liu, Honghai [1 ]
机构
[1] Harbin Institute of Technology, State Key Laboratory of Robotics and Systems, School of Mechanical Engineering and Automation, Shenzhen,518055, China
[2] Shenyang Institute of Automation, Chinese Academy of Sciences, State Key Laboratory of Robotics, Shenyang,110016, China
[3] The University of Hong Kong, Department of Statistics and Actuarial Science, Hong Kong, Hong Kong
关键词
D O I
10.1109/TCSVT.2024.3402247
中图分类号
学科分类号
摘要
Human-Object Interaction (HOI) detection aims to infer interactions between humans and objects, and it is very important for scene analysis and understanding. The existing methods usually focus on exploring instance-level (e.g., object appearance) or interaction-level (e.g., action semantic) features to conduct interaction prediction. However, most of these methods only consider the self-triplet feature aggregation, which may lead to learning ambiguity without exploring the cross-triplet context exchange. In this paper, from both visual and textual perspectives, we propose a novel method to jointly explore self- and cross-triplet interaction context clues for HOI detection. First, we employ a graph neural network to perform self-triplet aggregation, where human and object features represent graph nodes and visual interaction feature and textual prior knowledge are acted as two different edges. Furthermore, we also attempt to explore cross-triplet context exchange by incorporating symbiotic and layout relationships among different HOI triplets. Extensive experiments on two benchmarks demonstrate that our proposed method outperforms the state-of-the-art ones and achieves the impressive performance of 40.32 mAP on HICO-DET and 69.1 mAP on V-COCO datasets, respectively. © 1991-2012 IEEE.
引用
收藏
页码:9760 / 9773
相关论文
共 50 条
  • [21] Diagnosing Rarity in Human-object Interaction Detection
    Kilickaya, Mert
    Smeulders, Arnold
    [J]. 2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW 2020), 2020, : 3956 - 3960
  • [22] Parallel Queries for Human-Object Interaction Detection
    Chen, Junwen
    Yanai, Keiji
    [J]. PROCEEDINGS OF THE 4TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA IN ASIA, MMASIA 2022, 2022,
  • [23] Human-Object Interaction Detection with Missing Objects
    Kogashi, Kaen
    Wu, Yang
    Nobuhara, Shohei
    Nishino, Ko
    [J]. PROCEEDINGS OF 17TH INTERNATIONAL CONFERENCE ON MACHINE VISION APPLICATIONS (MVA 2021), 2021,
  • [24] Discovering Human-Object Interaction Concepts via Self-Compositional Learning
    Hou, Zhi
    Yu, Baosheng
    Tao, Dacheng
    [J]. COMPUTER VISION - ECCV 2022, PT XXVII, 2022, 13687 : 461 - 478
  • [25] Human-Object Interaction Detection: A Survey of Deep Learning-Based Methods
    Li, Fang
    Wang, Shunli
    Wang, Shuaiping
    Zhang, Lihua
    [J]. ARTIFICIAL INTELLIGENCE, CICAI 2022, PT I, 2022, 13604 : 441 - 452
  • [26] Improving Human-Object Interaction Detection via Phrase Learning and Label Composition
    Li, Zhimin
    Zou, Cheng
    Zhao, Yu
    Li, Boxun
    Zhong, Sheng
    [J]. THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / THE TWELVETH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022, : 1509 - 1517
  • [27] Enhanced Transformer Interaction Components for Human-Object Interaction Detection
    Zhang, JinHui
    Zhao, Yuxiao
    Zhang, Xian
    Wang, Xiang
    Zhao, Yuxuan
    Wang, Peng
    Hu, Jian
    [J]. ACM SYMPOSIUM ON SPATIAL USER INTERACTION, SUI 2023, 2023,
  • [28] Three-stream network with context convolution module for human-object interaction detection
    Siadari, Thomhert S.
    Han, Mikyong
    Yoon, Hyunjin
    [J]. ETRI JOURNAL, 2020, 42 (02) : 230 - 238
  • [29] Learning Asynchronous and Sparse Human-Object Interaction in Videos
    Morais, Romero
    Vuong Le
    Venkatesh, Svetha
    Truyen Tran
    [J]. 2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 16036 - 16045
  • [30] Category Query Learning for Human-Object Interaction Classification
    Xie, Chi
    Zeng, Fangao
    Hu, Yue
    Liang, Shuang
    Wei, Yichen
    [J]. 2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 15275 - 15284