Semantic Inference Network for Human-Object Interaction Detection

被引:0
|
作者
Liu, Hongyi [1 ]
Mo, Lisha [1 ]
Ma, Huimin [1 ]
机构
[1] Tsinghua Univ, Dept Elect Engn, Beijing, Peoples R China
来源
基金
中国国家自然科学基金;
关键词
Human-object interaction; Visual relationship detection; Word embedding;
D O I
10.1007/978-3-030-34120-6_42
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Recently many efforts have been made to understand the scenes in images. The interactions between human and objects are usually of great significance to scene understanding. In this paper, we focus on the task of detecting human-object interactions (HOI), which is to detect triplets < human, verb, object > in challenging daily images. We propose a novel model which introduces a semantic stream and a new form of loss function. Our intuition is that the semantic information of object classes is beneficial to HOI detection. Semantic information is extracted by embedding the category information of objects with pre-trained BERT model. On the other hand, we find that the HOI task suffers severely from extreme imbalance between positive and negative samples. We propose a weighted focal loss (WFL) to tackle this problem. The results show that our method achieves a gain of 5% compared with our baseline.
引用
下载
收藏
页码:518 / 529
页数:12
相关论文
共 50 条
  • [31] Category-Aware Transformer Network for Better Human-Object Interaction Detection
    Dong, Leizhen
    Li, Zhimin
    Xu, Kunlun
    Zhang, Zhijun
    Yan, Luxin
    Zhong, Sheng
    Zou, Xu
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 19516 - 19525
  • [32] Enhanced Transformer Interaction Components for Human-Object Interaction Detection
    Zhang, JinHui
    Zhao, Yuxiao
    Zhang, Xian
    Wang, Xiang
    Zhao, Yuxuan
    Wang, Peng
    Hu, Jian
    ACM SYMPOSIUM ON SPATIAL USER INTERACTION, SUI 2023, 2023,
  • [33] Learning Human-Object Interaction Detection using Interaction Points
    Wang, Tiancai
    Yang, Tong
    Danelljan, Martin
    Khan, Fahad Shahbaz
    Zhang, Xiangyu
    Sun, Jian
    2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, : 4115 - 4124
  • [34] Relational Context Learning for Human-Object Interaction Detection
    Kim, Sanghyun
    Jung, Deunsol
    Cho, Minsu
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR, 2023, : 2925 - 2934
  • [35] DSSF: Dynamic Semantic Sampling and Fusion for One-Stage Human-Object Interaction Detection
    Gu, Dongzhou
    Ma, Shiwei
    Cai, Shuang
    IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2022, 71
  • [36] Neural-Logic Human-Object Interaction Detection
    Li, Liulei
    Wei, Jianan
    Wang, Wenguan
    Yang, Yi
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
  • [37] Transferable Interactiveness Knowledge for Human-Object Interaction Detection
    Li, Yong-Lu
    Zhou, Siyuan
    Huang, Xijie
    Xu, Liang
    Ma, Ze
    Fang, Hao-Shu
    Wang, Yan-Feng
    Lu, Cewu
    2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 3580 - 3589
  • [38] Human-Object Interaction Detection Based on Star Graph
    Cai, Shuang
    Ma, Shiwei
    Gu, Dongzhou
    Wang, Chang
    INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2022, 36 (09)
  • [39] Affordance Transfer Learning for Human-Object Interaction Detection
    Hou, Zhi
    Yu, Baosheng
    Qiao, Yu
    Peng, Xiaojiang
    Tao, Dacheng
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 495 - 504
  • [40] Structured LSTM for Human-Object Interaction Detection and Anticipation
    Anh Minh Truong
    Yoshitaka, Atsuo
    2017 14TH IEEE INTERNATIONAL CONFERENCE ON ADVANCED VIDEO AND SIGNAL BASED SURVEILLANCE (AVSS), 2017,