Affordance Transfer Learning for Human-Object Interaction Detection

Cited by: 44
Authors
Hou, Zhi [1]
Yu, Baosheng [1]
Qiao, Yu [2, 3]
Peng, Xiaojiang [4]
Tao, Dacheng [1]
Affiliations
[1] Univ Sydney, Fac Engn, Sch Comp Sci, Sydney, NSW, Australia
[2] Chinese Acad Sci, Shenzhen Inst Adv Technol, Beijing, Peoples R China
[3] Shanghai AI Lab, Shanghai, Peoples R China
[4] Shenzhen Technol Univ, Shenzhen, Peoples R China
Funding
Australian Research Council
Keywords
DOI
10.1109/CVPR46437.2021.00056
Chinese Library Classification
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
Reasoning about human-object interactions (HOI) is essential for deeper scene understanding, and object affordances (or functionalities) are of great importance for humans in discovering unseen HOIs with novel objects. Inspired by this, we introduce an affordance transfer learning approach to jointly detect HOIs with novel objects and recognize affordances. Specifically, HOI representations can be decoupled into a combination of affordance and object representations, making it possible to compose novel interactions by combining affordance representations with novel object representations from additional images, i.e., transferring the affordance to novel objects. With the proposed affordance transfer learning, the model is also capable of inferring the affordances of novel objects from known affordance representations. The proposed method can thus be used to 1) improve the performance of HOI detection, especially for HOIs with unseen objects; and 2) infer the affordances of novel objects. Experimental results on two datasets, HICO-DET and HOI-COCO (derived from V-COCO), demonstrate significant improvements over recent state-of-the-art methods for HOI detection and object affordance detection.
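The composition step described in the abstract can be illustrated with a toy sketch. This is not the authors' implementation: the function name `compose_hoi_features`, the dictionary layout, the use of plain concatenation as the combining operation, and the `valid_pairs` filter are all assumptions made for illustration, since the abstract does not specify how the two representations are combined.

```python
from itertools import product


def compose_hoi_features(affordance_feats, object_feats, valid_pairs):
    """Pair each affordance (verb) feature with each object feature.

    Hypothetical helper: only (verb, object) combinations listed in
    valid_pairs are kept, and the two feature lists are concatenated
    (an assumed stand-in for whatever combination the model uses).
    """
    composed = {}
    for (verb, a_feat), (obj, o_feat) in product(
        affordance_feats.items(), object_feats.items()
    ):
        if (verb, obj) in valid_pairs:
            composed[(verb, obj)] = a_feat + o_feat  # list concatenation
    return composed


# Toy affordance features, imagined as extracted from annotated HOI images.
affordance_feats = {"ride": [0.9, 0.1], "eat": [0.2, 0.8]}
# Toy feature for a novel object, imagined as coming from an additional image.
novel_object_feats = {"horse": [0.5, 0.5, 0.0]}
# Assumed set of plausible verb-object combinations.
valid_pairs = {("ride", "horse")}

composed = compose_hoi_features(affordance_feats, novel_object_feats, valid_pairs)
print(composed)
```

In this sketch the "ride" affordance is transferred to the novel object "horse", while the implausible pair ("eat", "horse") is filtered out; the composed feature could then serve as an extra training sample for an HOI classifier.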
Pages: 495-504
Page count: 10