Pairwise Body-Part Attention for Recognizing Human-Object Interactions

Cited by: 71
Authors
Fang, Hao-Shu [1]
Cao, Jinkun [1]
Tai, Yu-Wing [2]
Lu, Cewu [1]
Affiliations
[1] Shanghai Jiao Tong Univ, Shanghai, Peoples R China
[2] Tencent YouTu Lab, Shanghai, Peoples R China
Funding
National Natural Science Foundation of China; National Key Research and Development Program of China
Keywords
Human-object interactions; Body-part correlations; Attention model; Action recognition
DOI
10.1007/978-3-030-01249-6_4
Chinese Library Classification (CLC) number
TP18 [Artificial Intelligence Theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
In human-object interaction (HOI) recognition, conventional methods treat the human body as a whole and pay uniform attention to the entire body region. They ignore the fact that a human normally interacts with an object using some parts of the body. In this paper, we argue that different body parts should receive different attention in HOI recognition, and that the correlations between body parts should also be considered, because our body parts always work collaboratively. We propose a new pairwise body-part attention model that learns to focus on crucial parts and their correlations for HOI recognition. The model introduces a novel attention-based feature selection method and a feature representation scheme that captures pairwise correlations between body parts. Our approach achieves a 10% relative improvement (36.1 mAP to 39.9 mAP) over the state-of-the-art results in HOI recognition on the HICO dataset. We will make our model and source code publicly available.
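The idea of attending to pairs of body parts can be illustrated with a minimal sketch. This is not the authors' architecture: the part names, the feature dimension, the linear scoring vector `weights`, and the function `pairwise_part_attention` are all hypothetical, chosen only to show how pairwise features might be scored, normalized with a softmax, and filtered to the top-scoring pairs.

```python
import math
import random
from itertools import combinations


def pairwise_part_attention(part_feats, weights, top_k):
    """Score every pair of body parts and keep the top_k pairs.

    part_feats: dict mapping a part name to a feature vector (list of floats).
    weights:    hypothetical scoring vector, length = 2 * feature dimension.
    Returns (attn, pooled): attention weight per pair, and the
    attention-weighted features of the selected pairs.
    """
    pairs = list(combinations(sorted(part_feats), 2))
    # A pair is represented by concatenating the two part features.
    pair_feats = {p: part_feats[p[0]] + part_feats[p[1]] for p in pairs}
    # Linear score per pair (stand-in for a learned attention module).
    scores = [sum(w * x for w, x in zip(weights, pair_feats[p])) for p in pairs]
    # Numerically stable softmax over all pairs.
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    attn = {p: e / total for p, e in zip(pairs, exps)}
    # Feature selection: keep only the top_k most attended pairs.
    selected = sorted(pairs, key=lambda p: attn[p], reverse=True)[:top_k]
    pooled = {p: [attn[p] * x for x in pair_feats[p]] for p in selected}
    return attn, pooled


# Toy input: four hypothetical parts with random 3-dimensional features.
random.seed(0)
dim = 3
parts = {name: [random.gauss(0, 1) for _ in range(dim)]
         for name in ["head", "left_hand", "right_hand", "torso"]}
weights = [random.gauss(0, 1) for _ in range(2 * dim)]

attn, pooled = pairwise_part_attention(parts, weights, top_k=2)
for pair in pooled:
    print(pair, round(attn[pair], 3))
```

With four parts there are six candidate pairs; the attention weights sum to one, and only the two highest-weighted pairs contribute features downstream. In a trained model the scoring vector would be learned rather than sampled at random.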
Pages: 52-68
Number of pages: 17