Pairwise Body-Part Attention for Recognizing Human-Object Interactions

Cited by: 71
Authors
Fang, Hao-Shu [1]
Cao, Jinkun [1]
Tai, Yu-Wing [2]
Lu, Cewu [1]
Affiliations
[1] Shanghai Jiao Tong Univ, Shanghai, Peoples R China
[2] Tencent YouTu Lab, Shanghai, Peoples R China
Funding
National Natural Science Foundation of China; National Key Research and Development Program of China
Keywords
Human-object interactions; Body-part correlations; Attention model; Action recognition
DOI
10.1007/978-3-030-01249-6_4
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
In human-object interaction (HOI) recognition, conventional methods treat the human body as a whole and pay uniform attention to the entire body region. They ignore the fact that a human normally interacts with an object using only some parts of the body. In this paper, we argue that different body parts should receive different attention in HOI recognition, and that the correlations between body parts should also be considered, because body parts always work collaboratively. We propose a new pairwise body-part attention model that learns to focus on crucial parts and their correlations for HOI recognition. The model introduces a novel attention-based feature selection method and a feature representation scheme that captures pairwise correlations between body parts. Our approach achieves a 10% relative improvement (36.1 mAP to 39.9 mAP) over the state-of-the-art results in HOI recognition on the HICO dataset. We will make our model and source code publicly available.
Pages: 52-68
Number of pages: 17