Pairwise Body-Part Attention for Recognizing Human-Object Interactions

被引:71
|
作者
Fang, Hao-Shu [1 ]
Cao, Jinkun [1 ]
Tai, Yu-Wing [2 ]
Lu, Cewu [1 ]
机构
[1] Shanghai Jiao Tong Univ, Shanghai, Peoples R China
[2] Tencent YouTu Lab, Shanghai, Peoples R China
来源
基金
中国国家自然科学基金; 国家重点研发计划;
关键词
Human-object interactions; Body-part correlations; Attention model; ACTION RECOGNITION;
D O I
10.1007/978-3-030-01249-6_4
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In human-object interactions (HOI) recognition, conventional methods consider the human body as a whole and pay a uniform attention to the entire body region. They ignore the fact that normally, human interacts with an object by using some parts of the body. In this paper, we argue that different body parts should be paid with different attention in HOI recognition, and the correlations between different body parts should be further considered. This is because our body parts always work collaboratively. We propose a new pairwise body-part attention model which can learn to focus on crucial parts, and their correlations for HOI recognition. A novel attention based feature selection method and a feature representation scheme that can capture pairwise correlations between body parts are introduced in the model. Our proposed approach achieved 10% relative improvement (36.1mAP -> 39.9mAP) over the state-of-the-art results in HOI recognition on the HICO dataset. We will make our model and source codes publicly available.
引用
收藏
页码:52 / 68
页数:17
相关论文
共 50 条
  • [41] Exploring Predicate Visual Context in Detecting of Human-Object Interactions
    Zhang, Frederic Z.
    Yuan, Yuhui
    Campbell, Dylan
    Zhong, Zhuoyao
    Gould, Stephen
    [J]. 2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 10377 - 10387
  • [42] Novel Anomalous Event Detection based on Human-object Interactions
    Colque, Rensso Mora
    Caetano, Carlos
    de Melo, Victor C.
    Chavez, Guillermo Camara
    Schwartz, William Robson
    [J]. PROCEEDINGS OF THE 13TH INTERNATIONAL JOINT CONFERENCE ON COMPUTER VISION, IMAGING AND COMPUTER GRAPHICS THEORY AND APPLICATIONS (VISIGRAPP 2018), VOL 5: VISAPP, 2018, : 293 - 300
  • [43] Modeling 4D Human-Object Interactions for Event and Object Recognition
    Wei, Ping
    Zhao, Yibiao
    Zheng, Nanning
    Zhu, Song-Chun
    [J]. 2013 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2013, : 3272 - 3279
  • [44] Learning Human-Object Interactions by Graph Parsing Neural Networks
    Qi, Siyuan
    Wang, Wenguan
    Jia, Baoxiong
    Shen, Jianbing
    Zhu, Song-Chun
    [J]. COMPUTER VISION - ECCV 2018, PT IX, 2018, 11213 : 407 - 423
  • [45] NeuralHOFusion: Neural Volumetric Rendering under Human-object Interactions
    Jiang, Yuheng
    Jiang, Suyi
    Sun, Guoxing
    Su, Zhuo
    Guo, Kaiwen
    Wu, Minye
    Yu, Jingyi
    Xu, Lan
    [J]. 2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 6145 - 6155
  • [46] Causality Inspired Retrieval of Human-object Interactions from Video
    Zhou, Liting
    Liu, Jianquan
    Nishimura, Shoji
    Antony, Joseph
    Gurrin, Cathal
    [J]. 2019 INTERNATIONAL CONFERENCE ON CONTENT-BASED MULTIMEDIA INDEXING (CBMI), 2019,
  • [47] Graph-based method for human-object interactions detection
    Xia, Li-min
    Wu, Wei
    [J]. JOURNAL OF CENTRAL SOUTH UNIVERSITY, 2021, 28 (01) : 205 - 218
  • [48] Human action recognition using an ensemble of body-part detectors
    Chakraborty, Bhaskar
    Bagdanov, Andrew D.
    Gonzalez, Jordi
    Roca, Xavier
    [J]. EXPERT SYSTEMS, 2013, 30 (02) : 101 - 114
  • [49] Hierarchical Generation of Human-Object Interactions with Diffusion Probabilistic Models
    Pi, Huaijin
    Peng, Sida
    Yang, Minghui
    Zhou, Xiaowei
    Bao, Hujun
    [J]. 2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 15015 - 15027
  • [50] Visualizing Thermal Traces to Reveal Histories of Human-Object Interactions
    Amemiya, Tomohiro
    [J]. UNIVERSAL ACCESS IN HUMAN-COMPUTER INTERACTION, PT II, PROCEEDINGS: INTELLIGENT AND UBIQUITOUS INTERACTION ENVIRONMENTS, 2009, 5615 : 477 - 482