Modeling Mutual Context of Object and Human Pose in Human-Object Interaction Activities

被引:175
|
作者
Yao, Bangpeng [1 ]
Li Fei-Fei [1 ]
机构
[1] Stanford Univ, Dept Comp Sci, Stanford, CA 94305 USA
关键词
D O I
10.1109/CVPR.2010.5540235
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Detecting objects in cluttered scenes and estimating articulated human body parts are two challenging problems in computer vision. The difficulty is particularly pronounced in activities involving human-object interactions (e. g. playing tennis), where the relevant object tends to be small or only partially visible, and the human body parts are often self-occluded. We observe, however, that objects and human poses can serve as mutual context to each other - recognizing one facilitates the recognition of the other. In this paper we propose a new random field model to encode the mutual context of objects and human poses in human-object interaction activities. We then cast the model learning task as a structure learning problem, of which the structural connectivity between the object, the overall human pose, and different body parts are estimated through a structure search approach, and the parameters of the model are estimated by a new max-margin algorithm. On a sports data set of six classes of human-object interactions [12], we show that our mutual context model significantly outperforms state-of-the-art in detecting very difficult objects and human poses.
引用
收藏
页码:17 / 24
页数:8
相关论文
共 50 条
  • [1] Human-Object Interaction Recognition Based on Modeling Context
    Shuyang Li
    Wei Liang
    Qun Zhang
    [J]. Journal of Beijing Institute of Technology, 2017, 26 (02) : 215 - 222
  • [2] Human-Object Interaction Recognition Based on Modeling Context
    Li, Shuyang
    Liang, Wei
    Zhang, Qun
    [J]. Journal of Beijing Institute of Technology (English Edition), 2017, 26 (02): : 215 - 222
  • [3] A SIMILARITY MEASURE FOR ANALYZING HUMAN ACTIVITIES USING HUMAN-OBJECT INTERACTION CONTEXT
    Amiri, S. Mohsen
    Pourazad, Mahsa T.
    Nasiopoulos, Panos
    Leung, Victor C. M.
    [J]. 2014 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2014, : 2368 - 2372
  • [4] Recognizing Human-Object Interactions in Still Images by Modeling the Mutual Context of Objects and Human Poses
    Yao, Bangpeng
    Fei-Fei, Li
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2012, 34 (09) : 1691 - 1703
  • [5] Pose graph parsing network for human-object interaction detection
    Su, Zhan
    Wang, Yuting
    Xie, Qing
    Yu, Ruiyun
    [J]. NEUROCOMPUTING, 2022, 476 : 53 - 62
  • [6] Relational Context Learning for Human-Object Interaction Detection
    Kim, Sanghyun
    Jung, Deunsol
    Cho, Minsu
    [J]. 2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR, 2023, : 2925 - 2934
  • [7] Pose attention and object semantic representation-based human-object interaction detection network
    Deng, Wei-Mo
    Zhang, Hong-Bo
    Lei, Qing
    Du, Ji-Xiang
    Huang, Min
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2022, 81 (27) : 39453 - 39470
  • [8] Pose attention and object semantic representation-based human-object interaction detection network
    Wei-Mo Deng
    Hong-Bo Zhang
    Qing Lei
    Ji-Xiang Du
    Min Huang
    [J]. Multimedia Tools and Applications, 2022, 81 : 39453 - 39470
  • [9] Cascaded Human-Object Interaction Recognition
    Zhou, Tianfei
    Wang, Wenguan
    Qi, Siyuan
    Ling, Haibin
    Shen, Jianbing
    [J]. 2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, : 4262 - 4271
  • [10] A Survey of Human-Object Interaction Detection
    Gong X.
    Zhang Z.
    Liu L.
    Ma B.
    Wu K.
    [J]. Xinan Jiaotong Daxue Xuebao/Journal of Southwest Jiaotong University, 2022, 57 (04): : 693 - 704