Recognizing Human-Object Interactions in Still Images by Modeling the Mutual Context of Objects and Human Poses

被引:148
|
作者
Yao, Bangpeng [1 ]
Fei-Fei, Li [1 ]
机构
[1] Stanford Univ, Dept Comp Sci, Stanford, CA 94305 USA
基金
美国国家科学基金会;
关键词
Mutual context; action recognition; human pose estimation; object detection; conditional random field;
D O I
10.1109/TPAMI.2012.67
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Detecting objects in cluttered scenes and estimating articulated human body parts from 2D images are two challenging problems in computer vision. The difficulty is particularly pronounced in activities involving human-object interactions (e.g., playing tennis), where the relevant objects tend to be small or only partially visible and the human body parts are often self-occluded. We observe, however, that objects and human poses can serve as mutual context to each other-recognizing one facilitates the recognition of the other. In this paper, we propose a mutual context model to jointly model objects and human poses in human-object interaction activities. In our approach, object detection provides a strong prior for better human pose estimation, while human pose estimation improves the accuracy of detecting the objects that interact with the human. On a six-class sports data set and a 24-class people interacting with musical instruments data set, we show that our mutual context model outperforms state of the art in detecting very difficult objects and estimating human poses, as well as classifying human-object interaction activities.
引用
收藏
页码:1691 / 1703
页数:13
相关论文
共 50 条
  • [1] HICO: A Benchmark for Recognizing Human-Object Interactions in Images
    Chao, Yu-Wei
    Wang, Zhan
    He, Yugeng
    Wang, Jiaxuan
    Deng, Jia
    [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, : 1017 - 1025
  • [2] Modeling Mutual Context of Object and Human Pose in Human-Object Interaction Activities
    Yao, Bangpeng
    Li Fei-Fei
    [J]. 2010 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2010, : 17 - 24
  • [3] Detecting and Recognizing Human-Object Interactions
    Gkioxari, Georgia
    Girshick, Ross
    Dollar, Piotr
    He, Kaiming
    [J]. 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 8359 - 8367
  • [4] Detecting human-object interactions in videos by modeling the trajectory of objects and human skeleton
    Li, Qiyue
    Xie, Xuemei
    Zhang, Chen
    Zhang, Jin
    Shi, Guangming
    [J]. NEUROCOMPUTING, 2022, 509 : 234 - 243
  • [5] Recognizing Human-Object Interactions via Target Localization
    Cho, Sunyoung
    Park, Jihun
    Shin, Young Sook
    Lee, Sang-ho
    [J]. 2018 18TH INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION AND SYSTEMS (ICCAS), 2018, : 836 - 840
  • [6] An Intelligent Framework for Recognizing Social Human-Object Interactions
    Alarfaj, Mohammed
    Waheed, Manahil
    Ghadi, Yazeed Yasin
    al Shloul, Tamara
    Alsuhibany, Suliman A.
    Jalal, Ahmad
    Park, Jeongmin
    [J]. CMC-COMPUTERS MATERIALS & CONTINUA, 2022, 73 (01): : 1207 - 1223
  • [7] Recognizing Human Actions from Still Images with Latent Poses
    Yang, Weilong
    Wang, Yang
    Mori, Greg
    [J]. 2010 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2010, : 2030 - 2037
  • [8] Recognizing Human-Object Interactions Using Sparse Subspace Clustering
    Bogun, Ivan
    Ribeiro, Eraldo
    [J]. COMPUTER ANALYSIS OF IMAGES AND PATTERNS, PT I, 2013, 8047 : 409 - 416
  • [9] HUMAN-OBJECT RELATION NETWORK FOR ACTION RECOGNITION IN STILL IMAGES
    Ma, Wentao
    Liang, Shuang
    [J]. 2020 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), 2020,
  • [10] Human-Object Interaction Recognition Based on Modeling Context
    Shuyang Li
    Wei Liang
    Qun Zhang
    [J]. Journal of Beijing Institute of Technology, 2017, 26 (02) : 215 - 222