Recognizing Human-Object Interactions in Still Images by Modeling the Mutual Context of Objects and Human Poses

被引：148

作者：

Yao, Bangpeng ^{[1
]}

Fei-Fei, Li ^{[1
]}

机构：

[1] Stanford Univ, Dept Comp Sci, Stanford, CA 94305 USA

来源：

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE | 2012年 / 34卷 / 09期

基金：

美国国家科学基金会;

关键词：

Mutual context; action recognition; human pose estimation; object detection; conditional random field;

D O I：

10.1109/TPAMI.2012.67

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Detecting objects in cluttered scenes and estimating articulated human body parts from 2D images are two challenging problems in computer vision. The difficulty is particularly pronounced in activities involving human-object interactions (e.g., playing tennis), where the relevant objects tend to be small or only partially visible and the human body parts are often self-occluded. We observe, however, that objects and human poses can serve as mutual context to each other-recognizing one facilitates the recognition of the other. In this paper, we propose a mutual context model to jointly model objects and human poses in human-object interaction activities. In our approach, object detection provides a strong prior for better human pose estimation, while human pose estimation improves the accuracy of detecting the objects that interact with the human. On a six-class sports data set and a 24-class people interacting with musical instruments data set, we show that our mutual context model outperforms state of the art in detecting very difficult objects and estimating human poses, as well as classifying human-object interaction activities.

引用

页码：1691 / 1703

页数：13

共 50 条

[1] HICO: A Benchmark for Recognizing Human-Object Interactions in Images
Chao, Yu-Wei
Wang, Zhan
He, Yugeng
Wang, Jiaxuan
Deng, Jia
[J]. 2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, : 1017 - 1025
[2] Modeling Mutual Context of Object and Human Pose in Human-Object Interaction Activities
Yao, Bangpeng
Li Fei-Fei
[J]. 2010 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2010, : 17 - 24
[3] Detecting and Recognizing Human-Object Interactions
Gkioxari, Georgia
Girshick, Ross
Dollar, Piotr
He, Kaiming
[J]. 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 8359 - 8367
[4] Detecting human-object interactions in videos by modeling the trajectory of objects and human skeleton
Li, Qiyue
Xie, Xuemei
Zhang, Chen
Zhang, Jin
Shi, Guangming
[J]. NEUROCOMPUTING, 2022, 509 : 234 - 243
[5] Recognizing Human-Object Interactions via Target Localization
Cho, Sunyoung
Park, Jihun
Shin, Young Sook
Lee, Sang-ho
[J]. 2018 18TH INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION AND SYSTEMS (ICCAS), 2018, : 836 - 840
[6] An Intelligent Framework for Recognizing Social Human-Object Interactions
Alarfaj, Mohammed
Waheed, Manahil
Ghadi, Yazeed Yasin
al Shloul, Tamara
Alsuhibany, Suliman A.
Jalal, Ahmad
Park, Jeongmin
[J]. CMC-COMPUTERS MATERIALS & CONTINUA, 2022, 73 (01): : 1207 - 1223
[7] Recognizing Human Actions from Still Images with Latent Poses
Yang, Weilong
Wang, Yang
Mori, Greg
[J]. 2010 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2010, : 2030 - 2037
[8] Recognizing Human-Object Interactions Using Sparse Subspace Clustering
Bogun, Ivan
Ribeiro, Eraldo
[J]. COMPUTER ANALYSIS OF IMAGES AND PATTERNS, PT I, 2013, 8047 : 409 - 416
[9] HUMAN-OBJECT RELATION NETWORK FOR ACTION RECOGNITION IN STILL IMAGES
Ma, Wentao
Liang, Shuang
[J]. 2020 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), 2020,
[10] Human-Object Interaction Recognition Based on Modeling Context
Shuyang Li
Wei Liang
Qun Zhang
[J]. Journal of Beijing Institute of Technology, 2017, 26 (02) : 215 - 222

← 1 2 3 4 5 →