Human pose regression by combining indirect part detection and contextual information

被引：113

作者：

Luvizon, Diogo C. ^{[1
,2
]}

Labia, Hedi ^{[1
,3
]}

Picard, David ^{[1
,4
]}

机构：

[1] Paris Seine Univ, CNRS, ENSEA, ETIS UMR 8051, F-95000 Cergy, France

[2] Samsung Res Inst, Adv Technol, Campinas, SP, Brazil

[3] Univ Paris Saclay, IBISC, Univ Ewy Val Essonne, Paris, France

[4] UPE, Ecole Ponts, UMR 8049, LIGM, Champs Sur Marne, France

来源：

COMPUTERS & GRAPHICS-UK | 2019年 / 85卷

关键词：

Human pose estimation; Neural nets; vision; PICTORIAL STRUCTURES;

D O I：

10.1016/j.cag.2019.09.002

中图分类号：

TP31 [计算机软件];

学科分类号：

081202 ; 0835 ;

摘要：

In this paper, we tackle the problem of human pose estimation from still images, which is a very active topic, specially due to its several applications, from image annotation to human-machine interface. We use the soft-argmax function to convert feature maps directly to body joint coordinates, resulting in a fully differentiable framework. Our method is able to learn heat maps representations indirectly, without additional steps of artificial ground truth generation. Consequently, contextual information can be included to the pose predictions in a seamless way. We evaluated our method on two challenging datasets, the Leeds Sports Poses (LSP) and the MPII Human Pose datasets, reaching the best performance among all the existing regression methods. (C) 2019 Elsevier Ltd. All rights reserved.

引用

页码：15 / 22

页数：8

共 50 条

[41] Combining models of pose and dynamics for human motion recognition
Filipovych, Roynan
Ribeiro, Eraldo
ADVANCES IN VISUAL COMPUTING, PROCEEDINGS, PT 2, 2007, 4842 : 21 - 32
[42] COMBINING MULTIMODAL AND TEMPORAL CONTEXTUAL INFORMATION FOR SEMANTIC VIDEO ANALYSIS
Papadopoulos, Georgios Th.
Mezaris, Vasileios
Kompatsiaris, Ioannis
Strintzis, Michael G.
2009 16TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, VOLS 1-6, 2009, : 4325 - +
[43] Combining Polarimetric and Contextual information using Autoassociative Neural Networks
Avezzano, Ruggero Giuseppe
Del Frate, Fabio
Schiavon, Giovanni
SAR IMAGE ANALYSIS, MODELING, AND TECHNIQUES XIII, 2013, 8891
[44] Combining Visual and Contextual Information for Fraudulent Online Store Classification
Mostard, Wouter
Zijlema, Bastiaan
Wiering, Marco
2019 IEEE/WIC/ACM INTERNATIONAL CONFERENCE ON WEB INTELLIGENCE (WI 2019), 2019, : 84 - 90
[45] Human fall detection algorithm combining information enhancement and feature fusion
Wang, Fengsui
Shao, Kaili
Yang, Haiyan
Zhongguo Guanxing Jishu Xuebao/Journal of Chinese Inertial Technology, 2024, 32 (08): : 771 - 778
[46] Global contextual attention for pure regression object detection
Fan, Bingbing
Shao, Mingwen
Li, Yunhao
Li, Cunhe
INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2022, 13 (08) : 2189 - 2197
[47] Global contextual attention for pure regression object detection
Bingbing Fan
Mingwen Shao
Yunhao Li
Cunhe Li
International Journal of Machine Learning and Cybernetics, 2022, 13 : 2189 - 2197
[48] Contextual and Human Factors in Information Fusion
Garcia Herrero, Jesus
Patricio, Miguel A.
Molina, Jose M.
Cardoso, Luiz A.
HUMAN SYSTEMS INTEGRATION TO ENHANCE MARITIME DOMAIN AWARENESS FOR PORT/HARBOUR SECURITY, 2010, 28 : 79 - 92
[49] Action Detection System Based on Pose Information
Kawai, Ryo
Yoshida, Noboru
Liu, Jianquan
PROCEEDINGS OF THE 4TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA IN ASIA, MMASIA 2022, 2022,
[50] Human Pose Regression Through Multiview Visual Fusion
Zhao, Xu
Fu, Yun
Ning, Huazhong
Liu, Yuncai
Huang, Thomas S.
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2010, 20 (07) : 957 - 966

← 1 2 3 4 5 →