Human pose regression by combining indirect part detection and contextual information

Cited by: 113
Authors
Luvizon, Diogo C. [1, 2]
Tabia, Hedi [1, 3]
Picard, David [1, 4]
Affiliations
[1] Paris Seine Univ, CNRS, ENSEA, ETIS UMR 8051, F-95000 Cergy, France
[2] Samsung Res Inst, Adv Technol, Campinas, SP, Brazil
[3] Univ Paris Saclay, IBISC, Univ Evry Val Essonne, Paris, France
[4] UPE, Ecole Ponts, UMR 8049, LIGM, Champs Sur Marne, France
Source
COMPUTERS & GRAPHICS-UK | 2019 / Vol. 85
Keywords
Human pose estimation; Neural nets; vision; PICTORIAL STRUCTURES;
DOI
10.1016/j.cag.2019.09.002
Chinese Library Classification number
TP31 [Computer software];
Discipline classification code
081202 ; 0835 ;
Abstract
In this paper, we tackle the problem of human pose estimation from still images, which is a very active topic, especially due to its many applications, from image annotation to human-machine interfaces. We use the soft-argmax function to convert feature maps directly to body joint coordinates, resulting in a fully differentiable framework. Our method is able to learn heat map representations indirectly, without an additional step of artificial ground truth generation. Consequently, contextual information can be incorporated into the pose predictions in a seamless way. We evaluated our method on two challenging datasets, the Leeds Sports Poses (LSP) and the MPII Human Pose datasets, reaching the best performance among all existing regression methods. (C) 2019 Elsevier Ltd. All rights reserved.
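The soft-argmax step named in the abstract can be illustrated with a minimal NumPy sketch. This is not the authors' implementation: the function name `soft_argmax_2d`, the pixel-center coordinate convention, and the normalization of coordinates to [0, 1] are assumptions made here for illustration. The idea is to apply a softmax over all spatial positions of a feature map and then take the expected (x, y) position under that distribution, which is differentiable, unlike a hard argmax.

```python
import numpy as np


def soft_argmax_2d(feature_map):
    """Return the expected (x, y) position of a 2D map, normalized to [0, 1].

    Illustrative sketch only; coordinate conventions are assumptions.
    """
    h, w = feature_map.shape
    # Spatial softmax; subtracting the max keeps exp() numerically stable.
    z = feature_map - feature_map.max()
    p = np.exp(z)
    p /= p.sum()
    # Normalized pixel-center coordinates along each axis (assumed convention).
    xs = (np.arange(w) + 0.5) / w
    ys = (np.arange(h) + 0.5) / h
    # Expectation of each coordinate under the marginal distributions.
    x = float((p.sum(axis=0) * xs).sum())
    y = float((p.sum(axis=1) * ys).sum())
    return x, y


# A sharply peaked map at row 1, column 2 of a 4x4 grid.
peaked = np.zeros((4, 4))
peaked[1, 2] = 50.0
peak_x, peak_y = soft_argmax_2d(peaked)

# A flat map has no preferred location, so the expectation is the center.
flat_x, flat_y = soft_argmax_2d(np.zeros((4, 4)))
```

For a sharply peaked map the result approaches the location of the maximum, while a flat map yields the center of the grid. Because the output is a smooth function of the input, a loss on the coordinates can be backpropagated into the feature maps, which is why no artificial heat map ground truth is needed.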
Pages: 15-22
Page count: 8
Related papers
50 in total
  • [41] Combining models of pose and dynamics for human motion recognition
    Filipovych, Roman
    Ribeiro, Eraldo
    ADVANCES IN VISUAL COMPUTING, PROCEEDINGS, PT 2, 2007, 4842 : 21 - 32
  • [42] COMBINING MULTIMODAL AND TEMPORAL CONTEXTUAL INFORMATION FOR SEMANTIC VIDEO ANALYSIS
    Papadopoulos, Georgios Th.
    Mezaris, Vasileios
    Kompatsiaris, Ioannis
    Strintzis, Michael G.
    2009 16TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, VOLS 1-6, 2009, : 4325 - +
  • [43] Combining Polarimetric and Contextual information using Autoassociative Neural Networks
    Avezzano, Ruggero Giuseppe
    Del Frate, Fabio
    Schiavon, Giovanni
    SAR IMAGE ANALYSIS, MODELING, AND TECHNIQUES XIII, 2013, 8891
  • [44] Combining Visual and Contextual Information for Fraudulent Online Store Classification
    Mostard, Wouter
    Zijlema, Bastiaan
    Wiering, Marco
    2019 IEEE/WIC/ACM INTERNATIONAL CONFERENCE ON WEB INTELLIGENCE (WI 2019), 2019, : 84 - 90
  • [45] Human fall detection algorithm combining information enhancement and feature fusion
    Wang, Fengsui
    Shao, Kaili
    Yang, Haiyan
    Zhongguo Guanxing Jishu Xuebao/Journal of Chinese Inertial Technology, 2024, 32 (08): : 771 - 778
  • [46] Global contextual attention for pure regression object detection
    Fan, Bingbing
    Shao, Mingwen
    Li, Yunhao
    Li, Cunhe
    INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2022, 13 (08) : 2189 - 2197
  • [47] Global contextual attention for pure regression object detection
    Bingbing Fan
    Mingwen Shao
    Yunhao Li
    Cunhe Li
    International Journal of Machine Learning and Cybernetics, 2022, 13 : 2189 - 2197
  • [48] Contextual and Human Factors in Information Fusion
    Garcia Herrero, Jesus
    Patricio, Miguel A.
    Molina, Jose M.
    Cardoso, Luiz A.
    HUMAN SYSTEMS INTEGRATION TO ENHANCE MARITIME DOMAIN AWARENESS FOR PORT/HARBOUR SECURITY, 2010, 28 : 79 - 92
  • [49] Action Detection System Based on Pose Information
    Kawai, Ryo
    Yoshida, Noboru
    Liu, Jianquan
    PROCEEDINGS OF THE 4TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA IN ASIA, MMASIA 2022, 2022,
  • [50] Human Pose Regression Through Multiview Visual Fusion
    Zhao, Xu
    Fu, Yun
    Ning, Huazhong
    Liu, Yuncai
    Huang, Thomas S.
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2010, 20 (07) : 957 - 966