Human Pose Estimation by a Series of Residual Auto-Encoders

被引:2
|
作者
Farrajota, M. [1 ]
Rodrigues, Joao M. F. [1 ]
du Buf, J. M. H. [1 ]
机构
[1] Univ Algarve, LARSyS, Vis Lab, P-8005139 Faro, Portugal
关键词
Human pose; ConvNet; Neural networks; Auto-encoders;
D O I
10.1007/978-3-319-58838-4_15
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Pose estimation is the task of predicting the pose of an object in an image or in a sequence of images. Here, we focus on articulated human pose estimation in scenes with a single person. We employ a series of residual auto-encoders to produce multiple predictions which are then combined to provide a heatmap prediction of body joints. In this network topology, features are processed across all scales which captures the various spatial relationships associated with the body. Repeated bottom-up and top-down processing with intermediate supervision for each auto-encoder network is applied. We propose some improvements to this type of regression-based networks to further increase performance, namely: (a) increase the number of parameters of the auto-encoder networks in the pipeline, (b) use stronger regularization along with heavy data augmentation, (c) use sub-pixel precision for more precise joint localization, and (d) combine all auto-encoders output heatmaps into a single prediction, which further increases body joint prediction accuracy. We demonstrate state-of-the-art results on the popular FLIC and LSP datasets.
引用
收藏
页码:131 / 139
页数:9
相关论文
共 50 条
  • [41] Improved Denoising Auto-encoders for Image Denoising
    Xiang, Qian
    Pang, Xuliang
    2018 11TH INTERNATIONAL CONGRESS ON IMAGE AND SIGNAL PROCESSING, BIOMEDICAL ENGINEERING AND INFORMATICS (CISP-BMEI 2018), 2018,
  • [42] Cascaded Denoising Convolutional Auto-Encoders for Automatic Recovery of Missing Time Series Data
    Chen, Yuanyi
    Wang, Yubin
    Yang, Qiang
    2020 19TH INTERNATIONAL SYMPOSIUM ON DISTRIBUTED COMPUTING AND APPLICATIONS FOR BUSINESS ENGINEERING AND SCIENCE (DCABES 2020), 2020, : 283 - 286
  • [43] TP-AE: Temporally Primed 6D Object Pose Tracking with Auto-Encoders
    Zheng, Linfang
    Leonardis, Ales
    Tse, Tze Ho Elden
    Horanyi, Nora
    Chen, Hua
    Zhang, Wei
    Chang, Hyung Jin
    2022 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION, ICRA 2022, 2022, : 10616 - 10623
  • [44] An Auditory Measure for Anomaly Detection based on Auto-encoders
    Liu, Tao
    Duan, Meiqian
    Sun, Luyang
    Zhang, Bo
    2022 ASIA CONFERENCE ON ALGORITHMS, COMPUTING AND MACHINE LEARNING (CACML 2022), 2022, : 109 - 114
  • [45] Embarrassingly shallow auto-encoders for dynamic collaborative filtering
    Olivier Jeunen
    Jan Van Balen
    Bart Goethals
    User Modeling and User-Adapted Interaction, 2022, 32 : 509 - 541
  • [46] Automatic selection of latent variables in variational auto-encoders
    Jouffroy, Emma
    Giremus, Audrey
    Berthoumieu, Yannick
    Bach, Olivier
    Hugget, Alain
    2022 30TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO 2022), 2022, : 1407 - 1411
  • [47] Stacked Convolutional Auto-Encoders for Steganalysis of Digital Images
    Tan, Shunquan
    Li, Bin
    2014 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA), 2014,
  • [48] Explicit guiding auto-encoders for learning meaningful representation
    Yanan Sun
    Hua Mao
    Yongsheng Sang
    Zhang Yi
    Neural Computing and Applications, 2017, 28 : 429 - 436
  • [49] Learning Robust Auto-Encoders With Regularizer for Linearity and Sparsity
    Shi, Yong
    Lei, Minglong
    Ma, Rongrong
    Niu, Lingfeng
    IEEE ACCESS, 2019, 7 : 17195 - 17206
  • [50] EXPLORING CONVOLUTIONAL AUTO-ENCODERS FOR REPRESENTATION LEARNING ON NETWORKS
    Nerurkar, Pranav
    Chandane, Madhav
    Bhirud, Sunil
    COMPUTER SCIENCE-AGH, 2019, 20 (03): : 350 - 365