Multiclass classification based on a deep convolutional network for head pose estimation

Cited by: 13
Authors
Cai, Ying [1 ,2 ]
Yang, Meng-long [3 ]
Li, Jun [2 ]
Affiliations
[1] Sichuan Univ, Sch Comp Sci, Chengdu 610065, Peoples R China
[2] Sichuan Agr Univ, Coll Informat Engn, Yaan 625014, Peoples R China
[3] Sichuan Univ, Sch Aeronaut & Astronaut, Chengdu 610065, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
Head pose estimation; Deep convolutional neural network; Multiclass classification; RECOGNITION; POINT;
DOI
10.1631/FITEE.1500125
Chinese Library Classification number
TP [Automation technology, computer technology];
Discipline classification code
0812 ;
Abstract
Head pose estimation is an important and challenging task in computer vision. In this paper we propose a novel method to estimate head pose from 2D face images, based on a deep convolutional neural network (DCNN). We design a simple and effective method to roughly crop the face from the input image while preserving each individual's facial feature proportions. The method can be used across various poses. Two convolutional neural networks are then set up to train the head pose classifier and are compared with each other. The simpler one has six layers. It performs well on seven yaw poses but is less satisfactory when two pitch poses are added. The other has eight layers and more pixels in its input layer. It performs better with more poses and more training samples. Before training the network, two strategies, shift and zoom, are applied to prepare the training samples. Finally, the feature extraction filters are optimized together with the weights of the classification component through training, to minimize the classification error. Our method has been evaluated on the CAS-PEAL-R1, CMU PIE, and CUBIC FacePix databases. It outperforms state-of-the-art methods for head pose estimation.
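The abstract names shift and zoom as the two strategies for preparing training samples but gives no parameters. The following is a minimal sketch of what such a step might look like, not the authors' implementation. The function names, the pixel offsets, the zoom factors, the edge-replication fill, and the nearest-neighbour resampling are all assumptions made for illustration.

```python
from typing import List, Sequence

Image = List[List[int]]  # grayscale image as rows of pixel values


def shift(img: Image, dx: int, dy: int) -> Image:
    """Translate img by (dx, dy) pixels, replicating edge pixels to fill gaps."""
    h, w = len(img), len(img[0])
    return [
        [img[min(max(y - dy, 0), h - 1)][min(max(x - dx, 0), w - 1)]
         for x in range(w)]
        for y in range(h)
    ]


def zoom(img: Image, factor: float) -> Image:
    """Scale img about its centre by `factor`, keeping the output size.

    Uses nearest-neighbour sampling (an assumption; the abstract does not say).
    """
    h, w = len(img), len(img[0])
    cy, cx = (h - 1) / 2, (w - 1) / 2
    out = []
    for y in range(h):
        sy = min(max(round(cy + (y - cy) / factor), 0), h - 1)
        row = []
        for x in range(w):
            sx = min(max(round(cx + (x - cx) / factor), 0), w - 1)
            row.append(img[sy][sx])
        out.append(row)
    return out


def augment(
    img: Image,
    shifts: Sequence[int] = (-1, 0, 1),         # hypothetical pixel offsets
    factors: Sequence[float] = (0.9, 1.0, 1.1),  # hypothetical zoom factors
) -> List[Image]:
    """Return every shift/zoom combination of one cropped face image."""
    return [
        zoom(shift(img, dx, dy), f)
        for dx in shifts
        for dy in shifts
        for f in factors
    ]
```

With the default arguments, each cropped face yields 27 variants that all share the original pose label, which is one plausible way a classifier could be given more samples per pose class.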
Pages: 930 - 939
Number of pages: 10
Related papers
50 in total
  • [1] Multiclass classification based on a deep convolutional network for head pose estimation
    Ying Cai
    Meng-long Yang
    Jun Li
    Frontiers of Information Technology & Electronic Engineering, 2015, 16 : 930 - 939
  • [2] Deep convolutional neural network-based Bernoulli heatmap for head pose estimation
    Hu, Zhongxu
    Xing, Yang
    Lv, Chen
    Hang, Peng
    Liu, Jie
    NEUROCOMPUTING, 2021, 436 : 198 - 209
  • [3] Hybrid Deep Convolutional Network for Face Alignment and Head Pose Estimation
    Wang, Zhiyong
    Liu, Jingjing
    Liu, Honghai
    INTELLIGENT ROBOTICS AND APPLICATIONS (ICIRA 2022), PT II, 2022, 13456 : 513 - 522
  • [4] Head Pose Estimation Based on Robust Convolutional Neural Network
    Bao, Jiao
    Ye, Mao
    CYBERNETICS AND INFORMATION TECHNOLOGIES, 2016, 16 (06) : 133 - 145
  • [5] Head Pose Estimation Using Convolutional Neural Network
    Lee, Seungsu
    Saitoh, Takeshi
    IT CONVERGENCE AND SECURITY 2017, VOL 1, 2018, 449 : 164 - 171
  • [6] Deep Transfer Feature Based Convolutional Neural Forests for Head Pose Estimation
    Liu, Yuanyuan
    Xie, Zhong
    Gong, Xi
    Fang, Fang
    IMAGE AND VIDEO TECHNOLOGY (PSIVT 2017), 2018, 10799 : 5 - 16
  • [7] Head Pose Estimation Based on Multi-Scale Convolutional Neural Network
    Liang Lingyu
    Zhang Tiantian
    He Wei
    LASER & OPTOELECTRONICS PROGRESS, 2019, 56 (13)
  • [8] An effective multiclass skin cancer classification approach based on deep convolutional neural network
    Houssein, Essam H.
    Abdelkareem, Doaa A.
    Hu, Gang
    Hameed, Mohamed Abdel
    Ibrahim, Ibrahim A.
    Younan, Mina
    CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2024, 27 (09) : 12799 - 12819
  • [9] Dance Action Recognition and Pose Estimation Based on Deep Convolutional Neural Network
    Zhu, Fengling
    Zhu, Ruichao
    TRAITEMENT DU SIGNAL, 2021, 38 (02) : 529 - 538
  • [10] Multi-person pose estimation based on a deep convolutional neural network
    Duan, Peng
    Wang, Tingwei
    Cui, Maowei
    Sang, Hongyan
    Sun, Qun
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2019, 62 : 245 - 252