Wide Range Head Pose Estimation Using a Single RGB Camera for Intelligent Surveillance

Cited by: 9
Authors
Rahmaniar, Wahyu [1]
ul Haq, Qazi Mazhar [1]
Lin, Ting-Lan [1, 2]
Affiliations
[1] Natl Taipei Univ Technol, Dept Elect Engn, Taipei 10608, Taiwan
[2] Chung Yuan Christian Univ, Dept Elect Engn, Taoyuan 320314, Taiwan
Keywords
Head; Pose estimation; Feature extraction; Three-dimensional displays; Magnetic heads; Deep learning; Real-time systems; CNN; coarse-fine classification; deep learning; Euler angle; head pose estimation; intelligent surveillance
DOI
10.1109/JSEN.2022.3168863
Chinese Library Classification
TM [Electrical engineering]; TN [Electronic and communication technology]
Discipline classification code
0808; 0809
Abstract
Head pose estimation is a sensing capability required by many intelligent surveillance applications, such as human behavior analysis, intelligent driver assistance, visual attention analysis, and monitoring. These systems require accurate alignment and prediction of head movement direction. Previous methods depend heavily on facial landmarks and depth information. The head pose is usually measured by estimating several keypoints, which requires a correct head pose mapping to obtain accurate results. Moreover, reliance on facial landmarks degrades performance when the face is occluded or not adequately visible. This paper proposes a head pose estimation method that handles various facial conditions, such as occlusion and challenging viewpoints. We combine coarse and fine feature-map classification to train a multi-loss deep convolutional neural network (CNN) that predicts precise Euler angles (yaw, pitch, roll) of the head without keypoints or landmarks. The proposed method uses more quantization units for angle classification to learn coarse and fine structure mappings, yielding better spatial clustering features from an RGB image captured by a single camera. Experiments are performed on benchmark datasets and on head poses in real cases. The mean average error of prediction is 5.06 degrees, 4.06 degrees, and 2.96 degrees on the AFLW2000, AFLW, and BIWI datasets, respectively, which significantly improves head pose estimation performance compared with previous methods. Additionally, the proposed method runs at 11 frames per second, outperforming previous approaches in computation time, which is beneficial for real-life applications.
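The abstract does not give the network architecture, bin widths, or loss weights, so the following is only a minimal sketch of the general idea of coarse-and-fine angle classification with a combined loss. It is not the authors' implementation. The angle range, the 33-degree and 3-degree bin widths, the weight `alpha`, and all function names are assumptions chosen for illustration. Each Euler angle is quantized at several granularities, each granularity contributes a cross-entropy term, and a continuous angle is decoded from the finest bins as a probability-weighted mean of bin centers.

```python
import math

# Assumed angle range in degrees (illustrative; not taken from the paper).
ANGLE_MIN, ANGLE_MAX = -99.0, 99.0


def bin_index(angle, bin_width):
    """Map an angle in degrees to the index of its quantization bin."""
    n_bins = int((ANGLE_MAX - ANGLE_MIN) / bin_width)
    idx = int((angle - ANGLE_MIN) // bin_width)
    return min(max(idx, 0), n_bins - 1)  # clamp to the valid bin range


def softmax(logits):
    """Numerically stable softmax over a list of floats."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def expected_angle(logits, bin_width):
    """Decode a continuous angle as the probability-weighted mean of bin centers."""
    probs = softmax(logits)
    centers = [ANGLE_MIN + (i + 0.5) * bin_width for i in range(len(probs))]
    return sum(p * c for p, c in zip(probs, centers))


def multi_loss(logits_by_width, true_angle, alpha=1.0):
    """Cross-entropy at every granularity plus a weighted squared error.

    logits_by_width maps a bin width (degrees) to the class logits produced
    for that granularity. alpha is a hypothetical regression weight.
    """
    loss = 0.0
    for width, logits in logits_by_width.items():
        probs = softmax(logits)
        loss += -math.log(probs[bin_index(true_angle, width)])
    finest = min(logits_by_width)  # smallest bin width = finest granularity
    pred = expected_angle(logits_by_width[finest], finest)
    return loss + alpha * (pred - true_angle) ** 2


# Usage: one angle (e.g. yaw) with a coarse 33-degree head and a fine 3-degree head.
fine = [0.0] * 66
coarse = [0.0] * 6
fine[bin_index(1.5, 3.0)] = 10.0
coarse[bin_index(1.5, 33.0)] = 10.0
peaked_loss = multi_loss({33.0: coarse, 3.0: fine}, true_angle=1.5)
uniform_loss = multi_loss({33.0: [0.0] * 6, 3.0: [0.0] * 66}, true_angle=1.5)
```

In this sketch the coarse head only has to place the angle in a broad sector, while the fine head refines it; confident, correct logits at both levels give a much smaller loss than uniform logits. In a real system the same computation would be applied separately to yaw, pitch, and roll on the outputs of a CNN.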
Pages: 11112-11121
Page count: 10
Related papers
50 in total
  • [41] Camera Pose Estimation using Frequency Analysis
    Guo, Shuqiang
    Qu, Zhaoyang
    Wang, Liqun
    Guo, Xiaoli
    Zhu, Hongjin
    2014 INTERNATIONAL CONFERENCE ON INFORMATION SCIENCE AND APPLICATIONS (ICISA), 2014,
  • [42] Spacecraft pose estimation using a monocular camera
    1600, International Astronautical Federation, IAF (00):
  • [43] Customer pose estimation using orientational spatio-temporal network from surveillance camera
    Liu, Jingwen
    Gu, Yanlei
    Kamijo, Shunsuke
    MULTIMEDIA SYSTEMS, 2018, 24 (04) : 439 - 457
  • [44] Human Pose Recognition and tracking using RGB-D Camera
    Kahlouche, Souhila
    Ouadah, Noureddine
    Belhocine, Mohmoud
    Boukandoura, Mhamed
    PROCEEDINGS OF 2016 8TH INTERNATIONAL CONFERENCE ON MODELLING, IDENTIFICATION & CONTROL (ICMIC 2016), 2016, : 520 - 525
  • [45] Joint Customer Pose and Orientation Estimation using Deep Neural Network from Surveillance Camera
    Liu, Jingwen
    Gu, Yanlei
    Kamijo, Shunsuke
    PROCEEDINGS OF 2016 IEEE INTERNATIONAL SYMPOSIUM ON MULTIMEDIA (ISM), 2016, : 216 - 221
  • [46] Camera Pose Estimation using Particle Filters
    Herranz, Fernando
    Muthukrishnan, Kavitha
    Langendoen, Koen
    2011 INTERNATIONAL CONFERENCE ON INDOOR POSITIONING AND INDOOR NAVIGATION, 2011,
  • [47] A Unified Deep Framework for Joint 3D Pose Estimation and Action Recognition from a Single RGB Camera
    Huy Hieu Pham
    Salmane, Houssam
    Khoudour, Louahdi
    Crouzil, Alain
    Velastin, Sergio A.
    Zegers, Pablo
    SENSORS, 2020, 20 (07)
  • [48] 3D HEAD POSE ESTIMATION BASED ON GRAPH CONVOLUTIONAL NETWORK FROM A SINGLE RGB IMAGE
    Lie, Wen-Nung
    Yim, Monyneath
    Aing, Lee
    Chiang, Jui-Chiu
    2022 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2022, : 3963 - 3967
  • [49] CAD-based Pose Estimation Design for Random Bin Picking using a RGB-D Camera
    Kai-Tai Song
    Cheng-Hei Wu
    Sin-Yi Jiang
    Journal of Intelligent & Robotic Systems, 2017, 87 : 455 - 470
  • [50] CAD-based Pose Estimation Design for Random Bin Picking using a RGB-D Camera
    Song, Kai-Tai
    Wu, Cheng-Hei
    Jiang, Sin-Yi
    JOURNAL OF INTELLIGENT & ROBOTIC SYSTEMS, 2017, 87 (3-4) : 455 - 470