Wide Range Head Pose Estimation Using a Single RGB Camera for Intelligent Surveillance

被引:9
|
作者
Rahmaniar, Wahyu [1 ]
ul Haq, Qazi Mazhar [1 ]
Lin, Ting-Lan [1 ,2 ]
机构
[1] Natl Taipei Univ Technol, Dept Elect Engn, Taipei 10608, Taiwan
[2] Chung Yuan Christian Univ, Dept Elect Engn, Taoyuan 320314, Taiwan
关键词
Head; Pose estimation; Feature extraction; Three-dimensional displays; Magnetic heads; Deep learning; Real-time systems; CNN; coarse-fine classification; deep learning; Euler angle; head pose estimation; intelligent surveillance;
D O I
10.1109/JSEN.2022.3168863
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Head pose estimation is one of the sensing systems needed for some intelligent surveillance, such as human behavior analysis, intelligent driver assistance, visual attention, and monitoring. These systems require accurate alignment and head movement direction prediction. The previous methods are greatly dependent on the facial landmarks and depth information. Usually, the head pose is measured by estimating several keypoints that require a correct head pose mapping to get accurate results. Moreover, facial landmarks have a detrimental effect on performance when the face is occluded or not adequately visualized. This paper proposes a method for head pose estimation of various facial conditions, such as occlusion and challenging viewpoints. We present a combination of coarse and fine feature maps classification to train a multi-loss deep Convolutional Neural Networks (CNN) to get precise Euler angles (yaw, pitch, roll) of the head position without keypoints and landmarks. Our proposed method uses more quantization units for angle classification to learn coarse and fine structure mapping for better spatial clustering features on an RGB image of a single camera. The experiments are performed on benchmark datasets and some head poses in real cases. The mean average error of prediction is 5.06 degrees, 4.06 degrees, and 2.96 degrees, for the AFLW2000, AFLW, and BIWI datasets, which significantly improves the head pose estimation performance compared to the previous methods. Additionally, the proposed method outperforms previous approaches in computation time of 11 frames per second that is beneficial for real-life applications.
引用
收藏
页码:11112 / 11121
页数:10
相关论文
共 50 条
  • [31] SASE: RGB-Depth Database for Human Head Pose Estimation
    Lusi, Iiris
    Escarela, Sergio
    Anbarjafari, Gholamreza
    COMPUTER VISION - ECCV 2016 WORKSHOPS, PT III, 2016, 9915 : 325 - 336
  • [32] Wide Range Time of Flight Camera for Outdoor Surveillance
    Falie, D.
    Buzuloiu, V.
    MRRS: 2008 MICROWAVES, RADAR AND REMOTE SENSING SYMPOSIUM, PROCEEDINGS, 2008, : 79 - 82
  • [33] Robust Robot Pose Estimation for Challenging Scenes With an RGB-D Camera
    Yu, Hongshan
    Fu, Qiang
    Yang, Zhengeng
    Tan, Lei
    Sun, Wei
    Sun, Mingui
    IEEE SENSORS JOURNAL, 2019, 19 (06) : 2217 - 2229
  • [34] Accurate Shoulder Joint Angle Estimation Using Single RGB camera for Rehabilitation
    Yahya, Muhammad
    Shah, Jawad
    Kadir, Kushsairy
    Warsi, Arif
    Khan, Sheroz
    Nasir, Haidawati
    2019 IEEE INTERNATIONAL INSTRUMENTATION AND MEASUREMENT TECHNOLOGY CONFERENCE (I2MTC), 2019, : 758 - 763
  • [35] UAV Pose Estimation Using IR and RGB Cameras
    Su, Wang
    Ravankar, Abhijeet
    Ravankar, Ankit A.
    Kobayashi, Yukinori
    Emaru, Takanori
    2017 IEEE/SICE INTERNATIONAL SYMPOSIUM ON SYSTEM INTEGRATION (SII), 2017, : 151 - 156
  • [36] Head Pose Estimation in First-Person Camera Views
    Alletto, Stefano
    Serra, Giuseppe
    Calderara, Simone
    Cucchiara, Rita
    2014 22ND INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2014, : 4188 - 4193
  • [37] Slice representation of range data for head pose estimation
    Tang, Yunqi
    Sun, Zhenan
    Tan, Tieniu
    COMPUTER VISION AND IMAGE UNDERSTANDING, 2014, 128 : 18 - 35
  • [38] Planar Pose Estimation using a Camera and Single-Station Ranging Measurements
    Zhu, Chen
    Giorgi, Gabriele
    Guenther, Christoph
    PROCEEDINGS OF THE 30TH INTERNATIONAL TECHNICAL MEETING OF THE SATELLITE DIVISION OF THE INSTITUTE OF NAVIGATION (ION GNSS+ 2017), 2017, : 2531 - 2540
  • [39] Customer pose estimation using orientational spatio-temporal network from surveillance camera
    Jingwen Liu
    Yanlei Gu
    Shunsuke Kamijo
    Multimedia Systems, 2018, 24 : 439 - 457
  • [40] Human Pose Recognition and tracking using RGB-D Camera
    Moussaoui, Imene
    Boubertakh, Hamid
    PROCEEDINGS OF 2016 8TH INTERNATIONAL CONFERENCE ON MODELLING, IDENTIFICATION & CONTROL (ICMIC 2016), 2016, : 515 - 519