NOSE, EYES AND EARS: HEAD POSE ESTIMATION BY LOCATING FACIAL KEYPOINTS

Cited by: 0
Authors
Gupta, Aryaman [1]
Thakkar, Kalpit [1]
Gandhi, Vineet [1]
Narayanan, P. J. [1]
Affiliations
[1] IIIT Hyderabad, KCIS, Ctr Visual Informat Technol, Hyderabad, India
Keywords
Image analysis; Pose estimation;
DOI
Not available
Chinese Library Classification (CLC) number
O42 [Acoustics]
Subject classification codes
070206; 082403
Abstract
Monocular head pose estimation requires learning a model that computes the intrinsic Euler angles for pose (yaw, pitch, roll) from an input image of a human face. Annotating ground truth head pose angles for images in the wild is difficult and requires ad-hoc fitting procedures (which provide only coarse and approximate annotations). This highlights the need for approaches that can train on data captured in a controlled environment and generalize to images in the wild (with varying appearance and illumination of the face). Most present-day deep learning approaches, which learn a regression function directly on the input images, fail to do so. To this end, we propose to use a higher-level representation to regress the head pose while using deep learning architectures. More specifically, we use uncertainty maps in the form of 2D soft localization heatmap images over five facial keypoints, namely left ear, right ear, left eye, right eye and nose, and pass them through a convolutional neural network to regress the head pose. We report head pose estimation results on two challenging benchmarks, BIWI and AFLW; our approach surpasses the state of the art on both datasets.
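As a rough illustration of the intermediate representation described in the abstract, the sketch below builds one soft localization heatmap per facial keypoint and stacks them into a five-channel array that a convolutional network could take as input. The abstract does not specify how the heatmaps are produced, so the isotropic Gaussian form, the 64x64 resolution, the `sigma` value, the all-zero channel for an occluded keypoint, and the names `KEYPOINTS` and `keypoint_heatmaps` are all assumptions made for illustration, not the authors' implementation.

```python
import numpy as np

# The five keypoints named in the abstract; the channel order is an assumption.
KEYPOINTS = ("left_ear", "right_ear", "left_eye", "right_eye", "nose")


def keypoint_heatmaps(coords, size=64, sigma=2.0):
    """Return a (5, size, size) stack with one Gaussian heatmap per keypoint.

    coords maps a keypoint name to its (x, y) pixel position, or to None
    when the keypoint is not visible (hypothetical convention).
    """
    ys, xs = np.mgrid[0:size, 0:size]
    maps = np.zeros((len(KEYPOINTS), size, size), dtype=np.float32)
    for i, name in enumerate(KEYPOINTS):
        point = coords.get(name)
        if point is None:
            continue  # occluded keypoint: leave its channel all zeros
        x, y = point
        # Unnormalized Gaussian that peaks at 1.0 on the keypoint location.
        maps[i] = np.exp(-((xs - x) ** 2 + (ys - y) ** 2) / (2.0 * sigma ** 2))
    return maps


# Made-up coordinates for a face turned so that the right ear is hidden.
example = {
    "left_ear": (12, 32),
    "right_ear": None,
    "left_eye": (24, 28),
    "right_eye": (40, 28),
    "nose": (32, 38),
}
heatmaps = keypoint_heatmaps(example)
```

A pose regressor in this setting would consume `heatmaps` rather than raw pixels, which is what lets the representation stay independent of facial appearance and illumination.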
Pages: 1977-1981
Number of pages: 5