An Effective Deep Network for Head Pose Estimation without Keypoints

被引:0
|
作者
Thai, Chien [1 ]
Tran, Viet [1 ]
Bui, Minh [1 ]
Ninh, Huong [1 ]
Tran, Hai [1 ]
机构
[1] Viettel Aerosp Inst, Optoelect Ctr, Comp Vis Dept, Hanoi, Vietnam
关键词
Head Pose Estimation; Knowledge Distillation; Convolutional Neural Network; SUPPORT VECTOR MACHINES; FACE; CASCADE;
D O I
10.5220/0010870900003122
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Human head pose estimation is an essential problem in facial analysis in recent years that has a lot of computer vision applications such as gaze estimation, virtual reality, driver assistance. Because of the importance of the head pose estimation problem, it is necessary to design a compact model to resolve this task in order to reduce the computational cost when deploying on facial analysis-based applications such as large camera surveillance systems, AI cameras while maintaining accuracy. In this work, we propose a lightweight model that effectively addresses the head pose estimation problem. Our approach has two main steps. 1) We first train many teacher models on the synthesis dataset - 300W-LPA to get the head pose pseudo labels. 2) We design an architecture with the ResNet18 backbone and train our proposed model with the ensemble of these pseudo labels via the knowledge distillation process. To evaluate the effectiveness of our model, we use AFLW-2000 and BIWI - two real-world head pose datasets. Experimental results show that our proposed model significantly improves the accuracy in comparison with the state-of-the-art head pose estimation methods. Furthermore, our model has the real-time speed of similar to 300 FPS when inferring on Tesla V100.
引用
收藏
页码:90 / 98
页数:9
相关论文
共 50 条
  • [41] Camera Pose Estimation using Human Head Pose Estimation
    Fischer, Robert
    Hoedlmoser, Michael
    Gelautz, Margrit
    PROCEEDINGS OF THE 17TH INTERNATIONAL JOINT CONFERENCE ON COMPUTER VISION, IMAGING AND COMPUTER GRAPHICS THEORY AND APPLICATIONS (VISAPP), VOL 4, 2022, : 877 - 886
  • [42] Deep Head Pose Estimation for Faces in the Wild and Its Transfer Learning
    Hanh Tran Thi Bao
    Kim, Yong-Guk
    2017 SEVENTH INTERNATIONAL CONFERENCE ON INFORMATION SCIENCE AND TECHNOLOGY (ICIST2017), 2017, : 187 - 193
  • [43] Deep Head Pose: Gaze-Direction Estimation in Multimodal Video
    Mukherjee, Sankha S.
    Robertson, Neil Martin
    IEEE TRANSACTIONS ON MULTIMEDIA, 2015, 17 (11) : 2094 - 2107
  • [44] DEEP REGRESSION FOREST WITH SOFT-ATTENTION FOR HEAD POSE ESTIMATION
    Ma, Xiangtian
    Sang, Nan
    Wang, Xupeng
    Xiao, Shihua
    2020 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2020, : 2840 - 2844
  • [45] Detecting Arbitrary Intermediate Keypoints for Human Pose Estimation with Vision Transformers
    Ludwig, Katja
    Harzig, Philipp
    Lienhart, Rainer
    2022 IEEE/CVF WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION WORKSHOPS (WACVW 2022), 2022, : 663 - 671
  • [46] Deep learning and machine learning techniques for head pose estimation: a survey
    Algabri, Redhwan
    Abdu, Ahmed
    Lee, Sungon
    ARTIFICIAL INTELLIGENCE REVIEW, 2024, 57 (10)
  • [47] SRPose: Two-View Relative Pose Estimation with Sparse Keypoints
    Yin, Rui
    Zhang, Yulun
    Pan, Zherong
    Zhu, Jianjun
    Wang, Cheng
    Jia, Biao
    COMPUTER VISION - ECCV 2024, PT LXXXIII, 2025, 15141 : 88 - 107
  • [48] A multilevel object pose estimation algorithm based on point cloud keypoints
    Haibo Yang
    Junying Jia
    Xin Lu
    Applied Intelligence, 2023, 53 : 18508 - 18516
  • [49] A multilevel object pose estimation algorithm based on point cloud keypoints
    Yang, Haibo
    Jia, Junying
    Lu, Xin
    APPLIED INTELLIGENCE, 2023, 53 (15) : 18508 - 18516
  • [50] Camera Pose Estimation Method Based on Deep Neural Network
    Tang Xia Qing
    Wu Fan
    Zong Yan Tao
    ICDLT 2019: 2019 3RD INTERNATIONAL CONFERENCE ON DEEP LEARNING TECHNOLOGIES, 2019, : 85 - 90