Head Pose Estimation Based on Multi-Level Feature Fusion

被引:0
|
作者
Yan, Chunman [1 ,2 ]
Zhang, Xiao [1 ]
机构
[1] Northwest Normal Univ, Sch Phys & Elect, Lanzhou 730070, Peoples R China
[2] Engn Res Ctr Gansu Prov Intelligent Informat Techn, Lanzhou 730070, Peoples R China
基金
中国国家自然科学基金;
关键词
Head pose estimation; RepVGG-A2; multi-level feature fusion; attention mechanism; loss function;
D O I
10.1142/S0218001424560020
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Head Pose Estimation (HPE) has a wide range of applications in computer vision, but still faces challenges: (1) Existing studies commonly use Euler angles or quaternions as pose labels, which may lead to discontinuity problems. (2) HPE does not effectively address regression via rotated matrices. (3) There is a low recognition rate in complex scenes, high computational requirements, etc. This paper presents an improved unconstrained HPE model to address these challenges. First, a rotation matrix form is introduced to solve the problem of unclear rotation labels. Second, a continuous 6D rotation matrix representation is used for efficient and robust direct regression. The RepVGG-A2 lightweight framework is used for feature extraction, and by adding a multi-level feature fusion module and a coordinate attention mechanism with residual connection, to improve the network's ability to perceive contextual information and pay attention to features. The model's accuracy was further improved by replacing the network activation function and improving the loss function. Experiments on the BIWI dataset 7:3 dividing the training and test sets show that the average absolute error of HPE for the proposed network model is 2.41. Trained on the dataset 300W_LP and tested on the AFLW2000 and BIWI datasets, the average absolute errors of HPE of the proposed network model are 4.34 and 3.93. The experimental results demonstrate that the improved network has better HPE performance.
引用
收藏
页数:23
相关论文
共 50 条
  • [1] Multi-Level Drowsiness Detection Based on Deep Feature Fusion of Eye and Head Pose
    Ye, Fang
    Li, Shunxin
    Yuan, Xin
    Li, Longfei
    [J]. PROCEEDINGS OF THE 2021 IEEE INTERNATIONAL CONFERENCE ON PROGRESS IN INFORMATICS AND COMPUTING (PIC), 2021, : 107 - 111
  • [2] A Head Pose Estimation Method Based on Multi-feature Fusion
    Zhao, Zhiqiang
    Zheng, Qiaoli
    Zhang, Yan
    Shi, Xin
    [J]. PROCEEDINGS OF 2019 IEEE 7TH INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND COMPUTATIONAL BIOLOGY (ICBCB 2019), 2019, : 150 - 155
  • [3] Multi-level feature fusion and joint refinement for simultaneous object pose estimation and camera localization
    Wang, Junyi
    Qi, Yue
    [J]. NEURAL NETWORKS, 2024, 174
  • [4] Three-Dimensional Human Hand Pose Estimation Based on Finger-Point Reinforcement and Multi-Level Feature Fusion
    Zhang Kaiyi
    Hong Ru
    Gai Shaoyan
    Da Feipeng
    [J]. ACTA OPTICA SINICA, 2022, 42 (19)
  • [5] Face recognition based on pose-variant image synthesis and multi-level multi-feature fusion
    Li, Congcong
    Su, Guangda
    Shang, Yan
    Li, Yingchun
    Xiang, Yan
    [J]. ANALYSIS AND MODELING OF FACES AND GESTURES, PROCEEDINGS, 2007, 4778 : 261 - 275
  • [6] Appearance-based gaze estimation with feature fusion of multi-level information elements
    Ren, Zhonghe
    Fang, Fengzhou
    Hou, Gaofeng
    Li, Zihao
    Niu, Rui
    [J]. JOURNAL OF COMPUTATIONAL DESIGN AND ENGINEERING, 2023, 10 (03) : 1080 - 1109
  • [7] Multi-level structured hybrid forest for joint head detection and pose estimation
    Liu, Yuanyuan
    Xie, Zhong
    Yuan, Xiaohui
    Chen, Jingying
    Song, Wu
    [J]. NEUROCOMPUTING, 2017, 266 : 206 - 215
  • [8] Traffic density estimation via a multi-level feature fusion network
    Ying-Xiang Hu
    Rui-Sheng Jia
    Yong-Chao Li
    Qi Zhang
    Hong-Mei Sun
    [J]. Applied Intelligence, 2022, 52 : 10417 - 10429
  • [9] Traffic density estimation via a multi-level feature fusion network
    Hu, Ying-Xiang
    Jia, Rui-Sheng
    Li, Yong-Chao
    Zhang, Qi
    Sun, Hong-Mei
    [J]. APPLIED INTELLIGENCE, 2022, 52 (09) : 10417 - 10429
  • [10] A Multi-Level Network for Human Pose Estimation
    Shao, Zhanpeng
    Liu, Peng
    Li, Youfu
    Yang, Jianyu
    Zhou, Xiaolong
    [J]. 2021 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2021), 2021, : 13085 - 13091