RGBD-based Hardware Friendly Head Pose Estimation System via Convolutional attention module

被引:0
|
作者
Cheng, Yen-Yu [1 ]
Chiu, Ching-Te [1 ]
Chen, Yi-Fan [1 ]
机构
[1] Natl Tsing Hua Univ, Dept Comp Sci, Hsinchu, Taiwan
关键词
Head pose estimation; RGB-D; Attention module; multi-loss strategy; feature decoupler;
D O I
10.1109/ISCAS48785.2022.9937330
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Head pose estimation from RGB images without depth information is a challenging task, owing to the loss of spatial information and large head pose variations in the wild. However, most studies adopt deeper convolutional neural network (CNN) models, such as ResNet50, which are limited by the enormous number of parameters to be implemented on edge devices. Owing to novel technological advancements, several edge devices have also included depth cameras and obtain highquality images. In this study, we propose a lightweight CNN for head pose estimation. By adopting attention module and feature decoupler, we resume the performance decreasing by lower parameters. Moreover, we classify the ground-truth head pose angles of the model intermittently, and adopt the multiloss strategy to train our model. We evaluate the proposed method on three challenging benchmark datasets, and achieved optimal results for Yaw pose and average. The obtained results indicate that although the proposed model has less parameters, it still maintains a remarkable performance. The total number of parameters is 0.19 M, including RGB and depth path, which is 50% lower than FSA-Net. Consequently, the inference speed is 0.92 ms per pair RGB-D images, which is 8% faster than FSA-Net. With fewer parameters, we achieved 3.1 MAE on yaw angle, which is 22.69% lower than that of Quatnet, including 3.5 MAE on average, which is 7.40% lower than those of other advanced methods.
引用
收藏
页码:2715 / 2719
页数:5
相关论文
共 50 条
  • [1] Real-time RGBD-based Extended Body Pose Estimation
    Bashirov, Renat
    Ianina, Anastasia
    Iskakov, Karim
    Kononenko, Yevgeniy
    Strizhkova, Valeriya
    Lempitsky, Victor
    Vakhitov, Alexander
    [J]. 2021 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION WACV 2021, 2021, : 2806 - 2815
  • [2] Pose Aware RGBD-based Face Recognition System With Hierarchical Bilinear Pooling
    Wu, Shu-Yun
    Chiu, Ching-Te
    Hsu, Yung-Ching
    [J]. 2023 21ST IEEE INTERREGIONAL NEWCAS CONFERENCE, NEWCAS, 2023,
  • [3] A Dataset for Improved RGBD-Based Object Detection and Pose Estimation for Warehouse Pick-and-Place
    Rennie, Colin
    Shome, Rahul
    Bekris, Kostas E.
    De Souza, Alberto F.
    [J]. IEEE ROBOTICS AND AUTOMATION LETTERS, 2016, 1 (02) : 1179 - 1185
  • [4] RGBD-Based Real-Time 3D Human Pose Estimation for Fitness Assessment
    Jiang, Yujie
    Cao, Chuang
    Zhu, Xiaoxiao
    Ma, Yanhong
    Cao, Qixin
    [J]. 2020 3RD WORLD CONFERENCE ON MECHANICAL ENGINEERING AND INTELLIGENT MANUFACTURING (WCMEIM 2020), 2020, : 103 - 108
  • [5] Human pose estimation based on frequency domain and attention module
    Zhou, Shuren
    Duan, Xinlan
    Zhou, Jiarui
    [J]. NEUROCOMPUTING, 2024, 604
  • [6] Head Pose Estimation Based on Robust Convolutional Neural Network
    Bao, Jiao
    Ye, Mao
    [J]. CYBERNETICS AND INFORMATION TECHNOLOGIES, 2016, 16 (06) : 133 - 145
  • [7] Monocular depth estimation via convolutional neural network with attention module
    Lan, Lingling
    Zhang, Yaping
    Yang, Yuwei
    [J]. Journal of Physics: Conference Series, 2021, 2025 (01):
  • [8] AGCNNs: Attention-guided convolutional neural networks for infrared head pose estimation in assisted driving system
    Ju, Jianping
    Zheng, Hong
    Li, Congcong
    Li, Xi
    Liu, Hai
    Liu, Tingting
    [J]. INFRARED PHYSICS & TECHNOLOGY, 2022, 123
  • [9] Multiclass classification based on a deep convolutional network for head pose estimation
    Ying Cai
    Meng-long Yang
    Jun Li
    [J]. Frontiers of Information Technology & Electronic Engineering, 2015, 16 : 930 - 939
  • [10] Multiclass classification based on a deep convolutional network for head pose estimation
    Cai, Ying
    Yang, Meng-long
    Li, Jun
    [J]. FRONTIERS OF INFORMATION TECHNOLOGY & ELECTRONIC ENGINEERING, 2015, 16 (11) : 930 - 939