Anisotropic angle distribution learning for head pose estimation and attention understanding in human-computer interaction

Cited by: 118
Authors
Liu, Hai [1, 2, 3]
Nie, Hanwen [1]
Zhang, Zhaoli [1]
Li, You-Fu [3]
Affiliations
[1] Cent China Normal Univ, Natl Engn Res Ctr E Learning, Wuhan 430079, Peoples R China
[2] UCL, UCL Interact Ctr, London, England
[3] City Univ Hong Kong, Dept Mech Engn, Kowloon, Hong Kong, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
Head pose estimation; Anisotropic angle distribution; Convolutional neural network; Regularization; Learning behavior analysis; Human-computer interaction;
DOI
10.1016/j.neucom.2020.09.068
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification code
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Head pose estimation is an important way to understand human attention in human-computer interaction. In this paper, we propose a novel anisotropic angle distribution learning (AADL) network for the head pose estimation task. First, two key findings are revealed: 1) for the same increase in pose angle from a fixed central pose, head pose image variations differ between the yaw and pitch directions; 2) as the fixed angle interval increases, the image variations in the yaw direction first increase and then decrease. Then, the maximum a posteriori technique is employed to construct the head pose estimation network, which comprises three parts: a convolutional layer, a covariance pooling layer, and an output layer. In the output layer, the labels are constructed as anisotropic angle distributions on the basis of the two key findings, and these distributions are fitted by 2D Gaussian-like distributions (ground-truth labels). Furthermore, the Kullback-Leibler divergence is selected to measure the discrepancy between the predicted label and the ground-truth one. The features of head pose images are learned by the AADL-based convolutional neural network in an end-to-end manner. Experimental results demonstrate that the developed AADL-based labels have several advantages, such as robustness to missing head pose images and insensitivity to motion blur. Moreover, the proposed method achieves good performance compared with several state-of-the-art methods on the Pointing'04 and CAS_PEAL_R1 databases. (c) 2020 Elsevier B.V. All rights reserved.
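The label construction and loss described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the uniform 15-degree angle grid, the fixed per-axis spreads `sigma_yaw` and `sigma_pitch`, and the function names `anisotropic_label` and `kl_divergence` are all assumptions chosen for illustration. The abstract only states that the labels are 2D Gaussian-like, anisotropic between yaw and pitch, and compared with the prediction via the Kullback-Leibler divergence.

```python
import math


def anisotropic_label(yaw_c, pitch_c, yaw_bins, pitch_bins, sigma_yaw, sigma_pitch):
    """Discrete 2D Gaussian-like label centred on the pose (yaw_c, pitch_c).

    Using a different spread per axis makes the label anisotropic.
    Returns a dict {(yaw, pitch): probability} whose values sum to 1.
    The fixed sigmas are an illustrative assumption, not values from the paper.
    """
    weights = {}
    for y in yaw_bins:
        for p in pitch_bins:
            z = ((y - yaw_c) / sigma_yaw) ** 2 + ((p - pitch_c) / sigma_pitch) ** 2
            weights[(y, p)] = math.exp(-0.5 * z)
    total = sum(weights.values())
    return {pose: w / total for pose, w in weights.items()}


def kl_divergence(target, predicted, eps=1e-12):
    """KL(target || predicted) for two distributions over the same pose grid."""
    return sum(
        t * math.log((t + eps) / (predicted[pose] + eps))
        for pose, t in target.items()
        if t > 0.0
    )


# Hypothetical pose grid: yaw and pitch from -90 to 90 degrees in 15-degree steps.
yaw_bins = list(range(-90, 91, 15))
pitch_bins = list(range(-90, 91, 15))

# Ground-truth label for a frontal pose, spread wider along yaw than pitch (assumed).
label = anisotropic_label(0, 0, yaw_bins, pitch_bins, sigma_yaw=20.0, sigma_pitch=10.0)

# A uniform "prediction" stands in for a network output here.
uniform = {pose: 1.0 / len(label) for pose in label}

print("KL(label || label)   =", kl_divergence(label, label))
print("KL(label || uniform) =", kl_divergence(label, uniform))
```

In a training setup, the second argument of `kl_divergence` would be the softmax output of the network over the same pose grid, and the divergence would serve as the loss to minimise.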
Pages: 310 - 322
Number of pages: 13
Related papers
50 in total
  • [1] Fast Head Pose Estimation for Human-Computer Interaction
    Garcia-Montero, Mario
    Redondo-Cabrera, Carolina
    Lopez-Sastre, Roberto
    Tuytelaars, Tinne
    [J]. PATTERN RECOGNITION AND IMAGE ANALYSIS (IBPRIA 2015), 2015, 9117 : 101 - 110
  • [2] Head pose estimation in solving human-computer interaction problems
    Anishchenko S.I.
    Osinov V.A.
    Shaposhnikov D.G.
    [J]. Pattern Recognition and Image Analysis, 2011, 21 (3) : 446 - 449
  • [3] Assessment of human head pose in human-computer interaction
    Anishchenko S.I.
    Osinov V.A.
    Shaposhnikov D.G.
    [J]. Izdatel'stvo Nauka, (22) : 541 - 545
  • [4] Robust Stereoscopic Head Pose Estimation in Human-Computer Interaction and a Unified Evaluation Framework
    Layher, Georg
    Liebau, Hendrik
    Niese, Robert
    Al-Hamadi, Ayoub
    Michaelis, Bernd
    Neumann, Heiko
    [J]. IMAGE ANALYSIS AND PROCESSING - ICIAP 2011, PT I, 2011, 6978 : 227 - 236
  • [5] ARHPE: Asymmetric Relation-Aware Representation Learning for Head Pose Estimation in Industrial Human-Computer Interaction
    Liu, Hai
    Liu, Tingting
    Zhang, Zhaoli
    Sangaiah, Arun Kumar
    Yang, Bing
    Li, Youfu
    [J]. IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2022, 18 (10) : 7107 - 7117
  • [6] Detection of head pose and gaze direction for human-computer interaction
    Weidenbacher, Ulrich
    Layher, Georg
    Bayerl, Pierre
    Neumann, Heiko
    [J]. PERCEPTION AND INTERACTIVE TECHNOLOGIES, PROCEEDINGS, 2006, 4021 : 9 - 19
  • [7] Precise head pose estimation on HPD5A database for attention recognition based on convolutional neural network in human-computer interaction
    Liu, Hai
    Li, Duantengchuan
    Wang, Xiang
    Liu, Leyuan
    Zhang, Zhaoli
    Subramanian, Sriram
    [J]. INFRARED PHYSICS & TECHNOLOGY, 2021, 116
  • [8] GMDL: Toward precise head pose estimation via Gaussian mixed distribution learning for students' attention understanding
    Liu, Tingting
    Yang, Bing
    Liu, Hai
    Ju, Jianping
    Tang, Jianyin
    Subramanian, Sriram
    Zhang, Zhaoli
    [J]. INFRARED PHYSICS & TECHNOLOGY, 2022, 122
  • [9] Auditory Attention Control for Human-Computer Interaction
    Poguntke, Mark
    Ellis, Kirsten
    [J]. 2008 CONFERENCE ON HUMAN SYSTEM INTERACTIONS, VOLS 1 AND 2, 2008 : 225 - 230
  • [10] Human-computer interaction in distance learning
    Maurino, PS
    [J]. CANADIAN JOURNAL OF INFORMATION AND LIBRARY SCIENCE-REVUE CANADIENNE DES SCIENCES DE L INFORMATION ET DE BIBLIOTHECONOMIE, 2003, 27 (04) : 85 - 86