Anisotropic angle distribution learning for head pose estimation and attention understanding in human-computer interaction

被引:118
|
作者
Liu, Hai [1 ,2 ,3 ]
Nie, Hanwen [1 ]
Zhang, Zhaoli [1 ]
Li, You-Fu [3 ]
机构
[1] Cent China Normal Univ, Natl Engn Res Ctr E Learning, Wuhan 430079, Peoples R China
[2] UCL, UCL Interact Ctr, London, England
[3] City Univ Hong Kong, Dept Mech Engn, Kowloon, Hong Kong, Peoples R China
基金
中国国家自然科学基金;
关键词
Head pose estimation; Anisotropic angle distribution; Convolutional neural network; Regularization; Learning behavior analysis; Human-computer interaction;
D O I
10.1016/j.neucom.2020.09.068
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Head pose estimation is an important way to understand human attention in the human-computer interaction. In this paper, we propose a novel anisotropic angle distribution learning (AADL) network for head pose estimation task. Firstly, two key findings are revealed as following: 1) Head pose image variations are different at the yaw and pitch directions with the same pose angle increasing on a fixed central pose; 2) With the fixed angle interval increasing, the image variations increase firstly and then decrease in yaw angle direction. Then, the maximum a posterior technology is employed to construct the head pose estimation network, which includes three parts, such as convolutional layer, covariance pooling layer and output layer. In the output layer, the labels are constructed as the anisotropic angle distributions on the basis of two key findings. And the anisotropic angle distributions are fitted by the 2D Gaussian like distributions (groundtruth labels). Furthermore, the Kullback-Leibler divergence is selected to measure the predication label and the groundtruth one. The features of head pose images are perceived at the AADL-based convolutional neural network in an end-to-end manner. Experimental results demonstrate that the developed AADL-based labels have several advantages, such as robustness for head pose image missing, insensitivity for the motion blur. Moreover, the proposed method has achieved good performance compared to several state-of-the-art methods on the Pointing'04 and CAS_PEAL_R1 databases. (c) 2020 Elsevier B.V. All rights reserved.
引用
收藏
页码:310 / 322
页数:13
相关论文
共 50 条
  • [41] The Importance of Human-Computer Interaction in Radiology E-learning
    den Harder, Annemarie M.
    Frijlingh, Marissa
    Ravesloot, Cecile J.
    Oosterbaan, Anne E.
    van der Gijp, Anouk
    [J]. JOURNAL OF DIGITAL IMAGING, 2016, 29 (02) : 195 - 205
  • [42] A Framework for Collaborative Human-Computer Interaction E-learning
    Elkaffas, Saleh Mesbah
    Seddek, Sherif Mohamed
    [J]. PROCEEDINGS OF SAI INTELLIGENT SYSTEMS CONFERENCE (INTELLISYS) 2016, VOL 1, 2018, 15 : 138 - 155
  • [43] Learning Gestures for Customizable Human-Computer Interaction in the Operating Room
    Schwarz, Loren Arthur
    Bigdelou, Ali
    Navab, Nassir
    [J]. MEDICAL IMAGE COMPUTING AND COMPUTER-ASSISTED INTERVENTION, MICCAI 2011, PT I, 2011, 6891 : 129 - 136
  • [44] Understanding User Acceptance of Electrical Muscle Stimulation in Human-Computer Interaction
    Faltaous, Sarah
    Williamson, Julie R.
    Koelle, Marion
    Pfeifer, Max
    Keppel, Jonas
    Schneegass, Stefan
    [J]. PROCEEDINGS OF THE 2024 CHI CONFERENCE ON HUMAN FACTORS IN COMPUTING SYTEMS (CHI 2024), 2024,
  • [45] Hand Gesture Control for Human-Computer Interaction with Deep Learning
    Chua, S. N. David
    Chin, K. Y. Richard
    Lim, S. F.
    Jain, Pushpdant
    [J]. JOURNAL OF ELECTRICAL ENGINEERING & TECHNOLOGY, 2022, 17 (03) : 1961 - 1970
  • [46] Understanding the customer experience in human-computer interaction: a systematic literature review
    Quinones, Daniela
    Rojas, Luis
    [J]. PEERJ COMPUTER SCIENCE, 2023, 9
  • [47] Teaching and learning human-computer interaction: Past, present, and future
    Churchill, Elizabeth F.
    Bowser, Anne
    Preece, Jennifer
    [J]. Interactions, 2013, 20 (02) : 44 - 53
  • [48] Topology Analysis of Learning Cognitive Flow for Human-Computer Interaction
    Shi, Guimin
    Yang, Sheng
    Liu, Changyong
    Luo, Zhiming
    Meng, Shimin
    Li, Shaozi
    [J]. 2018 NINTH INTERNATIONAL CONFERENCE ON INFORMATION TECHNOLOGY IN MEDICINE AND EDUCATION (ITME 2018), 2018, : 677 - 682
  • [49] Where do machine learning and human-computer interaction meet?
    Moustakis, VS
    Herrmann, J
    [J]. APPLIED ARTIFICIAL INTELLIGENCE, 1997, 11 (7-8) : 595 - 609
  • [50] Human Computer Interaction with Head Pose, Eye Gaze and Body Gestures
    Wang, Kang
    Zhao, Rui
    Ji, Qiang
    [J]. PROCEEDINGS 2018 13TH IEEE INTERNATIONAL CONFERENCE ON AUTOMATIC FACE & GESTURE RECOGNITION (FG 2018), 2018, : 789 - 789