Head pose estimation with uncertainty and an application to dyadic interaction detection

Cited by: 2
Authors
Tomenotti, Federico Figari [1]
Noceti, Nicoletta [1]
Odone, Francesca [1]
Affiliations
[1] Univ Genoa, MaLGa DIBRIS, Via Dodecaneso 35, I-16146 Genoa, Italy
Keywords
Head pose estimation; Multi-task regression; Neural networks; Heteroscedastic uncertainty; Dyadic interaction detection; PEOPLE LOOKING; GAZE; COMMUNICATION; MODEL
DOI
10.1016/j.cviu.2024.103999
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Determining the visual focus of attention of people in a scene is fundamental to understanding social interactions from videos. Gaze direction is ideal for determining eye contact, a basic cue of non-verbal communication, but it is not always easy to recognize. Head direction is a well-known proxy for gaze direction that is more robust to the variability of the scene, and thus offers a valuable alternative. In this work, we consider HHP-net, a heteroscedastic neural network that estimates people's head pose in single frames from a minimal set of head key points. We formulate the problem as a multi-task regression that predicts the pose as a triplet of Euler angles from the output of a 2D pose estimator. HHP-net also provides a measure of the aleatoric heteroscedastic uncertainties associated with the angles, through an ad hoc loss function we introduce. In a thorough experimental analysis, we show that our model is efficient and effective compared with the state of the art, with only approximately 2 degrees of degradation in the worst case, counterbalanced by a space occupation approximately 12 times smaller. We also show the beneficial effects of uncertainty on interpretability. Finally, we discuss the robustness of our method to input variability, showing that it can be used as a plug-in to different pose estimators. As a proof of concept, we address social interaction analysis with an algorithm to detect dyadic interactions in images.
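The abstract does not give the form of the loss, so the following is only an illustrative sketch of the general idea of heteroscedastic regression, not the authors' loss function. It assumes the common Gaussian negative log-likelihood formulation, in which the network outputs, for each Euler angle, a predicted value together with a log-variance. The function name, the argument layout, and the (yaw, pitch, roll) ordering are all hypothetical.

```python
import math


def heteroscedastic_gaussian_nll(y_true, y_pred, log_var):
    """Generic Gaussian NLL with a per-output log-variance (constant term dropped).

    Assumption: this is the textbook heteroscedastic formulation, NOT the
    ad hoc loss introduced in the paper. Each argument is a list of samples,
    and each sample is a triplet, e.g. (yaw, pitch, roll) in degrees.
    """
    total = 0.0
    for t_row, p_row, s_row in zip(y_true, y_pred, log_var):
        for t, p, s in zip(t_row, p_row, s_row):
            # The squared error is down-weighted by the predicted variance
            # exp(s); the 0.5 * s term penalizes predicting a large variance.
            total += 0.5 * math.exp(-s) * (t - p) ** 2 + 0.5 * s
    # Sum over the three angles, average over the samples.
    return total / len(y_true)


# A 10-degree error on the first angle, scored with low and high predicted variance.
confident = heteroscedastic_gaussian_nll([[10.0, 0.0, 0.0]], [[0.0, 0.0, 0.0]], [[0.0, 0.0, 0.0]])
uncertain = heteroscedastic_gaussian_nll([[10.0, 0.0, 0.0]], [[0.0, 0.0, 0.0]], [[math.log(100.0), 0.0, 0.0]])
```

Under this formulation, a large error costs less when the model also reports a large variance for that angle, which is what lets the predicted variance act as a per-angle uncertainty estimate.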
Pages: 14