Head pose estimation with uncertainty and an application to dyadic interaction detection

Cited by: 2
Authors
Tomenotti, Federico Figari [1]
Noceti, Nicoletta [1]
Odone, Francesca [1]
Affiliations
[1] Univ Genoa, MaLGa DIBRIS, Via Dodecaneso 35, I-16146 Genoa, Italy
Keywords
Head pose estimation; Multi-task regression; Neural networks; Heteroscedastic uncertainty; Dyadic interaction detection; PEOPLE LOOKING; GAZE; COMMUNICATION; MODEL
DOI
10.1016/j.cviu.2024.103999
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
The visual focus of attention of people in a scene is a fundamental cue for understanding social interactions in videos. Gaze direction is ideal for determining eye contact, a basic cue of non-verbal communication, but it is not always easy to recognize. Head direction is a well-known proxy for gaze direction that is more robust to scene variability, and thus offers a valuable alternative. In this work, we consider HHP-net, a heteroscedastic neural network that estimates people's head pose from single frames using a minimal set of head key points. We formulate the problem as a multi-task regression that predicts the pose as a triplet of Euler angles from the output of a 2D pose estimator. HHP-net also provides a measure of the aleatoric heteroscedastic uncertainties associated with the angles, through an ad hoc loss function we introduce. In a thorough experimental analysis, we show that our model is efficient and effective compared with the state of the art, with only about 2 degrees of degradation in the worst case, counterbalanced by a space occupation about 12 times smaller. We also show the beneficial effects of uncertainty on interpretability. Finally, we discuss the robustness of our method to input variability, showing that it can be used as a plug-in to different pose estimators. As a proof of concept, we address social interaction analysis with an algorithm to detect dyadic interactions in images.
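The abstract does not spell out the ad hoc loss that HHP-net uses, so the following is only a minimal sketch of a commonly used alternative: a Gaussian negative log-likelihood in which each Euler angle gets its own predicted log-variance. The function name `heteroscedastic_nll` and the numeric values are hypothetical and chosen purely for illustration; this is not the authors' formulation.

```python
import math

def heteroscedastic_nll(pred_angles, log_vars, true_angles):
    """Mean Gaussian negative log-likelihood over the angles.

    Per angle: 0.5 * exp(-s) * (y - mu)^2 + 0.5 * s, with s = log(sigma^2).
    This is a generic heteroscedastic loss, NOT the loss defined in the paper.
    """
    total = 0.0
    for mu, s, y in zip(pred_angles, log_vars, true_angles):
        total += 0.5 * math.exp(-s) * (y - mu) ** 2 + 0.5 * s
    return total / len(pred_angles)

# Hypothetical (yaw, pitch, roll) values in degrees; yaw is off by 2 degrees.
pred = [10.0, -5.0, 2.0]
truth = [12.0, -5.0, 2.0]

# Same prediction, two uncertainty estimates for the yaw angle.
confident = heteroscedastic_nll(pred, [0.0, 0.0, 0.0], truth)
uncertain = heteroscedastic_nll(pred, [math.log(4.0), 0.0, 0.0], truth)

print(f"confident: {confident:.4f}, uncertain: {uncertain:.4f}")
```

In this sketch, declaring a larger variance on the angle that is actually wrong lowers the loss, while the `0.5 * s` term penalizes claiming high uncertainty everywhere. That trade-off is what lets a network learn input-dependent (heteroscedastic) uncertainty alongside the angles themselves.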
Pages: 14