Head pose estimation with uncertainty and an application to dyadic interaction detection

Cited by: 2
Authors
Tomenotti, Federico Figari [1]
Noceti, Nicoletta [1]
Odone, Francesca [1]
Affiliation
[1] Univ Genoa, MaLGa DIBRIS, Via Dodecaneso 35, I-16146 Genoa, Italy
Keywords
Head pose estimation; Multi-task regression; Neural networks; Heteroscedastic uncertainty; Dyadic interaction detection; PEOPLE LOOKING; GAZE; COMMUNICATION; MODEL;
DOI
10.1016/j.cviu.2024.103999
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification code
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Determining the visual focus of attention of people in a scene is a fundamental cue for understanding social interactions from videos. Gaze direction is ideal for determining eye contact, a basic cue of non-verbal communication, but it is not always easy to recognize. Head direction is a well-known proxy of gaze direction that is more robust to the variability of the scene, thus offering a valuable alternative. In this work, we consider HHP-net, a heteroscedastic neural network that estimates people's head pose from single frames using a minimal set of head key points. We formulate the problem as a multi-task regression that predicts the pose as a triplet of Euler angles from the output of a 2D pose estimator. HHP-net also provides a measure of the aleatoric heteroscedastic uncertainties associated with the angles, through an ad hoc loss function we introduce. In a thorough experimental analysis, we show that our model is efficient and effective compared with the state of the art, with only about 2 degrees of degradation in the worst case, counterbalanced by a space occupation about 12 times smaller. We also show the beneficial effects of uncertainty on interpretability. Finally, we discuss the robustness of our method to input variability, showing that it can be used as a plug-in to different pose estimators. As a proof of concept, we address social interaction analysis with an algorithm to detect dyadic interactions in images.
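The abstract does not spell out the loss function, so the following is only a minimal sketch of how a heteroscedastic multi-task regression loss is commonly written, assuming a per-angle Gaussian negative log-likelihood with a predicted log-variance. The function name `heteroscedastic_loss` and its arguments are hypothetical, and the ad hoc loss introduced in the paper may differ from this form.

```python
import math


def heteroscedastic_loss(pred_angles, log_vars, true_angles):
    """Gaussian negative log-likelihood (up to a constant), averaged over angles.

    Assumption: each Euler angle (yaw, pitch, roll) has a predicted mean mu_i
    and a predicted log-variance s_i = log(sigma_i^2). Per-angle term:
        0.5 * exp(-s_i) * (y_i - mu_i)^2 + 0.5 * s_i
    This is a generic formulation, not the loss defined in the paper.
    """
    total = 0.0
    for mu, s, y in zip(pred_angles, log_vars, true_angles):
        squared_error = (y - mu) ** 2
        # A large predicted variance down-weights the squared error,
        # while the 0.5 * s term penalizes claiming high uncertainty everywhere.
        total += 0.5 * math.exp(-s) * squared_error + 0.5 * s
    return total / len(pred_angles)


# Hypothetical example: the same 2-degree error on each angle,
# scored with a confident and with a less confident prediction.
confident = heteroscedastic_loss([0.0, 0.0, 0.0], [0.0, 0.0, 0.0], [2.0, 2.0, 2.0])
uncertain = heteroscedastic_loss(
    [0.0, 0.0, 0.0], [math.log(4.0)] * 3, [2.0, 2.0, 2.0]
)
```

In this sketch, the less confident prediction receives the lower loss for the same error, which is the mechanism that lets a network of this kind report larger uncertainty on inputs it finds hard.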
Pages: 14
Related papers
50 in total
  • [1] Dyadic Interaction Detection from Pose and Flow
    van Gemeren, Coert
    Tan, Robby T.
    Poppe, Ronald
    Veltkamp, Remco C.
    HUMAN BEHAVIOR UNDERSTANDING (HBU 2014), 2014, 8749 : 101 - 115
  • [2] Dyadic interaction detection from pose and flow
    van Gemeren, Coert (C.J.VanGemeren@uu.nl), 1600, Springer Verlag (8749):
  • [3] Predicting Head Pose in Dyadic Conversation
    Greenwood, David
    Laycock, Stephen
    Matthews, Iain
    INTELLIGENT VIRTUAL AGENTS, IVA 2017, 2017, 10498 : 160 - 169
  • [4] PHOW Based Feature Detection For Head Pose Estimation
    Jian, Wang
    Hua, Van
    Jing, Li
    Ping, Xia
    2015 IEEE 16TH INTERNATIONAL CONFERENCE ON COMMUNICATION TECHNOLOGY (ICCT), 2015, : 437 - 440
  • [5] HeadDiff: Exploring Rotation Uncertainty With Diffusion Models for Head Pose Estimation
    Wang, Yaoxing
    Liu, Hao
    Feng, Yaowei
    Li, Zhendong
    Wu, Xiangjuan
    Zhu, Congcong
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2024, 33 : 1868 - 1882
  • [6] Fast Head Pose Estimation for Human-Computer Interaction
    Garcia-Montero, Mario
    Redondo-Cabrera, Carolina
    Lopez-Sastre, Roberto
    Tuytelaars, Tinne
    PATTERN RECOGNITION AND IMAGE ANALYSIS (IBPRIA 2015), 2015, 9117 : 101 - 110
  • [7] Detection of Dangerous Behavior by Estimation of Head Pose and Moving Direction
    Miyoshi, Kenji
    Nomiya, Hiroki
    Hochin, Teruhisa
    2018 5TH INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE/ INTELLIGENCE AND APPLIED INFORMATICS (CSII 2018), 2018, : 121 - 126
  • [8] DETECTION AND ANALYSIS OF SYMMETRICAL PARTS ON FACE FOR HEAD POSE ESTIMATION
    Dahmane, Afifa
    Larabi, Slimane
    Djeraba, Chabane
    2010 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, 2010, : 3249 - 3252
  • [9] Gaze Detection Based on Head Pose Estimation in Smart TV
    Dat Tien Nguyen
    Shin, Kwang Yong
    Lee, Won Oh
    Kim, Yeong Gon
    Kim, Ki Wan
    Hong, Hyung Gil
    Park, Kang Ryoung
    Oh, CheonIn
    Lee, HanKyu
    Jeong, Youngho
    2013 INTERNATIONAL CONFERENCE ON ICT CONVERGENCE (ICTC 2013): FUTURE CREATIVE CONVERGENCE TECHNOLOGIES FOR NEW ICT ECOSYSTEMS, 2013, : 283 - 288
  • [10] Efficient and Robust Integration of Face Detection and Head Pose Estimation
    Jiang, Feijun
    Ekenel, Hazim Kemal
    Shi, Bertram E.
    2012 21ST INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR 2012), 2012, : 1578 - 1581