Joint Estimation of Human Pose and Conversational Groups from Social Scenes

被引:20
|
作者
Varadarajan, Jagannadan [1 ]
Subramanian, Ramanathan [2 ,3 ]
Bulo, Samuel Rota [4 ,5 ]
Ahuja, Narendra [1 ,6 ]
Lanz, Oswald [5 ]
Ricci, Elisa [5 ,7 ]
机构
[1] Adv Digital Sci Ctr, Singapore, Singapore
[2] Int Inst Informat Technol, Hyderabad, Andhra Prades, India
[3] Univ Glasgow, Glasgow, Lanark, Scotland
[4] Mapillary Res, Graz, Austria
[5] Fdn Bruno Kessler, Trento, Italy
[6] Univ Illinois, Champaign, IL USA
[7] Univ Perugia, Dept Engn, Perugia, Italy
关键词
Head and body pose estimation; F-formation estimation; Semi-supervised learning; Convex optimization; Conversational groups; Video surveillance; HEAD POSE; VISUAL FOCUS; ATTENTION; TRACKING;
D O I
10.1007/s11263-017-1026-6
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Despite many attempts in the last few years, automatic analysis of social scenes captured by wide-angle camera networks remains a very challenging task due to the low resolution of targets, background clutter and frequent and persistent occlusions. In this paper, we present a novel framework for jointly estimating (i) head, body orientations of targets and (ii) conversational groups called F-formations from social scenes. In contrast to prior works that have (a) exploited the limited range of head and body orientations to jointly learn both, or (b) employed the mutual head (but not body) pose of interactors for deducing F-formations, we propose a weakly-supervised learning algorithm for joint inference. Our algorithm employs body pose as the primary cue for F-formation estimation, and an alternating optimization strategy is proposed to iteratively refine F-formation and pose estimates. We demonstrate the increased efficacy of joint inference over the state-of-the-art via extensive experiments on three social datasets.
引用
收藏
页码:410 / 429
页数:20
相关论文
共 50 条
  • [1] Joint Estimation of Human Pose and Conversational Groups from Social Scenes
    Jagannadan Varadarajan
    Ramanathan Subramanian
    Samuel Rota Bulò
    Narendra Ahuja
    Oswald Lanz
    Elisa Ricci
    [J]. International Journal of Computer Vision, 2018, 126 : 410 - 429
  • [2] Human Pose Estimation in Real Traffic Scenes
    Kress, Viktor
    Jung, Janis
    Zernetsch, Stefan
    Doll, Konrad
    Sick, Bernhard
    [J]. 2018 IEEE SYMPOSIUM SERIES ON COMPUTATIONAL INTELLIGENCE (IEEE SSCI), 2018, : 518 - 523
  • [3] Exploiting Learnable Joint Groups for Hand Pose Estimation
    Li, Moran
    Gao, Yuan
    Sang, Nong
    [J]. THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 1921 - 1929
  • [4] Towards Accurate Human Pose Estimation in Videos of Crowded Scenes
    Chang, Shuning
    Yuan, Li
    Nie, Xuecheng
    Huang, Ziyuan
    Zhou, Yichen
    Chen, Yupeng
    Feng, Jiashi
    Yan, Shuicheng
    [J]. MM '20: PROCEEDINGS OF THE 28TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, 2020, : 4630 - 4634
  • [5] Simple Pair Pose - Pairwise Human Pose Estimation in Dense Urban Traffic Scenes
    Braun, Markus
    Flohr, Fabian B.
    Krebs, Sebastian
    Kressel, Ulrich
    Gavrila, Dariu M.
    [J]. 2021 32ND IEEE INTELLIGENT VEHICLES SYMPOSIUM (IV), 2021, : 1545 - 1552
  • [6] Joint relation based human pose estimation
    Shuang Liang
    Gang Chu
    Chi Xie
    Jiewen Wang
    [J]. The Visual Computer, 2022, 38 : 1369 - 1381
  • [7] Learning Joint Structure for Human Pose Estimation
    Feng, Shenming
    Hu, Haifeng
    [J]. ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2020, 16 (03)
  • [8] Joint relation based human pose estimation
    Liang, Shuang
    Chu, Gang
    Xie, Chi
    Wang, Jiewen
    [J]. VISUAL COMPUTER, 2022, 38 (04): : 1369 - 1381
  • [9] JPerceiver: Joint Perception Network for Depth, Pose and Layout Estimation in Driving Scenes
    Zhao, Haimei
    Zhang, Jing
    Zhang, Sen
    Tao, Dacheng
    [J]. COMPUTER VISION, ECCV 2022, PT XXXVIII, 2022, 13698 : 708 - 726
  • [10] LIGHTWEIGHT HUMAN POSE ESTIMATION UNDER RESOURCE-LIMITED SCENES
    Zhang, Zhe
    Tang, Jie
    Wu, Gangshan
    [J]. 2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 2170 - 2174