Joint Estimation of Human Pose and Conversational Groups from Social Scenes

被引:20
|
作者
Varadarajan, Jagannadan [1 ]
Subramanian, Ramanathan [2 ,3 ]
Bulo, Samuel Rota [4 ,5 ]
Ahuja, Narendra [1 ,6 ]
Lanz, Oswald [5 ]
Ricci, Elisa [5 ,7 ]
机构
[1] Adv Digital Sci Ctr, Singapore, Singapore
[2] Int Inst Informat Technol, Hyderabad, Andhra Prades, India
[3] Univ Glasgow, Glasgow, Lanark, Scotland
[4] Mapillary Res, Graz, Austria
[5] Fdn Bruno Kessler, Trento, Italy
[6] Univ Illinois, Champaign, IL USA
[7] Univ Perugia, Dept Engn, Perugia, Italy
关键词
Head and body pose estimation; F-formation estimation; Semi-supervised learning; Convex optimization; Conversational groups; Video surveillance; HEAD POSE; VISUAL FOCUS; ATTENTION; TRACKING;
D O I
10.1007/s11263-017-1026-6
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Despite many attempts in the last few years, automatic analysis of social scenes captured by wide-angle camera networks remains a very challenging task due to the low resolution of targets, background clutter and frequent and persistent occlusions. In this paper, we present a novel framework for jointly estimating (i) head, body orientations of targets and (ii) conversational groups called F-formations from social scenes. In contrast to prior works that have (a) exploited the limited range of head and body orientations to jointly learn both, or (b) employed the mutual head (but not body) pose of interactors for deducing F-formations, we propose a weakly-supervised learning algorithm for joint inference. Our algorithm employs body pose as the primary cue for F-formation estimation, and an alternating optimization strategy is proposed to iteratively refine F-formation and pose estimates. We demonstrate the increased efficacy of joint inference over the state-of-the-art via extensive experiments on three social datasets.
引用
收藏
页码:410 / 429
页数:20
相关论文
共 50 条
  • [21] Robust Pose Estimation in Crowded Scenes with Direct Pose-Level Inference
    Wang, Dongkai
    Zhang, Shiliang
    Hua, Gang
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
  • [22] Research on Multiplayer Pose Estimation in Complex Sports Scenes
    Liu, Changyuan
    Zang, Yancheng
    Lan, Chaofeng
    [J]. Dianzi Keji Daxue Xuebao/Journal of the University of Electronic Science and Technology of China, 2024, 53 (06): : 930 - 939
  • [23] Robust pose estimation for arbitrary objects in complex scenes
    Dörfler, P
    Schnurr, C
    [J]. PATTERN RECOGNITION, 2004, 3175 : 455 - 462
  • [24] Dual Graph Networks for Pose Estimation in Crowded Scenes
    Jun Tu
    Gangshan Wu
    Limin Wang
    [J]. International Journal of Computer Vision, 2024, 132 (3) : 633 - 653
  • [25] Dual Graph Networks for Pose Estimation in Crowded Scenes
    Tu, Jun
    Wu, Gangshan
    Wang, Limin
    [J]. INTERNATIONAL JOURNAL OF COMPUTER VISION, 2024, 132 (03) : 633 - 653
  • [26] Camera Pose Estimation in Dynamic Scenes with Background Tracking
    [J]. 1600, International Frequency Sensor Association, 46 Thorny Vineway, Toronto, ON M2J 4J2, Canada (19):
  • [27] Joint Human Pose Estimation and Stereo 3D Localization
    Deng, Wenlong
    Bertoni, Lorenzo
    Kreiss, Sven
    Alahi, Alexandre
    [J]. 2020 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2020, : 2324 - 2330
  • [28] Joint Training of a Convolutional Network and a Graphical Model for Human Pose Estimation
    Tompson, Jonathan
    Jain, Arjun
    LeCun, Yann
    Bregler, Christoph
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 27 (NIPS 2014), 2014, 27
  • [29] Human Pose Estimation using Body Parts Dependent Joint Regressors
    Dantone, Matthias
    Gall, Juergen
    Leistner, Christian
    Van Gool, Luc
    [J]. 2013 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2013, : 3041 - 3048
  • [30] Human Pose Estimation from Video and IMUs
    von Marcard, Timo
    Pons-Moll, Gerard
    Rosenhahn, Bodo
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2016, 38 (08) : 1533 - 1547