A Multi-Task Learning Framework for Head Pose Estimation under Target Motion

Cited by: 90
Authors
Yan, Yan [1]
Ricci, Elisa [2, 3]
Subramanian, Ramanathan [4]
Liu, Gaowen [1]
Lanz, Oswald [2]
Sebe, Nicu [1]
Affiliations
[1] Univ Trento, Dept Informat Engn & Comp Sci, Trento, Italy
[2] Fdn Bruno Kessler, Technol Vis, I-06100 Trento, Italy
[3] Univ Perugia, Dept Engn, I-06100 Perugia, Italy
[4] ADSC, Singapore, Singapore
Keywords
Multi-task learning; graph guided; head pose classification; video surveillance; multi-camera systems
DOI
10.1109/TPAMI.2015.2477843
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Recently, head pose estimation (HPE) from low-resolution surveillance data has gained in importance. However, monocular and multi-view HPE approaches still work poorly under target motion, as facial appearance distorts owing to camera perspective and scale changes when a person moves around. To this end, we propose FEGA-MTL, a novel framework based on Multi-Task Learning (MTL) for classifying the head pose of a person who moves freely in an environment monitored by multiple, large field-of-view surveillance cameras. Upon partitioning the monitored scene into a dense uniform spatial grid, FEGA-MTL simultaneously clusters grid partitions into regions with similar facial appearance, while learning region-specific head pose classifiers. In the learning phase, guided by two graphs which a-priori model the similarity among (1) grid partitions based on camera geometry and (2) head pose classes, FEGA-MTL derives the optimal scene partitioning and associated pose classifiers. Upon determining the target's position using a person tracker at test time, the corresponding region-specific classifier is invoked for HPE. The FEGA-MTL framework naturally extends to a weakly supervised setting where the target's walking direction is employed as a proxy in lieu of head orientation. Experiments confirm that FEGA-MTL significantly outperforms competing single-task and multi-task learning methods in multi-view settings.
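The test-time step described in the abstract (locate the target with a tracker, then invoke the classifier of the region containing that position) can be illustrated with a minimal sketch. This is not the authors' implementation: the class `RegionDispatcher`, the square `cell_size`, the hand-written `cell_to_region` map, and the constant placeholder classifiers are all assumptions made for illustration. In FEGA-MTL the cell-to-region clustering and the classifiers are learned jointly, which this sketch does not attempt.

```python
from dataclasses import dataclass
from typing import Callable, Dict, Sequence, Tuple

Cell = Tuple[int, int]  # (row, col) index of a grid partition


@dataclass
class RegionDispatcher:
    """Hypothetical helper: routes a tracked position to a region-specific classifier."""
    cell_size: float                  # side length of one square grid cell (assumed)
    grid_shape: Tuple[int, int]       # (rows, cols) of the uniform grid
    cell_to_region: Dict[Cell, int]   # stand-in for the learned clustering of cells
    classifiers: Dict[int, Callable[[Sequence[float]], int]]  # one per region

    def cell_of(self, x: float, y: float) -> Cell:
        """Map a ground-plane position to its grid cell, clamped to the grid bounds."""
        rows, cols = self.grid_shape
        col = min(max(int(x // self.cell_size), 0), cols - 1)
        row = min(max(int(y // self.cell_size), 0), rows - 1)
        return (row, col)

    def classify(self, position: Tuple[float, float], features: Sequence[float]) -> int:
        """Look up the region of the tracked position and call that region's classifier."""
        region = self.cell_to_region[self.cell_of(*position)]
        return self.classifiers[region](features)


# Toy 2 x 2 grid: the left column forms region 0, the right column region 1.
dispatcher = RegionDispatcher(
    cell_size=1.0,
    grid_shape=(2, 2),
    cell_to_region={(0, 0): 0, (1, 0): 0, (0, 1): 1, (1, 1): 1},
    # Placeholder classifiers that ignore the features and return a fixed pose class.
    classifiers={0: lambda feats: 0, 1: lambda feats: 1},
)

pose_class = dispatcher.classify((1.5, 0.2), [0.0, 0.0])
```

In a real system the position would come from the person tracker and the features from the multi-view head crops; both are replaced by literals here.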
Pages: 1070 - 1083
Page count: 14
Related papers
50 items in total
  • [21] Novel Multi-Task Learning for Motion Magnification
    Chen, Li
    Peng, Cong
    Zhao, Bingchao
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2023, 33 (10) : 5973 - 5985
  • [22] Multi-task learning framework for echocardiography segmentation
    Monkam, Patrice
    Jin, Songbai
    Lu, Wenkai
    2022 IEEE INTERNATIONAL ULTRASONICS SYMPOSIUM (IEEE IUS), 2022,
  • [23] Robust multi-task learning and online refinement for spacecraft pose estimation across domain gap
    Park, Tae Ha
    D'Amico, Simone
    ADVANCES IN SPACE RESEARCH, 2024, 73 (11) : 5726 - 5740
  • [24] Multi-Task Learning for Influence Estimation and Maximization
    Panagopoulos, George
    Malliaros, Fragkiskos D.
    Vazirgiannis, Michalis
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2022, 34 (09) : 4398 - 4409
  • [25] Orientation and Occlusion Aware Multi-Person Pose Estimation using Multi-Task Deep Learning Network
    Zhang, Huiyang
    Gu, Yanlei
    Kamijo, Shunsuke
    2019 IEEE INTERNATIONAL CONFERENCE ON CONSUMER ELECTRONICS (ICCE), 2019,
  • [26] CrossInfoNet: Multi-Task Information Sharing Based Hand Pose Estimation
    Du, Kuo
    Lin, Xiangbo
    Sun, Yi
    Ma, Xiaohong
    2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 9888 - 9897
  • [27] Interpreting and Exploiting Functional Specialization in Multi-Head Attention under Multi-task Learning
    Li, Chong
    Wang, Shaonan
    Zhang, Yunhao
    Zhang, Jiajun
    Zong, Chengqing
    2023 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2023), 2023, : 16460 - 16476
  • [28] IMPROVING SAR TARGET RECOGNITION WITH MULTI-TASK LEARNING
    Du, Wenrui
    Zhang, Fan
    Ma, Fei
    Yin, Qiang
    Zhou, Yongsheng
    IGARSS 2020 - 2020 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM, 2020, : 284 - 287
  • [29] A multi-task framework for metric learning with common subspace
    Yang, Peipei
    Huang, Kaizhu
    Liu, Cheng-Lin
    NEURAL COMPUTING & APPLICATIONS, 2013, 22 (7-8) : 1337 - 1347
  • [30] Focused multi-task learning in a Gaussian process framework
    Leen, Gayle
    Peltonen, Jaakko
    Kaski, Samuel
    MACHINE LEARNING, 2012, 89 : 157 - 182