A Multi-Task Learning Framework for Head Pose Estimation under Target Motion

被引:90
|
作者
Yan, Yan [1 ]
Ricci, Elisa [2 ,3 ]
Subramanian, Ramanathan [4 ]
Liu, Gaowen [1 ]
Lanz, Oswald [2 ]
Sebe, Nicu [1 ]
机构
[1] Univ Trento, Dept Informat Engn & Comp Sci, Trento, Italy
[2] Fdn Bruno Kessler, Technol Vis, I-06100 Trento, Italy
[3] Univ Perugia, Dept Engn, I-06100 Perugia, Italy
[4] ADSC, Singapore, Singapore
关键词
Multi-task learning; graph guided; head pose classification; video surveillance; multi-camera systems;
D O I
10.1109/TPAMI.2015.2477843
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Recently, head pose estimation (HPE) from low-resolution surveillance data has gained in importance. However, monocular and multi-view HPE approaches still work poorly under target motion, as facial appearance distorts owing to camera perspective and scale changes when a person moves around. To this end, we propose FEGA-MTL, a novel framework based on Multi-Task Learning (MTL) for classifying the head pose of a person who moves freely in an environment monitored by multiple, large field-of-view surveillance cameras. Upon partitioning the monitored scene into a dense uniform spatial grid, FEGA-MTL simultaneously clusters grid partitions into regions with similar facial appearance, while learning region-specific head pose classifiers. In the learning phase, guided by two graphs which a-priori model the similarity among (1) grid partitions based on camera geometry and (2) head pose classes, FEGA-MTL derives the optimal scene partitioning and associated pose classifiers. Upon determining the target's position using a person tracker at test time, the corresponding region-specific classifier is invoked for HPE. The FEGA-MTL framework naturally extends to a weakly supervised setting where the target's walking direction is employed as a proxy in lieu of head orientation. Experiments confirm that FEGA-MTL significantly outperforms competing single-task and multi-task learning methods in multi-view settings.
引用
收藏
页码:1070 / 1083
页数:14
相关论文
共 50 条
  • [1] Multi-Task Head Pose Estimation in-the-Wild
    Valle, Roberto
    Buenaposada, Jose M.
    Baumela, Luis
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2021, 43 (08) : 2874 - 2881
  • [2] No Matter Where You Are: Flexible Graph-guided Multi-task Learning for Multi-view Head Pose Classification under Target Motion
    Yan, Yan
    Ricci, Elisa
    Subramanian, Ramanathan
    Lanz, Oswald
    Sebe, Nicu
    2013 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2013, : 1177 - 1184
  • [3] Multi-Task Learning Framework for Motion Estimation and Dynamic Scene Deblurring
    Jung, Hyungjoo
    Kim, Youngjung
    Jang, Hyunsung
    Ha, Namkoo
    Sohn, Kwanghoon
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2021, 30 (30) : 8170 - 8183
  • [4] MULTI-TASK DEEP LEARNING AND UNCERTAINTY ESTIMATION FOR PET HEAD MOTION CORRECTION
    Lieffrig, Eleonore V.
    Zeng, Tianyi
    Zhang, Jiazhen
    Fontaine, Kathryn
    Fang, Xi
    Revilla, Enette
    Lu, Yihuan
    Onofrey, John A.
    2023 IEEE 20TH INTERNATIONAL SYMPOSIUM ON BIOMEDICAL IMAGING, ISBI, 2023,
  • [5] A Multi-task Learning Framework for Quality Estimation
    Deoghare, Sourabh
    Choudhary, Paramveer
    Kanojia, Diptesh
    Ranasinghe, Tharindu
    Bhattacharyya, Pushpak
    Orasan, Constantin
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2023), 2023, : 9191 - 9205
  • [6] A Multi-Task Learning Framework for Multi-Target Stance Detection
    Li, Yingjie
    Caragea, Cornelia
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, ACL-IJCNLP 2021, 2021, : 2320 - 2326
  • [7] A multi-task learning framework for gas detection and concentration estimation
    Liu, Huixiang
    Li, Qing
    Gu, Yu
    NEUROCOMPUTING, 2020, 416 : 28 - 37
  • [8] Joint Head Pose Estimation with Multi-task Cascaded Convolutional Networks for Face Alignment
    Cai, Zhenni
    Liu, Qingshan
    Wang, Shanmin
    Yang, Bruce
    2018 24TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2018, : 495 - 500
  • [9] Classification-based Multi-task Learning for Efficient Pose Estimation Network
    Kang, Dongoh
    Roh, Myung-Cheol
    Kim, Hansaem
    Kim, Yonghyun
    Lee, Seong-Whan
    2022 26TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2022, : 3295 - 3302
  • [10] HyperFace: A Deep Multi-Task Learning Framework for Face Detection, Landmark Localization, Pose Estimation, and Gender Recognition
    Ranjan, Rajeev
    Patel, Vishal M.
    Chellappa, Rama
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2019, 41 (01) : 121 - 135