A Multi-Task Learning Framework for Head Pose Estimation under Target Motion

被引:90
|
作者
Yan, Yan [1 ]
Ricci, Elisa [2 ,3 ]
Subramanian, Ramanathan [4 ]
Liu, Gaowen [1 ]
Lanz, Oswald [2 ]
Sebe, Nicu [1 ]
机构
[1] Univ Trento, Dept Informat Engn & Comp Sci, Trento, Italy
[2] Fdn Bruno Kessler, Technol Vis, I-06100 Trento, Italy
[3] Univ Perugia, Dept Engn, I-06100 Perugia, Italy
[4] ADSC, Singapore, Singapore
关键词
Multi-task learning; graph guided; head pose classification; video surveillance; multi-camera systems;
D O I
10.1109/TPAMI.2015.2477843
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Recently, head pose estimation (HPE) from low-resolution surveillance data has gained in importance. However, monocular and multi-view HPE approaches still work poorly under target motion, as facial appearance distorts owing to camera perspective and scale changes when a person moves around. To this end, we propose FEGA-MTL, a novel framework based on Multi-Task Learning (MTL) for classifying the head pose of a person who moves freely in an environment monitored by multiple, large field-of-view surveillance cameras. Upon partitioning the monitored scene into a dense uniform spatial grid, FEGA-MTL simultaneously clusters grid partitions into regions with similar facial appearance, while learning region-specific head pose classifiers. In the learning phase, guided by two graphs which a-priori model the similarity among (1) grid partitions based on camera geometry and (2) head pose classes, FEGA-MTL derives the optimal scene partitioning and associated pose classifiers. Upon determining the target's position using a person tracker at test time, the corresponding region-specific classifier is invoked for HPE. The FEGA-MTL framework naturally extends to a weakly supervised setting where the target's walking direction is employed as a proxy in lieu of head orientation. Experiments confirm that FEGA-MTL significantly outperforms competing single-task and multi-task learning methods in multi-view settings.
引用
收藏
页码:1070 / 1083
页数:14
相关论文
共 50 条
  • [41] Multi-task Representation Learning for Travel Time Estimation
    Li, Yaguang
    Fu, Kun
    Wang, Zheng
    Shahabi, Cyrus
    Ye, Jieping
    Liu, Yan
    KDD'18: PROCEEDINGS OF THE 24TH ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY & DATA MINING, 2018, : 1695 - 1704
  • [42] Multi-Task Rank Learning for Visual Saliency Estimation
    Li, Jia
    Tian, Yonghong
    Huang, Tiejun
    Gao, Wen
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2011, 21 (05) : 623 - 636
  • [43] Target identification of different task weight under multi-interface and multi-task
    Zeng, Qingxin
    Zhuang, Damin
    Beijing Hangkong Hangtian Daxue Xuebao/Journal of Beijing University of Aeronautics and Astronautics, 2006, 32 (05): : 499 - 502
  • [44] Multi-Head TrajectoryCNN: A New Multi-Task Framework for Action Prediction
    Liu, Xiaoli
    Yin, Jianqin
    APPLIED SCIENCES-BASEL, 2022, 12 (11):
  • [45] Multi-task, multi-domain learning: Application to semantic segmentation and pose regression
    Fourure, Damien
    Emonet, Remi
    Fromont, Elisa
    Muselet, Damien
    Neverova, Natalia
    Tremeau, Alain
    Wolf, Christian
    NEUROCOMPUTING, 2017, 251 : 68 - 80
  • [46] Multi-triage: A multi-task learning framework for bug triage
    Aung, Thazin Win Win
    Wan, Yao
    Huo, Huan
    Sui, Yulei
    Journal of Systems and Software, 2022, 184
  • [47] 3D human pose and shape estimation via de-occlusion multi-task learning
    Ran, Hang
    Ning, Xin
    Li, Weijun
    Hao, Meilan
    Tiwari, Prayag
    NEUROCOMPUTING, 2023, 548
  • [48] Multi-triage: A multi-task learning framework for bug triage
    Aung, Thazin Win Win
    Wan, Yao
    Huo, Huan
    Sui, Yulei
    JOURNAL OF SYSTEMS AND SOFTWARE, 2022, 184
  • [49] Real-Time Human Pose Estimation via Cascaded Neural Networks Embedded with Multi-task Learning
    Tanabe, Satoshi
    Yamanaka, Ryosuke
    Tomono, Mitsuru
    Ito, Makiko
    Ishihara, Teruo
    COMPUTER ANALYSIS OF IMAGES AND PATTERNS: 17TH INTERNATIONAL CONFERENCE, CAIP 2017, PT II, 2017, 10425 : 241 - 252
  • [50] Clustered Multi-Task Learning for Automatic Radar Target Recognition
    Li, Cong
    Bao, Weimin
    Xu, Luping
    Zhang, Hua
    SENSORS, 2017, 17 (10)