Monocular head pose estimation using generalized adaptive view-based appearance model

被引:25
|
作者
Morency, Louis-Philippe [1 ]
Whitehill, Jacob [2 ]
Movellan, Javier [2 ]
机构
[1] USC Inst Creat Technol, Marina Del Rey, CA 90292 USA
[2] Univ Calif San Diego, Machine Percept Lab, La Jolla, CA 92093 USA
关键词
Head pose estimation; View-based appearance model; Keyframe tracking; Differential tracking; Rigid body tracking; Kalman filter update; Bounded drift; TRACKING;
D O I
10.1016/j.imavis.2009.08.004
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Accurately estimating the person's head position and orientation is an important task for a wide range of applications such as driver awareness, meeting analysis and human-robot interaction. Over the past two decades, many approaches have been suggested to solve this problem, each with its own advantages and disadvantages. In this paper, we present a probabilistic framework called Generalized Adaptive View-based Appearance Model (GAVAM) which integrates the advantages from three of these approaches: (1) the automatic initialization and stability of static head pose estimation, (2) the relative precision and user-independence of differential registration, and (3) the robustness and bounded drift of keyframe tracking. In our experiments, we show how the GAVAM model can be used to estimate head position and orientation in real-time using a simple monocular camera. Our experiments on two previously published datasets show that the GAVAM framework can accurately track for a long period of time with an average accuracy of 3.5 degrees and 0.75 in. when compared with an inertial sensor and a 3D magnetic sensor. (C) 2009 Elsevier B.V. All rights reserved.
引用
收藏
页码:754 / 761
页数:8
相关论文
共 50 条
  • [1] Generalized Adaptive View-based Appearance Model: Integrated Framework for Monocular Head Pose Estimation
    Morency, Louis-Philippe
    Whitehill, Jacob
    Movellan, Javier
    2008 8TH IEEE INTERNATIONAL CONFERENCE ON AUTOMATIC FACE & GESTURE RECOGNITION (FG 2008), VOLS 1 AND 2, 2008, : 875 - +
  • [2] Adaptive view-based appearance models
    Morency, LP
    Rahimi, A
    Darrell, T
    2003 IEEE COMPUTER SOCIETY CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOL 1, PROCEEDINGS, 2003, : 803 - 810
  • [3] Efficient pose estimation using view-based object representations
    Peters, G
    COMPUTER VISION SYSTEMS, PROCEEDINGS, 2003, 2626 : 12 - 21
  • [4] Efficient pose estimation using view-based object representations
    Peters, G
    MACHINE VISION AND APPLICATIONS, 2004, 16 (01) : 59 - 63
  • [5] Efficient pose estimation using view-based object representations
    Gabriele Peters
    Machine Vision and Applications, 2004, 16 : 59 - 63
  • [6] Large head movement tracking using scale invariant view-based appearance model
    Zhao, Gangqiang
    Chen, Ling
    Chen, Gencai
    ADVANCED CONCEPTS FOR INTELLIGENT VISION SYSTEMS, PROCEEDINGS, 2007, 4678 : 331 - 339
  • [7] Pose estimation using 3D view-based eigenspaces
    Morency, LP
    Sundberg, P
    Darrell, T
    IEEE INTERNATIONAL WORKSHOP ON ANALYSIS AND MODELING OF FACE AND GESTURES, 2003, : 45 - 52
  • [8] A practical approach to view-based synergetic pose estimation
    Hogg, T
    Talhami, H
    Rees, D
    Johnson, R
    IEEE TENCON'97 - IEEE REGIONAL 10 ANNUAL CONFERENCE, PROCEEDINGS, VOLS 1 AND 2: SPEECH AND IMAGE TECHNOLOGIES FOR COMPUTING AND TELECOMMUNICATIONS, 1997, : 663 - 666
  • [9] Head pose estimation using view based eigenspaces
    Srinivasan, S
    Boyer, KL
    16TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITON, VOL IV, PROCEEDINGS, 2002, : 302 - 305
  • [10] Monocular head pose estimation
    Martins, Pedro
    Batista, Jorge
    IMAGE ANALYSIS AND RECOGNITION, PROCEEDINGS, 2008, 5112 : 357 - 368