A Multi-Gesture Interaction System Using a 3-D Iris Disk Model for Gaze Estimation and an Active Appearance Model for 3-D Hand Pointing

被引:57
|
作者
Reale, Michael J. [1 ]
Canavan, Shaun [1 ]
Yin, Lijun [1 ]
Hu, Kaoning [1 ]
Hung, Terry [2 ]
机构
[1] SUNY Binghamton, Dept Comp Sci, Binghamton, NY 13902 USA
[2] Corning Corp, Taichung 40763, Taiwan
基金
美国国家科学基金会;
关键词
Gaze estimation; hand tracking; human-computer interaction (HCI); TRACKING;
D O I
10.1109/TMM.2011.2120600
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper, we present a vision-based human-computer interaction system, which integrates control components using multiple gestures, including eye gaze, head pose, hand pointing, and mouth motions. To track head, eye, and mouth movements, we present a two-camera system that detects the face from a fixed, wide-angle camera, estimates a rough location for the eye region using an eye detector based on topographic features, and directs another active pan-tilt-zoom camera to focus in on this eye region. We also propose a novel eye gaze estimation approach for point-of-regard (POR) tracking on a viewing screen. To allow for greater head pose freedom, we developed a new calibration approach to find the 3-D eyeball location, eyeball radius, and fovea position. Moreover, in order to get the optical axis, we create a 3-D iris disk by mapping both the iris center and iris contour points to the eyeball sphere. We then rotate the fovea accordingly and compute the final, visual axis gaze direction. This part of the system permits natural, non-intrusive, pose-invariant POR estimation from a distance without resorting to infrared or complex hardware setups. We also propose and integrate a two-camera hand pointing estimation algorithm for hand gesture tracking in 3-D from a distance. The algorithms of gaze pointing and hand finger pointing are evaluated individually, and the feasibility of the entire system is validated through two interactive information visualization applications.
引用
收藏
页码:474 / 486
页数:13
相关论文
共 50 条
  • [1] POINTING WITH THE EYES: GAZE ESTIMATION USING A STATIC/ACTIVE CAMERA SYSTEM AND 3D IRIS DISK MODEL
    Reale, Michael
    Hung, Terry
    Yin, Lijun
    [J]. 2010 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME 2010), 2010, : 280 - 285
  • [2] Gaze Estimation Using 3-D Eyeball Model And Eyelid Shapes
    Han, Sang Yoon
    Hwang, Insung
    Lee, Sang Hwa
    Cho, Nam Ik
    [J]. 2016 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA), 2016,
  • [3] Gaze Estimation Using 3-D Eyeball Model under HMD Circumstance
    Han, Sang Yoon
    Lee, Sang Hwa
    Cho, Nam Ik
    [J]. 2017 IEEE 19TH INTERNATIONAL WORKSHOP ON MULTIMEDIA SIGNAL PROCESSING (MMSP), 2017,
  • [4] A 3-D anthropometric-muscle-based active appearance model
    Cordea, MD
    Petriu, EM
    [J]. IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2006, 55 (01) : 91 - 98
  • [5] 3-D Gaze Estimation by Stereo Gaze Direction
    Pichitwong, Wudthipong
    Chamnongthai, Kosin
    [J]. 2016 13TH INTERNATIONAL CONFERENCE ON ELECTRICAL ENGINEERING/ELECTRONICS, COMPUTER, TELECOMMUNICATIONS AND INFORMATION TECHNOLOGY (ECTI-CON), 2016,
  • [6] A 3-d modeling system using hands gesture
    Ishii, Masahiro
    Yokogawa, Takeshi
    [J]. Kyokai Joho Imeji Zasshi/Journal of the Institute of Image Information and Television Engineers, 2011, 65 (06): : 806 - 810
  • [7] Evaluation of Hand Pointing System Based on 3-D Computer Vision
    Serarinavicius, P.
    Sajauskas, S.
    Daunys, G.
    [J]. ELEKTRONIKA IR ELEKTROTECHNIKA, 2008, (08) : 95 - 98
  • [8] 3-D model
    不详
    [J]. NUCLEAR PLANT JOURNAL, 2000, 18 (01) : 12 - 12
  • [9] Fast and reliable active appearance model search for 3-D face tracking
    Dornaika, F
    Ahlberg, J
    [J]. IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART B-CYBERNETICS, 2004, 34 (04): : 1838 - 1853
  • [10] Real-time hand gesture recognition using pseudo 3-D hidden markov model
    Binh, Nguyen Dang
    Ejima, Toshiaki
    [J]. PROCEEDINGS OF THE FIFTH IEEE INTERNATIONAL CONFERENCE ON COGNITIVE INFORMATICS, VOLS 1 AND 2, 2006, : 820 - 824