Gaze point detection by computing the 3D positions and 3D motion of face

被引:0
|
作者
Park, KR [1 ]
Kim, J [1 ]
机构
[1] Yonsei Univ, Dept Elect & Comp Engn, Seodaemoon Gu, Seoul 120749, South Korea
来源
关键词
gaze position; 3D position estimation; 3D motion estimation;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Gaze detection is to locate the position on a monitor screen where a user is looking. In our work, we implement it with a computer vision system setting a single camera above a monitor and a user moves (rotates and/or translates) her fate to gaze at a different position on the monitor. For our case, the user is requested not to move pupils of her eyes when she gazes at a different position on the monitor screen, though we are working on to relax this restriction. To detect the gaze position, we extract facial features (both eyes, nostrils and lip corners) automatically in 2D camera images. From the movement of feature points detected in starting images, we can compute the initial 3D positions of those features by recursive estimation algorithm. Then, when a user moves her head in order to gaze at one position on a monitor, the moved 3D positions of those features can be computed from 3D motion estimation by Iterative Extended Kalman Filter (IEKF) and affine transform. Finally, the gaze position on a monitor is computed from the normal vector of the plane determined by those moved 3D positions of features. Especially in order to obtain the exact 3D positions of initial feature points, we unify three coordinate systems (face, monitor and camera coordinate system) based on perspective transformation. As experimental results, the 3D position estimation error of initial feature paints, which is the RMS error between the estimated initial 3D feature positions and the real positions (measured by 3D position tracker sensor) is about 1.28 cm (0.75 cm in X axis, 0.85 cm in Y axis, 0.6 cm in Z axis) and the 3D motion estimation errors of feature points by Iterative Extended Kalman Filter (IEKF) are about 2.8 degrees and 1.21 cm in rotation and translation, respectively. From that, we can obtain the gaze position on a monitor (17 inches) and the gaze position accuracy between the calculated positions and the real ones is about 2.06 inches of RMS error.
引用
下载
收藏
页码:884 / 894
页数:11
相关论文
共 50 条
  • [21] Disparity-based 3D face modeling for 3D face recognition
    Ansari, A-Nasser
    Abdel-Mottaleb, Mohamed
    Mahoor, Mohammad H.
    2006 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP 2006, PROCEEDINGS, 2006, : 657 - +
  • [22] Multimodal Virtual Point 3D Detection
    Yin, Tianwei
    Zhou, Xingyi
    Krahenbhul, Philipp
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
  • [23] A3FD: Accurate 3D face detection
    Anisetti, Marco
    Bellandi, Valerio
    Damiani, Ernesto
    Arnone, Luigi
    Rat, Benoit
    SIGNAL PROCESSING FOR IMAGE ENHANCEMENT AND MULTIMEDIA PROCESSING, 2008, : 155 - +
  • [24] 3D Face & 3D Ear Recognition: Process and Techniques
    Tharewal, Sumegh
    Gite, Hanumant
    Kale, K., V
    2017 INTERNATIONAL CONFERENCE ON CURRENT TRENDS IN COMPUTER, ELECTRICAL, ELECTRONICS AND COMMUNICATION (CTCEEC), 2017, : 1044 - 1049
  • [25] Estimating 3D motion and position of point target
    Ouyang, GH
    Sun, JX
    Li, H
    Wang, WH
    ULTRAHIGH- AND HIGH-SPEED PHOTOGRAPHY AND IMAGE-BASED MOTION MEASUREMENT, 1997, 3173 : 386 - 394
  • [26] Gaze Estimation based on 3D Face Structure and Pupil Centers
    Xiong, Chunshui
    Huang, Lei
    Liu, Changping
    2014 22ND INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2014, : 1156 - 1161
  • [27] 3D Face Reconstruction and Gaze Tracking in the HMD for Virtual Interaction
    Chen, Shu-Yu
    Lai, Yu-Kun
    Xia, Shihong
    Rosin, Paul L.
    Gao, Lin
    IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 25 : 3166 - 3179
  • [28] Disparity-based 3D face modeling using 3D deformable facial mask for 3D face recognition
    Ansari, A-Nasser
    Abdel-Mottaleb, Mohamed
    Mahoor, Mohammad H.
    2006 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO - ICME 2006, VOLS 1-5, PROCEEDINGS, 2006, : 981 - +
  • [29] 3D gaze estimation and interaction
    Ki, Jeongseok
    Kwon, Yong-Moo
    2008 3DTV-CONFERENCE: THE TRUE VISION - CAPTURE, TRANSMISSION AND DISPLAY OF 3D VIDEO, 2008, : 353 - 356
  • [30] 3D face recognition
    Beumier, C
    CIHSPS 2004: PROCEEDINGS OF THE 2004 IEEE INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE FOR HOMELAND SECURITY AND PERSONAL SAFETY, 2004, : 93 - 96