Appearance-based gaze estimation with feature fusion of multi-level information elements

被引:2
|
作者
Ren, Zhonghe [1 ]
Fang, Fengzhou [1 ,2 ]
Hou, Gaofeng [1 ]
Li, Zihao [1 ]
Niu, Rui [1 ]
机构
[1] Tianjin Univ, Lab Micro Nano Mfg Technol MNMT, State Key Lab Precis Measuring Technol & Instrumen, Tianjin 300072, Peoples R China
[2] Univ Coll Dublin, Ctr Micro Nano Mfg Technol MNMT Dublin, Dublin, Ireland
基金
中国国家自然科学基金;
关键词
gaze estimation; feature extraction; feature fusion; deep learning; artificial intelligence; EYE-GAZE; ARTIFICIAL-INTELLIGENCE; ATTENTION; TRACKING; DATASET;
D O I
10.1093/jcde/qwad038
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Gaze estimation is a fundamental task in many applications of cognitive sciences, human-computer interaction, and robotics. The purely data-driven appearance-based gaze estimation methods may suffer from a lack of interpretability, which prevents their applicability to pervasive scenarios. In this study, a feature fusion method with multi-level information elements is proposed to improve the comprehensive performance of the appearance-based gaze estimation model. The multi-level feature extraction and expression are carried out from the originally captured images, and a multi-level information element matrix is established. A gaze conduction principle is formulated for reasonably fusing information elements from the established matrix. According to the gaze conduction principle along with the matrix, a multi-level information element fusion (MIEF) model for gaze estimation is proposed. Then, several input modes and network structures of the MIEF model are designed, and a series of grouping experiments are carried out on a small-scale sub-dataset. Furthermore, the optimized input modes and network structures of the MIEF model are selected for training and testing on the whole dataset to verify and compare model performance. Experimental results show that optimizing the feature combination in the input control module and fine-tuning the computational architecture in the feature extraction module can improve the performance of the gaze estimation model, which would enable the reduction of the model by incorporating the critical features and thus improve the performance and accessibility of the method. Compared with the reference baseline, the optimized model based on the proposed feature fusion method of multi-level information elements can achieve efficient training and improve the test accuracy in the verification experiment. The average error is 1.63 cm on phones on the GazeCapture dataset, which achieves comparable accuracy with state-of-the-art methods.
引用
收藏
页码:1080 / 1109
页数:30
相关论文
共 50 条
  • [1] Benefits of temporal information for appearance-based gaze estimation
    Palmero, Cristina
    Komogortsev, Oleg V.
    Talathi, Sachin S.
    [J]. ETRA 2020 SHORT PAPERS: ACM SYMPOSIUM ON EYE TRACKING RESEARCH & APPLICATIONS, 2020,
  • [2] Appearance-based eye gaze estimation
    Tan, KH
    Kriegman, DJ
    Ahuja, N
    [J]. SIXTH IEEE WORKSHOP ON APPLICATIONS OF COMPUTER VISION, PROCEEDINGS, 2002, : 191 - 195
  • [3] Appearance-Based Gaze Estimation in the Wild
    Zhang, Xucong
    Sugano, Yusuke
    Fritz, Mario
    Bulling, Andreas
    [J]. 2015 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2015, : 4511 - 4520
  • [4] Head Pose Estimation Based on Multi-Level Feature Fusion
    Yan, Chunman
    Zhang, Xiao
    [J]. INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2024, 38 (02)
  • [5] Appearance-Based Gaze Estimation for ASD Diagnosis
    Li, Jing
    Chen, Zejin
    Zhong, Yihao
    Lam, Hak-Keung
    Han, Junxia
    Ouyang, Gaoxiang
    Li, Xiaoli
    Liu, Honghai
    [J]. IEEE TRANSACTIONS ON CYBERNETICS, 2022, 52 (07) : 6504 - 6517
  • [6] Appearance-based Gaze Estimation using Kinect
    Choi, Jinsoo
    Ahn, Byungtae
    Park, Jaesik
    Kweon, In So
    [J]. 2013 10TH INTERNATIONAL CONFERENCE ON UBIQUITOUS ROBOTS AND AMBIENT INTELLIGENCE (URAI), 2013, : 260 - 261
  • [7] Appearance-Based Gaze Estimation for Driver Monitoring
    Nikan, Soodeh
    Upadhyay, Devesh
    [J]. GAZE MEETS MACHINE LEARNING WORKSHOP, VOL 210, 2022, 210 : 127 - 139
  • [8] Searching Efficient Neural Architecture with Multi-resolution Fusion Transformer for Appearance-based Gaze Estimation
    Nagpure, Vikrant
    Okuma, Kenji
    [J]. 2023 IEEE/CVF WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2023, : 890 - 899
  • [9] Appearance-based Gaze Estimation with Multi-Modal Convolutional Neural Networks
    Wang, Fei
    Wang, Yan
    Li, Teng
    [J]. INTERNATIONAL SYMPOSIUM ON ARTIFICIAL INTELLIGENCE AND ROBOTICS 2021, 2021, 11884
  • [10] Offset Calibration for Appearance-Based Gaze Estimation via Gaze Decomposition
    Chen, Zhaokang
    Shi, Bertram E.
    [J]. 2020 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2020, : 259 - 268