Appearance-Based Gaze Estimation Method Using Static Transformer Temporal Differential Network

Cited by: 5
Authors
Li, Yujie [1]
Huang, Longzhao [2]
Chen, Jiahui [2]
Wang, Xiwen [2]
Tan, Benying [1]
Affiliations
[1] Univ Key Lab AI Algorithm Engn, Guilin Univ Elect Technol, Guangxi Coll, Sch Artificial Intelligence, Jinji Rd, Guilin 541004, Peoples R China
[2] Guilin Univ Elect Technol, Sch Artificial Intelligence, Jinji Rd, Guilin 541004, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
gaze estimation; static transformer temporal differential network; static transformer module; temporal differential module; self-attention mechanism;
DOI
10.3390/math11030686
Chinese Library Classification number
O1 [Mathematics];
Discipline classification code
0701 ; 070101 ;
Abstract
Gaze behavior is an important, non-invasive source of information for human-computer interaction, and it plays an important role in many fields, including skills transfer, psychology, and human-computer interaction. Recently, improving the performance of appearance-based gaze estimation with deep learning techniques has attracted increasing attention; however, several key problems in these deep-learning-based methods remain. First, the feature fusion stage is not fully considered: existing methods simply concatenate the different extracted features into one feature, without considering their internal relationships. Second, dynamic features can be difficult to learn, because the extraction process for ambiguously defined dynamic features is unstable. In this study, we propose a novel method that addresses the feature fusion and dynamic feature extraction problems. We propose the static transformer module (STM), which uses a multi-head self-attention mechanism to fuse fine-grained eye features and coarse-grained facial features. Additionally, we propose an innovative recurrent neural network (RNN) cell, the temporal differential module (TDM), which can be used to extract dynamic features. We integrated the STM and the TDM into the static transformer with a temporal differential network (STTDN). We evaluated the STTDN on two publicly available datasets (MPIIFaceGaze and Eyediap) and demonstrated the effectiveness of the STM and the TDM. Our results show that the proposed STTDN outperformed state-of-the-art methods, including on Eyediap (by 2.9%).
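To make the fusion idea in the abstract more concrete, the following is a minimal sketch of multi-head self-attention applied to three feature tokens (left eye, right eye, face). It is not the authors' STM: the learned query, key, value, and output projections of a real transformer layer are omitted (identity projections are assumed), and the function names, token values, and head count are hypothetical choices made for illustration only.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def self_attention(tokens):
    """Scaled dot-product self-attention.

    Assumption: identity projections, so queries, keys, and values
    are the tokens themselves (no learned weights).
    """
    dim = len(tokens[0])
    outputs = []
    for query in tokens:
        scores = [
            sum(q * k for q, k in zip(query, key)) / math.sqrt(dim)
            for key in tokens
        ]
        weights = softmax(scores)
        # Each output is a weighted average of all tokens.
        outputs.append(
            [sum(w * tok[i] for w, tok in zip(weights, tokens)) for i in range(dim)]
        )
    return outputs


def multi_head_fuse(tokens, num_heads):
    """Split each token into num_heads slices, attend per slice, concatenate."""
    dim = len(tokens[0])
    if dim % num_heads != 0:
        raise ValueError("token dimension must be divisible by num_heads")
    head_dim = dim // num_heads
    per_head = []
    for h in range(num_heads):
        part = slice(h * head_dim, (h + 1) * head_dim)
        per_head.append(self_attention([tok[part] for tok in tokens]))
    # Concatenate the head outputs back into one vector per token.
    return [
        [value for head in per_head for value in head[i]]
        for i in range(len(tokens))
    ]


# Hypothetical 4-dimensional features; real features would come from CNN backbones.
left_eye = [0.9, 0.1, 0.0, 0.2]
right_eye = [0.8, 0.2, 0.1, 0.1]
face = [0.1, 0.7, 0.6, 0.3]

fused = multi_head_fuse([left_eye, right_eye, face], num_heads=2)
```

In contrast to plain concatenation, each fused token here is a weighted mixture of all three inputs, with weights that depend on how similar the inputs are to one another. This is the sense in which attention can take the relationships between features into account.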
Pages: 18