Appearance-Based Gaze Estimation Method Using Static Transformer Temporal Differential Network

被引:5
|
作者
Li, Yujie [1 ]
Huang, Longzhao [2 ]
Chen, Jiahui [2 ]
Wang, Xiwen [2 ]
Tan, Benying [1 ]
机构
[1] Univ Key Lab AI Algorithm Engn, Guilin Univ Elect Technol, Guangxi Coll, Sch Artificial Intelligence, Jinji Rd, Guilin 541004, Peoples R China
[2] Guilin Univ Elect Technol, Sch Artificial Intelligence, Jinji Rd, Guilin 541004, Peoples R China
基金
中国国家自然科学基金;
关键词
gaze estimation; static transformer temporal differential network; static transformer module; temporal differential module; self-attention mechanism;
D O I
10.3390/math11030686
中图分类号
O1 [数学];
学科分类号
0701 ; 070101 ;
摘要
Gaze behavior is important and non-invasive human-computer interaction information that plays an important role in many fields-including skills transfer, psychology, and human-computer interaction. Recently, improving the performance of appearance-based gaze estimation, using deep learning techniques, has attracted increasing attention: however, several key problems in these deep-learning-based gaze estimation methods remain. Firstly, the feature fusion stage is not fully considered: existing methods simply concatenate the different obtained features into one feature, without considering their internal relationship. Secondly, dynamic features can be difficult to learn, because of the unstable extraction process of ambiguously defined dynamic features. In this study, we propose a novel method to consider feature fusion and dynamic feature extraction problems. We propose the static transformer module (STM), which uses a multi-head self-attention mechanism to fuse fine-grained eye features and coarse-grained facial features. Additionally, we propose an innovative recurrent neural network (RNN) cell-that is, the temporal differential module (TDM)-which can be used to extract dynamic features. We integrated the STM and the TDM into the static transformer with a temporal differential network (STTDN). We evaluated the STTDN performance, using two publicly available datasets (MPIIFaceGaze and Eyediap), and demonstrated the effectiveness of the STM and the TDM. Our results show that the proposed STTDN outperformed state-of-the-art methods, including that of Eyediap (by 2.9%).
引用
收藏
页数:18
相关论文
共 50 条
  • [21] Appearance-Based Gaze Estimation via Uncalibrated Gaze Pattern Recovery
    Lu, Feng
    Chen, Xiaowu
    Sato, Yoichi
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2017, 26 (04) : 1543 - 1553
  • [22] Adaptive Linear Regression for Appearance-Based Gaze Estimation
    Lu, Feng
    Sugano, Yusuke
    Okabe, Takahiro
    Sato, Yoichi
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2014, 36 (10) : 2033 - 2046
  • [23] Unsupervised Outlier Detection in Appearance-Based Gaze Estimation
    Chen, Zhaokang
    Deng, Didan
    Pi, Jimin
    Shi, Bertram E.
    2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW), 2019, : 1088 - 1097
  • [24] Appearance-based gaze estimation using deep features and random forest regression
    Wang, Yafei
    Shen, Tianyi
    Yuan, Guoliang
    Bian, Jiming
    Fu, Xianping
    KNOWLEDGE-BASED SYSTEMS, 2016, 110 : 293 - 301
  • [25] Searching Efficient Neural Architecture with Multi-resolution Fusion Transformer for Appearance-based Gaze Estimation
    Nagpure, Vikrant
    Okuma, Kenji
    2023 IEEE/CVF WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2023, : 890 - 899
  • [26] Appearance-Based Gaze Block Estimation via CNN Classification
    Wu, Xuemei
    Li, Jing
    Wu, Qiang
    Sun, Jiande
    2017 IEEE 19TH INTERNATIONAL WORKSHOP ON MULTIMEDIA SIGNAL PROCESSING (MMSP), 2017,
  • [27] Appearance-based gaze estimation under slight head motion
    Zhizhi Guo
    Qianxiang Zhou
    Zhongqi Liu
    Multimedia Tools and Applications, 2017, 76 : 2203 - 2222
  • [28] Manifold Alignment for Person Independent Appearance-based Gaze Estimation
    Schneider, Timo
    Schauerte, Boris
    Stiefelhagen, Rainer
    2014 22ND INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2014, : 1167 - 1172
  • [29] Appearance-based gaze estimation under slight head motion
    Guo, Zhizhi
    Zhou, Qianxiang
    Liu, Zhongqi
    MULTIMEDIA TOOLS AND APPLICATIONS, 2017, 76 (02) : 2203 - 2222
  • [30] Appearance-Based Gaze Estimation With Deep Learning: A Review and Benchmark
    Cheng, Yihua
    Wang, Haofei
    Bao, Yiwei
    Lu, Feng
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2024, 46 (12) : 7509 - 7528