Appearance-Based Gaze Estimation Method Using Static Transformer Temporal Differential Network

Cited by: 4
Authors
Li, Yujie [1]
Huang, Longzhao [2]
Chen, Jiahui [2]
Wang, Xiwen [2]
Tan, Benying [1]
Affiliations
[1] Univ Key Lab AI Algorithm Engn, Guilin Univ Elect Technol, Guangxi Coll, Sch Artificial Intelligence, Jinji Rd, Guilin 541004, Peoples R China
[2] Guilin Univ Elect Technol, Sch Artificial Intelligence, Jinji Rd, Guilin 541004, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
gaze estimation; static transformer temporal differential network; static transformer module; temporal differential module; self-attention mechanism
DOI
10.3390/math11030686
Chinese Library Classification (CLC) number
O1 [Mathematics]
Discipline classification codes
0701; 070101
Abstract
Gaze behavior is an important, non-invasive source of human-computer interaction information that plays an important role in many fields, including skills transfer, psychology, and human-computer interaction. Recently, improving the performance of appearance-based gaze estimation using deep learning techniques has attracted increasing attention; however, several key problems in these deep-learning-based gaze estimation methods remain. First, the feature fusion stage is not fully considered: existing methods simply concatenate the different extracted features into one feature, without considering their internal relationships. Second, dynamic features can be difficult to learn, because the extraction process for ambiguously defined dynamic features is unstable. In this study, we propose a novel method that addresses both the feature fusion and the dynamic feature extraction problems. We propose the static transformer module (STM), which uses a multi-head self-attention mechanism to fuse fine-grained eye features and coarse-grained facial features. Additionally, we propose an innovative recurrent neural network (RNN) cell, the temporal differential module (TDM), which can be used to extract dynamic features. We integrated the STM and the TDM into the static transformer with a temporal differential network (STTDN). We evaluated the STTDN on two publicly available datasets (MPIIFaceGaze and Eyediap) and demonstrated the effectiveness of the STM and the TDM. Our results show that the proposed STTDN outperformed state-of-the-art methods, including on Eyediap (by 2.9%).
Pages: 18