Appearance-based Gaze Estimation with Multi-Modal Convolutional Neural Networks

被引:0
|
作者
Wang, Fei [1 ]
Wang, Yan [2 ]
Li, Teng [1 ]
机构
[1] Anhui Univ, Sch Elect Engn & Automat, Lab Intelligent Comp & Signal Proc, Minist Educ, Hefei 230601, Peoples R China
[2] Anhui Univ, Sch Comp Sci & Technol, Hefei, Peoples R China
基金
国家重点研发计划;
关键词
Gaze estimation; Multi-region fusion; Deep learning;
D O I
10.1117/12.2603762
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Existing methods on appearance-based gaze estimation mostly regress gaze direction from eye images, neglecting facial information and head pose which can be much helpful. In this paper, we propose a robust appearance-based gaze estimation method that regresses gaze directions jointly from human face and eye. The face and eye regions are located based on the detected landmark points, and representations of the two modalities are modeled with the convolutional neural networks (CNN), which are finally combined for gaze estimation by a fused network. Furthermore, considering the various impact of different facial regions on human gaze, the spatial weights for facial area are learned automatically with an attention mechanism and are applied to refine the facial representation. Experimental results validate the benefits of fusing multiple modalities in gaze estimation on the Eyediap benchmark dataset, and the propose method can yield better performance to previous advanced methods.
引用
收藏
页数:8
相关论文
共 50 条
  • [1] Multi-Modal Depth Estimation Using Convolutional Neural Networks
    Siddiqui, Sadique Adnan
    Vierling, Axel
    Berns, Karsten
    [J]. 2020 IEEE INTERNATIONAL SYMPOSIUM ON SAFETY, SECURITY, AND RESCUE ROBOTICS (SSRR 2020), 2020, : 354 - 359
  • [2] Appearance-based eye gaze estimation
    Tan, KH
    Kriegman, DJ
    Ahuja, N
    [J]. SIXTH IEEE WORKSHOP ON APPLICATIONS OF COMPUTER VISION, PROCEEDINGS, 2002, : 191 - 195
  • [3] Gaze-Net: Appearance-Based Gaze Estimation using Capsule Networks
    Mahanama, Bhanuka
    Jayawardana, Yasith
    Jayarathna, Sampath
    [J]. AUGMENTED HUMAN 2020: PROCEEDINGS OF THE 11TH AUGMENTED HUMAN INTERNATIONAL CONFERENCE, 2020,
  • [4] Appearance-Based Gaze Estimation in the Wild
    Zhang, Xucong
    Sugano, Yusuke
    Fritz, Mario
    Bulling, Andreas
    [J]. 2015 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2015, : 4511 - 4520
  • [5] Multi-Modal Convolutional Neural Networks for Activity Recognition
    Ha, Sojeong
    Yun, Jeong-Min
    Choi, Seungjin
    [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC 2015): BIG DATA ANALYTICS FOR HUMAN-CENTRIC SYSTEMS, 2015, : 3017 - 3022
  • [6] Appearance-Based Gaze Estimation for ASD Diagnosis
    Li, Jing
    Chen, Zejin
    Zhong, Yihao
    Lam, Hak-Keung
    Han, Junxia
    Ouyang, Gaoxiang
    Li, Xiaoli
    Liu, Honghai
    [J]. IEEE TRANSACTIONS ON CYBERNETICS, 2022, 52 (07) : 6504 - 6517
  • [7] Appearance-based Gaze Estimation using Kinect
    Choi, Jinsoo
    Ahn, Byungtae
    Park, Jaesik
    Kweon, In So
    [J]. 2013 10TH INTERNATIONAL CONFERENCE ON UBIQUITOUS ROBOTS AND AMBIENT INTELLIGENCE (URAI), 2013, : 260 - 261
  • [8] Appearance-Based Gaze Estimation for Driver Monitoring
    Nikan, Soodeh
    Upadhyay, Devesh
    [J]. GAZE MEETS MACHINE LEARNING WORKSHOP, VOL 210, 2022, 210 : 127 - 139
  • [9] Learning Confidence Measures by Multi-modal Convolutional Neural Networks
    Fu, Zehua
    Ardabilian Fard, Mohsen
    [J]. 2018 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2018), 2018, : 1321 - 1330
  • [10] Multi-modal page stream segmentation with convolutional neural networks
    Wiedemann, Gregor
    Heyer, Gerhard
    [J]. LANGUAGE RESOURCES AND EVALUATION, 2021, 55 (01) : 127 - 150