Gaze estimation via bilinear pooling-based attention networks

被引:2
|
作者
Ren, Dakai [1 ]
Chen, Jiazhong [2 ]
Zhong, Jian [2 ]
Lu, Zhaoming [1 ]
Jia, Tao [2 ]
Li, Zongyi [2 ]
机构
[1] Beijing Univ Posts & Telecommun, Sch Informat & Commun Engn, Beijing, Peoples R China
[2] Huazhong Univ Sci & Technol, Sch Comp Sci & Technol, Wuhan, Peoples R China
关键词
Gaze tracking; Deep learning; Bilinear pooling; Attention;
D O I
10.1016/j.jvcir.2021.103369
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Attention mechanism has been found effective for human gaze estimation, and the attention and diversity of learned features are two important aspects of attention mechanism. However, the traditional attention mechanism used in existing gaze model is more prone to utilize first-order information that is attentive but not diverse. Though the existing bilinear pooling-based attention could overcome the shortcoming of traditional attention, it is limited to extract high-order contextual information. Thus we introduce a novel bilinear poolingbased attention mechanism, which could extract the second-order contextual information by the interaction between local deep learned features. To make the gaze-related features robust for spatial misalignment, we further propose an attention-in-attention method, which consists of a global average pooling and an inner attention on the second-order features. For the purpose of gaze estimation, a new bilinear pooling-based attention networks with attention-in-attention is further proposed. Extensive evaluation shows that our method surpasses the state-of-the-art by a big margin.
引用
收藏
页数:8
相关论文
共 50 条
  • [1] Gaze Estimation via Strip Pooling and Multi-Criss-Cross Attention Networks
    Yan, Chao
    Pan, Weiguo
    Xu, Cheng
    Dai, Songyin
    Li, Xuewei
    [J]. APPLIED SCIENCES-BASEL, 2023, 13 (10):
  • [2] Texture Classification Using Pair-Wise Difference Pooling-Based Bilinear Convolutional Neural Networks
    Dong, Xinghui
    Zhou, Huiyu
    Dong, Junyu
    [J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2020, 29 : 8776 - 8790
  • [3] Attention pooling-based convolutional neural network for sentence modelling
    Er, Meng Joo
    Zhang, Yong
    Wang, Ning
    Pratama, Mahardhika
    [J]. INFORMATION SCIENCES, 2016, 373 : 388 - 403
  • [4] A probabilistic neighbourhood pooling-based attention network for hyperspectral image classification
    Wang, Yuanlin
    Song, Tiecheng
    Xie, Yurui
    Roy, Swalpa Kumar
    [J]. REMOTE SENSING LETTERS, 2022, 13 (01) : 65 - 75
  • [5] Attention Pooling-Based Bidirectional Gated Recurrent Units Model for Sentimental Classification
    Zhang, Dejun
    Hong, Mingbo
    Zou, Lu
    Han, Fei
    He, Fazhi
    Tu, Zhigang
    Ren, Yafeng
    [J]. INTERNATIONAL JOURNAL OF COMPUTATIONAL INTELLIGENCE SYSTEMS, 2019, 12 (02) : 723 - 732
  • [6] Attention Pooling-Based Bidirectional Gated Recurrent Units Model for Sentimental Classification
    Dejun Zhang
    Mingbo Hong
    Lu Zou
    Fei Han
    Fazhi He
    Zhigang Tu
    Yafeng Ren
    [J]. International Journal of Computational Intelligence Systems, 2019, 12 : 723 - 732
  • [7] Pooling-based Visual Transformer with low complexity attention hashing for image retrieval
    Ren, Huan
    Guo, Jiangtao
    Cheng, Shuli
    Li, Yongming
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2024, 241
  • [8] Robust RGB-T Tracking via Graph Attention-Based Bilinear Pooling
    Kang, Bin
    Liang, Dong
    Mei, Junxi
    Tan, Xiaoyang
    Zhou, Quan
    Zhang, Dengyin
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, 34 (12) : 9900 - 9911
  • [9] APDC-Net: Attention Pooling-Based Convolutional Network for Aerial Scene Classification
    Bi, Qi
    Qin, Kun
    Zhang, Han
    Xie, Jiafen
    Li, Zhili
    Xu, Kai
    [J]. IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2020, 17 (09) : 1603 - 1607
  • [10] Pooling-based data interpolation and backdating
    Marcellino, Massimiliano
    [J]. JOURNAL OF TIME SERIES ANALYSIS, 2007, 28 (01) : 53 - 71