Frequency-spatial interaction network for gaze estimation

被引:0
|
作者
Jia, Yuanning [1 ,3 ]
Liu, Zhi [1 ,3 ]
Lv, Ying [1 ,3 ]
Lu, Xiaofeng [1 ,3 ]
Liu, Xuefeng [1 ,3 ]
Chen, Jie [2 ]
机构
[1] Shanghai Institute for Advanced Communication and Data Science, School of Communication and Information Engineering, Shanghai University, Shanghai,200444, China
[2] Wenzhou Third Clinical Institute Affiliated to Wenzhou Medical University, the Third Affiliated Hospital of Shanghai University (Wenzhou People's Hospital), Wenzhou,325000, China
[3] Wenzhou Institute of Shanghai University, Wenzhou,325000, China
基金
中国国家自然科学基金;
关键词
Adaptive filtering - Adaptive filters - Frequency domain analysis;
D O I
10.1016/j.displa.2024.102878
中图分类号
学科分类号
摘要
Gaze estimation is a fundamental task in the field of computer vision, which determines the direction a person is looking at. With advancements in Convolutional Neural Networks (CNNs) and the availability of large-scale datasets, appearance-based models have made significant progress. Nonetheless, CNNs exhibit limitations in extracting global information from features, resulting in a constraint on gaze estimation performance. Inspired by the properties of the Fourier transform in signal processing, we propose the Frequency-Spatial Interaction network for Gaze estimation (FSIGaze), which integrates residual modules and Frequency-Spatial Synergistic (FSS) modules. To be specific, its FSS module is a dual-branch structure with a spatial branch and a frequency branch. The frequency branch employs Fast Fourier Transformation to transfer a latent representation to the frequency domain and applies adaptive frequency filter to achieve an image-size receptive field. The spatial branch, on the other hand, can extract local detailed features. Acknowledging the synergistic benefits of global and local information in gaze estimation, we introduce a Dual-domain Interaction Block (DIB) to enhance the capability of the model. Furthermore, we implement a multi-task learning strategy, incorporating eye region detection as an auxiliary task to refine facial features. Extensive experiments demonstrate that our model surpasses other state-of-the-art gaze estimation models on three three-dimensional (3D) datasets and delivers competitive results on two two-dimensional (2D) datasets. © 2024 Elsevier B.V.
引用
收藏
相关论文
共 50 条
  • [1] FVFSNet: Frequency-Spatial Coupling Network for Finger Vein Authentication
    Huang, Junduan
    Zheng, An
    Shakeel, M. Saad
    Yang, Weili
    Kang, Wenxiong
    [J]. IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2023, 18 : 1322 - 1334
  • [2] FFENet: frequency-spatial feature enhancement network for clothing classification
    Yu, Feng
    Li, Huiyin
    Shi, Yankang
    Tang, Guangyu
    Chen, Zhaoxiang
    Jiang, Minghua
    [J]. PEERJ COMPUTER SCIENCE, 2023, 9
  • [3] Multiple frequency-spatial network for RGBT tracking in the presence of motion blur
    Fan, Shenghua
    Chen, Xi
    He, Chu
    Yu, Lei
    Mao, Zhongjie
    Zheng, Yujin
    [J]. NEURAL COMPUTING & APPLICATIONS, 2023, 35 (34): : 24389 - 24406
  • [4] Photonics-assisted Frequency and DOA estimation via Frequency-Spatial Compressed Sensing
    Zhang, Tieliang
    Li, Yang
    Zou, Xihua
    Pan, Wei
    Yan, Lianshan
    Yang, Bo
    Chi, Hao
    [J]. 2022 ASIA COMMUNICATIONS AND PHOTONICS CONFERENCE, ACP, 2022, : 1764 - 1768
  • [5] Joint Frequency-Spatial Domain Network for Remote Sensing Optical Image Change Detection
    Zhou, Yuan
    Feng, Yanjie
    Huo, Shuwei
    Li, Xiaofeng
    [J]. IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
  • [6] Frequency-spatial beamformer for MEG source localization
    Thompson, Elizabeth A.
    Xiang, Jing
    Wang, Yingying
    [J]. BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2015, 18 : 263 - 273
  • [7] UHD Underwater Image Enhancement via Frequency-Spatial Domain Aware Network
    Wei, Yiwen
    Zheng, Zhuoran
    Jia, Xiuyi
    [J]. COMPUTER VISION - ACCV 2022, PT III, 2023, 13843 : 21 - 36
  • [8] Promoting camouflaged object detection through novel edge–target interaction and frequency-spatial fusion
    Guan, Juwei
    Qian, Weiqi
    Zhu, Tongxin
    Fang, Xiaolin
    [J]. Neurocomputing, 2025, 617
  • [9] Frequency-spatial domain based salient region detection
    Sun, Xiaoliang
    Zhu, Zunshang
    Liu, Xiaolin
    Shang, Yang
    Yu, Qifeng
    [J]. OPTIK, 2015, 126 (9-10): : 942 - 949
  • [10] Photonic-assisted Multi-antenna Frequency-Spatial Compressed Sensing Array for Joint Frequency and DOA Estimation
    Zhang T.
    Li Y.
    Yang B.
    Pan W.
    Yan L.
    Chi H.
    Zou X.
    [J]. Journal of Lightwave Technology, 2024, 42 (21) : 1 - 8