Fine-grained Early Frequency Attention for Deep Speaker Recognition

被引:2
|
作者
Hajavi, Amirhossein [1 ]
Etemad, Ali
机构
[1] Queens Univ, Dept ECE, Kingston, ON, Canada
基金
加拿大自然科学与工程研究理事会;
关键词
Deep Speaker Recognition; Attention; Early Frequency Attention; Robustness to Noise;
D O I
10.1109/IJCNN55064.2022.9892054
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Attention mechanisms have emerged as important tools that boost the performance of deep models by allowing them to focus on key parts of learned embeddings. However, current attention mechanisms used in speaker recognition tasks fail to consider fine-grained information items such as frequency bins in input spectral representations used by the deep networks. To address this issue, we propose the novel Fine-grained Early Frequency Attention (FEFA) for speaker recognition in-the-wild. Once integrated into a deep neural network, our proposed mechanism works by obtaining queries from early layers of the network and generating learnable weights to attend to information items as small as the frequency bins in the input spectral representations. To evaluate the performance of FEFA, we use several well-known deep models as backbone networks and integrate our attention module in their pipelines. The overall performance of these networks (with and without FEFA) are evaluated on the VoxCeleb1 dataset, where we observe considerable improvements when FEFA is used.
引用
收藏
页数:6
相关论文
共 50 条
  • [1] Fine-Grained Early Frequency Attention for Deep Speaker Representation Learning
    Hajavi, Amirhossein
    Etemad, Ali
    [J]. IEEE Transactions on Artificial Intelligence, 2023, 4 (06): : 1413 - 1425
  • [2] Deep LSAC for Fine-Grained Recognition
    Lin, Di
    Wang, Yi
    Liang, Lingyu
    Li, Ping
    Chen, C. L. Philip
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2022, 33 (01) : 200 - 214
  • [3] Fine-Grained Crowdsourcing for Fine-Grained Recognition
    Jia Deng
    Krause, Jonathan
    Li Fei-Fei
    [J]. 2013 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2013, : 580 - 587
  • [4] Fine-Grained Radio Frequency Fingerprint Recognition Network Based on Attention Mechanism
    Zhang, Yulan
    Hu, Jun
    Jiang, Rundong
    Lin, Zengrong
    Chen, Zengping
    [J]. ENTROPY, 2024, 26 (01)
  • [5] Deep Attention-Based Spatially Recursive Networks for Fine-Grained Visual Recognition
    Wu, Lin
    Wang, Yang
    Li, Xue
    Gao, Junbin
    [J]. IEEE TRANSACTIONS ON CYBERNETICS, 2019, 49 (05) : 1791 - 1802
  • [6] Attention Bilinear Pooling for Fine-Grained Facial Expression Recognition
    Liu, Liyuan
    Zhang, Lifeng
    Jia, Shixiang
    [J]. CYBERSPACE SAFETY AND SECURITY, PT II, 2019, 11983 : 535 - 542
  • [7] Summary of Fine-Grained Image Recognition Based on Attention Mechanism
    Yao, Ma
    Min, Zhi
    [J]. THIRTEENTH INTERNATIONAL CONFERENCE ON GRAPHICS AND IMAGE PROCESSING (ICGIP 2021), 2022, 12083
  • [8] Attention cutting and padding learning for fine-grained image recognition
    Zhuo Cheng
    Hongjian Li
    Xiaolin Duan
    Xiangyan Zeng
    Mingxuan He
    Hao Luo
    [J]. Multimedia Tools and Applications, 2021, 80 : 32791 - 32805
  • [9] Fine-Grained Recognition of Mixed Signals with Geometry Coordinate Attention
    Yi, Qingwu
    Wang, Qing
    Zhang, Jianwu
    Zheng, Xiaoran
    Lu, Zetao
    [J]. SENSORS, 2024, 24 (14)
  • [10] Attention cutting and padding learning for fine-grained image recognition
    Cheng, Zhuo
    Li, Hongjian
    Duan, Xiaolin
    Zeng, Xiangyan
    He, Mingxuan
    Luo, Hao
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2021, 80 (21-23) : 32791 - 32805