Unconstrained vocal pattern recognition algorithm based on attention mechanism

Cited by: 1
Authors
Li, Yaqian [1]
Zhang, Xiaolong [2]
Zhang, Xuyao [3]
Li, Haibin [1]
Zhang, Wenming [4]
Affiliations
[1] Yanshan Univ, Pattern Recognized, Elect Engn, Qinhuangdao, Hebei, Peoples R China
[2] Yanshan Univ, Speaker Diarizat, Elect Engn, Qinhuangdao, Hebei, Peoples R China
[3] Yanshan Univ, Speaker Verificat, Elect Engn, Qinhuangdao, Hebei, Peoples R China
[4] Yanshan Univ, Camera Calibrat, Elect Engn, Qinhuangdao, Hebei, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
Voiceprint recognition; Unconstrained datasets; Attention mechanism; Feature fusion;
DOI
10.1016/j.dsp.2023.103973
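Chinese Library Classification (CLC)
TM [Electrical engineering]; TN [Electronics and communication technology];
Discipline classification code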
0808 ; 0809 ;
Abstract
Deep learning-based voiceprint recognition methods rely heavily on adequate datasets, especially complex datasets collected under unconstrained conditions that are close to the natural environment. However, the data types in current open-source speech datasets are too homogeneous, and they differ from the speech collected in natural application environments. Because few Chinese datasets are available, this paper proposes and produces an unconstrained Chinese speech dataset with richer data types that is closer to speech collected in a natural environment. To address the inadequate extraction of acoustic features from the unconstrained speech dataset, a new two-dimensional convolutional residual network structure based on the attention mechanism is designed and applied to acoustic feature extraction. The residual block structure in the residual network is improved with the SE module and the CBAM module to obtain the SE-Cov2d and CSA-Cov2d models, respectively. Finally, experiments demonstrate that the attention mechanism helps the network focus on more critical feature information and fuse more discriminative features during feature extraction. (c) 2023 Elsevier Inc. All rights reserved.
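As an illustration of the channel attention that the abstract refers to as the SE module, the following is a minimal sketch of a squeeze-and-excitation step followed by a residual shortcut. It is not the authors' SE-Cov2d implementation: the function names `se_rescale` and `residual_add`, the plain nested-list tensor layout, the reduction to a single hidden unit, and the random weights are all assumptions made only for this example.

```python
import math
import random


def se_rescale(feature_map, w1, w2):
    """Squeeze-and-excitation over a feature map stored as C channels of H x W values.

    w1 has shape (C // r, C) and w2 has shape (C, C // r). Both are
    hypothetical weights; the paper's trained parameters are not given.
    """
    # Squeeze: global average pooling yields one descriptor per channel.
    squeezed = [
        sum(sum(row) for row in channel) / (len(channel) * len(channel[0]))
        for channel in feature_map
    ]
    # Excitation: bottleneck layer with ReLU, then expansion layer with sigmoid.
    hidden = [max(0.0, sum(w * s for w, s in zip(row, squeezed))) for row in w1]
    gates = [
        1.0 / (1.0 + math.exp(-sum(w * h for w, h in zip(row, hidden))))
        for row in w2
    ]
    # Scale: multiply every value in a channel by that channel's gate.
    scaled = [
        [[gate * value for value in row] for row in channel]
        for channel, gate in zip(feature_map, gates)
    ]
    return scaled, gates


def residual_add(identity, branch):
    """Element-wise shortcut connection of a residual block."""
    return [
        [[a + b for a, b in zip(row_i, row_b)] for row_i, row_b in zip(ch_i, ch_b)]
        for ch_i, ch_b in zip(identity, branch)
    ]


# Usage with made-up data: 4 channels of 3 x 3 values, reduction ratio r = 4.
random.seed(0)
channels, height, width, reduced = 4, 3, 3, 1
features = [
    [[random.random() for _ in range(width)] for _ in range(height)]
    for _ in range(channels)
]
w1 = [[random.uniform(-1, 1) for _ in range(channels)] for _ in range(reduced)]
w2 = [[random.uniform(-1, 1) for _ in range(reduced)] for _ in range(channels)]

attended, gates = se_rescale(features, w1, w2)
output = residual_add(features, attended)
print("per-channel gates:", gates)
```

Each gate lies strictly between 0 and 1, so the block can only attenuate a channel relative to the others; the shortcut then adds the unscaled input back. A CBAM-style block would additionally apply a spatial attention map after the channel gates, which this sketch omits.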
Pages: 8