A Speaker Verification Method Based on TDNN-LSTMP

被引:6
|
作者
Liu, Hui [1 ,2 ]
Zhao, Longlian [1 ,2 ]
机构
[1] China Agr Univ, Coll Informat & Elect Engn, Beijing, Peoples R China
[2] Minist Educ, Modern Precis Agr Syst Integrat Res Key Lab, Beijing, Peoples R China
关键词
Speaker verification; I-vector; TDNN; LSTM; Short utterances;
D O I
10.1007/s00034-019-01092-3
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
In speaker recognition, a robust recognition method is essential. This paper proposes a speaker verification method that is based on the time-delay neural network (TDNN) and long short-term memory with recurrent project layer (LSTMP) model for the speaker modeling problem in speaker verification. In this work, we present the application of the fusion of TDNN and LSTMP to the i-vector speaker recognition system that is based on the Gaussian mixture model-universal background model. By using a model that can establish long-term dependencies to create a universal background model that contains a larger amount of speaker information, it is possible to extract more feature parameters, which are speaker dependent, from the speech signal. We conducted experiments with this method on four corpora: two in Chinese and two in English. The equal error rate, minimum detection cost function and detection error tradeoff curve are used as criteria for system performance evaluation. The experimental results show that the TDNN-LSTMP/i-vector speaker recognition method outperforms the baseline system on both Chinese and English corpora and has better robustness.
引用
收藏
页码:4840 / 4854
页数:15
相关论文
共 50 条
  • [1] A Speaker Verification Method Based on TDNN–LSTMP
    Hui Liu
    Longlian Zhao
    Circuits, Systems, and Signal Processing, 2019, 38 : 4840 - 4854
  • [2] ECAPA-TDNN: Emphasized Channel Attention, Propagation and Aggregation in TDNN Based Speaker Verification
    Desplanques, Brecht
    Thienpondt, Jenthe
    Demuynck, Kris
    INTERSPEECH 2020, 2020, : 3830 - 3834
  • [3] MACCIF-TDNN: MULTI ASPECT AGGREGATION OF CHANNEL AND CONTEXT INTERDEPENDENCE FEATURES IN TDNN-BASED SPEAKER VERIFICATION
    Wang, Fangyuan
    Song, Zhigang
    Jiang, Hongchen
    Xu, Bo
    2021 IEEE AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING WORKSHOP (ASRU), 2021, : 214 - 219
  • [4] Progressive channel fusion for more efficient TDNN on speaker verification
    Zhao, Zhenduo
    Li, Zhuo
    Wang, Wenchao
    Xu, Ji
    SPEECH COMMUNICATION, 2024, 163
  • [5] DFR-ECAPA: Diffusion Feature Refinement for Speaker Verification Based on ECAPA-TDNN
    Gao, Ya
    Song, Wei
    Zhao, Xiaobing
    Liu, Xiangchun
    PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2023, PT X, 2024, 14434 : 457 - 468
  • [6] Speaker Recognition Based on GMM with an Embedded TDNN
    Chen, Cunbao
    Zhao, Li
    NEURAL INFORMATION PROCESSING, PT 2, PROCEEDINGS, 2009, 5864 : 746 - 753
  • [7] SPEAKER CHARACTERIZATION USING TDNN-LSTM BASED SPEAKER EMBEDDING
    Chen, Chia-Ping
    Zhang, Su-Yu
    Yeh, Chih-Ting
    Wang, Jia-Ching
    Wang, Tenghui
    Huang, Chien-Lin
    2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 6211 - 6215
  • [8] ECAPA plus plus : Fine-grained Deep Embedding Learning for TDNN Based Speaker Verification
    Liu, Bei
    Qian, Yanmin
    INTERSPEECH 2023, 2023, : 3132 - 3136
  • [9] P-vectors: A Parallel-Coupled TDNN/Transformer Network for Speaker Verification
    Wang, Xiyuan
    Wang, Fangyuan
    Xu, Bo
    Xu, Liang
    Xiao, Jing
    INTERSPEECH 2023, 2023, : 3182 - 3186
  • [10] Attention-based factorized TDNN for a noise-robust and spoof-aware speaker verification system
    Benhafid Z.
    Selouani S.A.
    Amrouche A.
    Sidi Yakoub M.
    International Journal of Speech Technology, 2023, 26 (04) : 881 - 894