A Speaker Verification Method Based on TDNN-LSTMP

被引:6
|
作者
Liu, Hui [1 ,2 ]
Zhao, Longlian [1 ,2 ]
机构
[1] China Agr Univ, Coll Informat & Elect Engn, Beijing, Peoples R China
[2] Minist Educ, Modern Precis Agr Syst Integrat Res Key Lab, Beijing, Peoples R China
关键词
Speaker verification; I-vector; TDNN; LSTM; Short utterances;
D O I
10.1007/s00034-019-01092-3
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
In speaker recognition, a robust recognition method is essential. This paper proposes a speaker verification method that is based on the time-delay neural network (TDNN) and long short-term memory with recurrent project layer (LSTMP) model for the speaker modeling problem in speaker verification. In this work, we present the application of the fusion of TDNN and LSTMP to the i-vector speaker recognition system that is based on the Gaussian mixture model-universal background model. By using a model that can establish long-term dependencies to create a universal background model that contains a larger amount of speaker information, it is possible to extract more feature parameters, which are speaker dependent, from the speech signal. We conducted experiments with this method on four corpora: two in Chinese and two in English. The equal error rate, minimum detection cost function and detection error tradeoff curve are used as criteria for system performance evaluation. The experimental results show that the TDNN-LSTMP/i-vector speaker recognition method outperforms the baseline system on both Chinese and English corpora and has better robustness.
引用
收藏
页码:4840 / 4854
页数:15
相关论文
共 50 条
  • [31] Faster speaker enrollment for speaker verification systems based on MLPs by using discriminative cohort speakers method
    Lee, TS
    Choi, SW
    Choi, WH
    Park, HT
    Lim, SS
    Hwang, BW
    DEVELOPMENTS IN APPLIED ARTIFICIAL INTELLIGENCE, 2003, 2718 : 734 - 743
  • [32] Speaker verification with model-based and score-based unsupervised adaptation method
    Wang, Er-Yu
    Guo, Wu
    Li, Yi-Jie
    Dai, Li-Rong
    Wang, Ren-Hua
    Zidonghua Xuebao/ Acta Automatica Sinica, 2009, 35 (03): : 267 - 271
  • [33] A continuous unsupervised adaptation method for speaker verification
    Preti, Alexandre
    Bonastre, Jean-Francois
    Capnian, Francois
    INNOVATIONS IN E-LEARNING, INSTRUCTION TECHNOLOGY, ASSESSMENT, AND ENGINEERING EDUCATION, 2007, : 461 - 465
  • [34] A novel adaptive training method for speaker verification
    Campbell, WM
    Broun, CC
    IC-AI'2001: PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOLS I-III, 2001, : 249 - 253
  • [35] Speaker and session variability in GMM-based speaker verification
    Kenny, Patrick
    Boulianne, Gilles
    Ouellet, Pierre
    Dumouchel, Pierre
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2007, 15 (04): : 1448 - 1460
  • [36] Speaker verification based on wavelet packets
    Ganchev, T
    Siafarikas, M
    Fakotakis, N
    TEXT, SPEECH AND DIALOGUE, PROCEEDINGS, 2004, 3206 : 299 - 306
  • [37] Speaker Verification Based on Fuzzy Classifier
    Dustor, Adam
    MAN-MACHINE INTERACTIONS, 2009, 59 : 389 - 397
  • [38] A HMM/SVM Hybrid Method for Speaker Verification
    Florin, Rastoceanu
    Militaru, Diana
    PROCEEDINGS OF THE 2010 8TH INTERNATIONAL CONFERENCE ON COMMUNICATIONS (COMM), 2010, : 111 - 114
  • [39] A Speaker Verification System Based on EMD
    Tang Lizhen
    Zhou Ping
    Wei Xing
    THIRD INTERNATIONAL CONFERENCE ON GENETIC AND EVOLUTIONARY COMPUTING, 2009, : 553 - 556
  • [40] On the speaker verification using the TESPAR coding method
    Lupu, E
    Fehér, Z
    Pop, PG
    SCS 2003: INTERNATIONAL SYMPOSIUM ON SIGNALS, CIRCUITS AND SYSTEMS, VOLS 1 AND 2, PROCEEDINGS, 2003, : 173 - 176