A Speaker Verification Method Based on TDNN-LSTMP

被引:6
|
作者
Liu, Hui [1 ,2 ]
Zhao, Longlian [1 ,2 ]
机构
[1] China Agr Univ, Coll Informat & Elect Engn, Beijing, Peoples R China
[2] Minist Educ, Modern Precis Agr Syst Integrat Res Key Lab, Beijing, Peoples R China
关键词
Speaker verification; I-vector; TDNN; LSTM; Short utterances;
D O I
10.1007/s00034-019-01092-3
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
In speaker recognition, a robust recognition method is essential. This paper proposes a speaker verification method that is based on the time-delay neural network (TDNN) and long short-term memory with recurrent project layer (LSTMP) model for the speaker modeling problem in speaker verification. In this work, we present the application of the fusion of TDNN and LSTMP to the i-vector speaker recognition system that is based on the Gaussian mixture model-universal background model. By using a model that can establish long-term dependencies to create a universal background model that contains a larger amount of speaker information, it is possible to extract more feature parameters, which are speaker dependent, from the speech signal. We conducted experiments with this method on four corpora: two in Chinese and two in English. The equal error rate, minimum detection cost function and detection error tradeoff curve are used as criteria for system performance evaluation. The experimental results show that the TDNN-LSTMP/i-vector speaker recognition method outperforms the baseline system on both Chinese and English corpora and has better robustness.
引用
收藏
页码:4840 / 4854
页数:15
相关论文
共 50 条
  • [41] MFA: TDNN WITH MULTI-SCALE FREQUENCY-CHANNEL ATTENTION FOR TEXT-INDEPENDENT SPEAKER VERIFICATION WITH SHORT UTTERANCES
    Liu, Tianchi
    Das, Rohan Kumar
    Lee, Kong Aik
    Li, Haizhou
    2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 7517 - 7521
  • [42] An SIPCA-WCCN method for SVM-based speaker verification system
    Long, Yanhua
    Guo, Wu
    Dai, Lirong
    2008 INTERNATIONAL CONFERENCE ON AUDIO, LANGUAGE AND IMAGE PROCESSING, VOLS 1 AND 2, PROCEEDINGS, 2008, : 1295 - 1299
  • [43] Speaker verification method based on cross-domain attentive feature fusion
    Yang Z.
    Wang T.
    Guo H.
    Wang T.
    Tongxin Xuebao/Journal on Communications, 2023, 44 (08): : 89 - 98
  • [44] A New Speaker Verification Method with Global Speaker Model and Likelihood Score Normalization
    张怡颖
    朱小燕
    张钹
    Journal of Computer Science and Technology, 2000, (02) : 184 - 193
  • [45] A new speaker verification method with global speaker model and likelihood score normalization
    Zhang, YY
    Zhu, XY
    Zhang, B
    JOURNAL OF COMPUTER SCIENCE AND TECHNOLOGY, 2000, 15 (02) : 184 - 193
  • [46] A new speaker verification method with global speaker model and likelihood score normalization
    Yiying Zhang
    Xiaoyan Zhu
    Zhang Bo
    Journal of Computer Science and Technology, 2000, 15 : 184 - 193
  • [47] Speaker verification based on combining speaker individuality parameter selection and decision
    Ma, CY
    Lee, CH
    2005 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING (ASRU), 2005, : 71 - 74
  • [48] SPEAKER VERIFICATION
    CHAPMAN, WD
    LI, KP
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1966, 40 (05): : 1282 - &
  • [49] Lhasa Dialect Recognition of Different Phonemes Based on TDNN Method
    Khysru, Kuntharrgyal
    Qie, Yangzhuoma
    Shi, Haiqiang
    Sun, Qilong
    Wei, Jianguo
    ARTIFICIAL INTELLIGENCE AND SECURITY, ICAIS 2022, PT II, 2022, 13339 : 150 - 160
  • [50] Speaker verification
    Atkins, Wendy
    Biometric Technology Today, 2001, 9 (03) : 8 - 11