A Speaker Verification Method Based on TDNN-LSTMP

Cited by: 6
Authors
Liu, Hui [1, 2]
Zhao, Longlian [1, 2]
Affiliations
[1] China Agr Univ, Coll Informat & Elect Engn, Beijing, Peoples R China
[2] Minist Educ, Modern Precis Agr Syst Integrat Res Key Lab, Beijing, Peoples R China
Keywords
Speaker verification; I-vector; TDNN; LSTM; Short utterances;
DOI
10.1007/s00034-019-01092-3
Chinese Library Classification
TM [Electrical engineering]; TN [Electronics and communication technology]
Discipline classification codes
0808; 0809
Abstract
Speaker recognition requires a robust recognition method. This paper proposes a speaker verification method that addresses the speaker modeling problem by combining a time-delay neural network (TDNN) with a long short-term memory model with a recurrent projection layer (LSTMP). We apply the fusion of TDNN and LSTMP to the i-vector speaker recognition system based on the Gaussian mixture model-universal background model. Because the fused model can capture long-term dependencies, the resulting universal background model contains more speaker information, which makes it possible to extract more speaker-dependent feature parameters from the speech signal. We evaluated the method on four corpora, two in Chinese and two in English, using the equal error rate, the minimum detection cost function and the detection error tradeoff curve as performance criteria. The experimental results show that the TDNN-LSTMP/i-vector method outperforms the baseline system on both the Chinese and the English corpora and is more robust.
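The two scalar criteria named in the abstract can be illustrated with a minimal sketch. It is not the authors' evaluation code: the function names, the toy scores, and the cost parameters (`p_target`, `c_miss`, `c_fa`) are assumptions chosen for illustration, since the abstract does not state the operating point used. The equal error rate is approximated by sweeping thresholds over the observed scores, and the detection cost is normalized by the cost of the best trivial (always-accept or always-reject) system.

```python
def error_rates(target_scores, nontarget_scores, threshold):
    """Return (false rejection rate, false acceptance rate).

    A trial is accepted when its score is >= threshold.
    """
    frr = sum(s < threshold for s in target_scores) / len(target_scores)
    far = sum(s >= threshold for s in nontarget_scores) / len(nontarget_scores)
    return frr, far


def equal_error_rate(target_scores, nontarget_scores):
    """Approximate the EER by sweeping thresholds over the observed scores.

    Picks the threshold where |FRR - FAR| is smallest and returns the
    mean of the two rates at that point.
    """
    thresholds = sorted(set(target_scores) | set(nontarget_scores))
    rates = [error_rates(target_scores, nontarget_scores, t) for t in thresholds]
    frr, far = min(rates, key=lambda r: abs(r[0] - r[1]))
    return (frr + far) / 2


def min_dcf(target_scores, nontarget_scores, p_target=0.01, c_miss=1.0, c_fa=1.0):
    """Minimum normalized detection cost over all thresholds.

    p_target, c_miss and c_fa are illustrative assumptions, not values
    taken from the paper.
    """
    thresholds = sorted(set(target_scores) | set(nontarget_scores))
    thresholds.append(thresholds[-1] + 1.0)  # extra threshold that rejects every trial
    costs = []
    for t in thresholds:
        frr, far = error_rates(target_scores, nontarget_scores, t)
        costs.append(c_miss * p_target * frr + c_fa * (1 - p_target) * far)
    # Normalize by the cost of the better trivial system.
    return min(costs) / min(c_miss * p_target, c_fa * (1 - p_target))


# Toy scores (hypothetical): higher means "more likely the same speaker".
targets = [0.9, 0.8, 0.7, 0.4]
nontargets = [0.6, 0.3, 0.2, 0.1]
print(equal_error_rate(targets, nontargets))
print(min_dcf(targets, nontargets, p_target=0.5))
```

A detection error tradeoff curve plots the same (FAR, FRR) pairs produced by `error_rates` across all thresholds, usually on normal-deviate axes.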
Pages: 4840-4854
Number of pages: 15
Related papers
50 in total
  • [21] A TRANSFER LEARNING METHOD FOR PLDA-BASED SPEAKER VERIFICATION
    Hong, Qingyang
    Zhang, Jun
    Li, Lin
    Wan, Lihong
    Tong, Feng
    2016 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING PROCEEDINGS, 2016, : 5455 - 5459
  • [22] Information based speaker verification
    Pham, T
    Wagner, M
    15TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOL 3, PROCEEDINGS: IMAGE, SPEECH AND SIGNAL PROCESSING, 2000, : 278 - 281
  • [23] A ROBUST TEXT-INDEPENDENT SPEAKER VERIFICATION METHOD BASED ON SPEECH SEPARATION AND DEEP SPEAKER
    Zhao, Fei
    Li, Hao
    Zhang, Xueliang
    2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 6101 - 6105
  • [24] A Spectrum Smoothing Method for Speaker Verification
    Zhang, Zhaofeng
    Deng, Jing
    Wang, Longbiao
    Xiao, Xiong
    2015 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA), 2015, : 1291 - 1295
  • [25] Improvement of Speaker Vector-Based Speaker Verification
    Tadokoro, Naoki
    Kosaka, Tetsuo
    Kato, Masaharu
    Kohda, Masaki
    FIFTH INTERNATIONAL CONFERENCE ON INFORMATION ASSURANCE AND SECURITY, VOL 1, PROCEEDINGS, 2009, : 721 - 724
  • [26] An Unsupervised Domain Adaptation Method Based on Distribution Alignment for Speaker Verification
    Gu, Qing
    Song, Yan
    Guo, Wu
    Ye, Zhongfu
    Dai, Lirong
    MAN-MACHINE SPEECH COMMUNICATION, NCMMSC 2024, 2025, 2312 : 359 - 369
  • [27] SVM speaker verification method of mismatch compensation based on factor analysis
    Wu, De-Hui
    Li, Hui
    Liu, Qing-Song
    Dai, Bei-Qian
    Moshi Shibie yu Rengong Zhineng/Pattern Recognition and Artificial Intelligence, 2010, 23 (01): : 59 - 64
  • [28] Discriminative Decision Function Based Scoring Method Used in Speaker Verification
    Liang Chunyan
    Zhang Xiang
    Yan Yonghong
    CHINESE JOURNAL OF ELECTRONICS, 2012, 21 (04): : 692 - 696
  • [29] An Effective Speaker Recognition Method Based on Joint Identification and Verification Supervisions
    Liu, Ying
    Song, Yan
    Jiang, Yiheng
    McLoughlin, Ian
    Liu, Lin
    Dai, Lirong
    INTERSPEECH 2020, 2020, : 3007 - 3011
  • [30] Threshold Setting Method Based On Multimodality Detection In Speaker Verification System
    Zhao, Cuncheng
    2016 IEEE INTERNATIONAL CONFERENCE ON CONSUMER ELECTRONICS-CHINA (ICCE-CHINA), 2016,