A Speaker Verification Method Based on TDNN-LSTMP

被引：6

作者：

Liu, Hui ^{[1
,2
]}

Zhao, Longlian ^{[1
,2
]}

机构：

[1] China Agr Univ, Coll Informat & Elect Engn, Beijing, Peoples R China

[2] Minist Educ, Modern Precis Agr Syst Integrat Res Key Lab, Beijing, Peoples R China

来源：

CIRCUITS SYSTEMS AND SIGNAL PROCESSING | 2019年 / 38卷 / 10期

关键词：

Speaker verification; I-vector; TDNN; LSTM; Short utterances;

D O I：

10.1007/s00034-019-01092-3

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

In speaker recognition, a robust recognition method is essential. This paper proposes a speaker verification method that is based on the time-delay neural network (TDNN) and long short-term memory with recurrent project layer (LSTMP) model for the speaker modeling problem in speaker verification. In this work, we present the application of the fusion of TDNN and LSTMP to the i-vector speaker recognition system that is based on the Gaussian mixture model-universal background model. By using a model that can establish long-term dependencies to create a universal background model that contains a larger amount of speaker information, it is possible to extract more feature parameters, which are speaker dependent, from the speech signal. We conducted experiments with this method on four corpora: two in Chinese and two in English. The equal error rate, minimum detection cost function and detection error tradeoff curve are used as criteria for system performance evaluation. The experimental results show that the TDNN-LSTMP/i-vector speaker recognition method outperforms the baseline system on both Chinese and English corpora and has better robustness.

引用

页码：4840 / 4854

页数：15

共 50 条

[21] A TRANSFER LEARNING METHOD FOR PLDA-BASED SPEAKER VERIFICATION
Hong, Qingyang
Zhang, Jun
Li, Lin
Wan, Lihong
Tong, Feng
2016 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING PROCEEDINGS, 2016, : 5455 - 5459
[22] Information based speaker verification
Pham, T
Wagner, M
15TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOL 3, PROCEEDINGS: IMAGE, SPEECH AND SIGNAL PROCESSING, 2000, : 278 - 281
[23] A ROBUST TEXT-INDEPENDENT SPEAKER VERIFICATION METHOD BASED ON SPEECH SEPARATION AND DEEP SPEAKER
Zhao, Fei
Li, Hao
Zhang, Xueliang
2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 6101 - 6105
[24] A Spectrum Smoothing Method for Speaker Verification
Zhang, Zhaofeng
Deng, Jing
Wang, Longbiao
Xiao, Xiong
2015 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA), 2015, : 1291 - 1295
[25] Improvement of Speaker Vector-Based Speaker Verification
Tadokoro, Naoki
Kosaka, Tetsuo
Kato, Masaharu
Kohda, Masaki
FIFTH INTERNATIONAL CONFERENCE ON INFORMATION ASSURANCE AND SECURITY, VOL 1, PROCEEDINGS, 2009, : 721 - 724
[26] An Unsupervised Domain Adaptation Method Based on Distribution Alignment for Speaker Verification
Gu, Qing
Song, Yan
Guo, Wu
Ye, Zhongfu
Dai, Lirong
MAN-MACHINE SPEECH COMMUNICATION, NCMMSC 2024, 2025, 2312 : 359 - 369
[27] SVM speaker verification method of mismatch compensation based on factor analysis
Wu, De-Hui
Li, Hui
Liu, Qing-Song
Dai, Bei-Qian
Moshi Shibie yu Rengong Zhineng/Pattern Recognition and Artificial Intelligence, 2010, 23 (01): : 59 - 64
[28] Discriminative Decision Function Based Scoring Method Used in Speaker Verification
Liang Chunyan
Zhang Xiang
Yan Yonghong
CHINESE JOURNAL OF ELECTRONICS, 2012, 21 (04): : 692 - 696
[29] An Effective Speaker Recognition Method Based on Joint Identification and Verification Supervisions
Liu, Ying
Song, Yan
Jiang, Yiheng
McLoughlin, Ian
Liu, Lin
Dai, Lirong
INTERSPEECH 2020, 2020, : 3007 - 3011
[30] Threshold Setting Method Based On Multimodality Detection In Speaker Verification System
Zhao, Cuncheng
2016 IEEE INTERNATIONAL CONFERENCE ON CONSUMER ELECTRONICS-CHINA (ICCE-CHINA), 2016,

← 1 2 3 4 5 →