IMPROVING DNN SPEAKER INDEPENDENCE WITH I-VECTOR INPUTS

被引：0

作者：

Senior, Andrew ^{[1
]}

Lopez-Moreno, Ignacio ^{[1
]}

机构：

[1] Google Inc, New York, NY 10011 USA

来源：

2014 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP) | 2014年

关键词：

Deep neural networks; large vocabulary speech recognition; Voice Search; i-vectors; speaker adaptation;

D O I：

暂无

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

We propose providing additional utterance-level features as inputs to a deep neural network (DNN) to facilitate speaker, channel and background normalization. Modifications of the basic algorithm are developed which result in significant reductions in word error rates (WERs). The algorithms are shown to combine well with speaker adaptation by backpropagation, resulting in a 9% relative WER reduction. We address implementation of the algorithm for a streaming task.

引用

页数：5

共 50 条

[1] I-Vector DNN Scoring and Calibration for Noise Robust Speaker Verification
Tan, Zhili
Mak, Man-Wai
[J]. 18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION, 2017, : 1562 - 1566
[2] A Segmental DNN/i-vector Approach for Digit-Prompted Speaker Verification
Yan, Jie
Lei, Xie
Wang, Guangsen
Fu, Zhong-Hua
[J]. 2017 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC 2017), 2017, : 1 - 5
[3] DNN and i-vector combined method for speaker recognition on multi-variability environments
Flavio J. Reyes-Díaz
Gabriel Hernández-Sierra
José R. Calvo de Lara
[J]. International Journal of Speech Technology, 2021, 24 : 409 - 418
[4] END-TO-END DNN BASED SPEAKER RECOGNITION INSPIRED BY I-VECTOR AND PLDA
Rohdin, Johan
Silnova, Anna
Diez, Mireia
Plchot, Oldrich
Matejka, Pavel
Burget, Lukas
[J]. 2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2018, : 4874 - 4878
[5] DNN and i-vector combined method for speaker recognition on multi-variability environments
Reyes-Diaz, Flavio J.
Hernandez-Sierra, Gabriel
de Lara, Jose R. Calvo
[J]. INTERNATIONAL JOURNAL OF SPEECH TECHNOLOGY, 2021, 24 (02) : 409 - 418
[6] DNN i-vector Speaker Verification with Short, Text-constrained Test Utterances
Zhong, Jinghua
Hu, Wenping
Soong, Frank
Meng, Helen
[J]. 18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION, 2017, : 1507 - 1511
[7] An I-Vector Backend for Speaker Verification
Kenny, Patrick
Stafylakis, Themos
Alam, Jahangir
Kockmann, Marcel
[J]. 16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 2307 - 2311
[8] Improved i-Vector Representation for Speaker Diarization
Xu, Yan
McLoughlin, Ian
Song, Yan
Wu, Kui
[J]. CIRCUITS SYSTEMS AND SIGNAL PROCESSING, 2016, 35 (09) : 3393 - 3404
[9] Simplification of I-Vector Extraction for Speaker Identification
XU Longting
YANG Zhen
SUN Linhui
[J]. Chinese Journal of Electronics, 2016, 25 (06) : 1121 - 1126
[10] Improved i-Vector Representation for Speaker Diarization
Yan Xu
Ian McLoughlin
Yan Song
Kui Wu
[J]. Circuits, Systems, and Signal Processing, 2016, 35 : 3393 - 3404

← 1 2 3 4 5 →