IMPROVING DNN SPEAKER INDEPENDENCE WITH I-VECTOR INPUTS

被引:0
|
作者
Senior, Andrew [1 ]
Lopez-Moreno, Ignacio [1 ]
机构
[1] Google Inc, New York, NY 10011 USA
关键词
Deep neural networks; large vocabulary speech recognition; Voice Search; i-vectors; speaker adaptation;
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
We propose providing additional utterance-level features as inputs to a deep neural network (DNN) to facilitate speaker, channel and background normalization. Modifications of the basic algorithm are developed which result in significant reductions in word error rates (WERs). The algorithms are shown to combine well with speaker adaptation by backpropagation, resulting in a 9% relative WER reduction. We address implementation of the algorithm for a streaming task.
引用
收藏
页数:5
相关论文
共 50 条
  • [1] I-Vector DNN Scoring and Calibration for Noise Robust Speaker Verification
    Tan, Zhili
    Mak, Man-Wai
    [J]. 18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION, 2017, : 1562 - 1566
  • [2] A Segmental DNN/i-vector Approach for Digit-Prompted Speaker Verification
    Yan, Jie
    Lei, Xie
    Wang, Guangsen
    Fu, Zhong-Hua
    [J]. 2017 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC 2017), 2017, : 1 - 5
  • [3] DNN and i-vector combined method for speaker recognition on multi-variability environments
    Flavio J. Reyes-Díaz
    Gabriel Hernández-Sierra
    José R. Calvo de Lara
    [J]. International Journal of Speech Technology, 2021, 24 : 409 - 418
  • [4] END-TO-END DNN BASED SPEAKER RECOGNITION INSPIRED BY I-VECTOR AND PLDA
    Rohdin, Johan
    Silnova, Anna
    Diez, Mireia
    Plchot, Oldrich
    Matejka, Pavel
    Burget, Lukas
    [J]. 2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2018, : 4874 - 4878
  • [5] DNN and i-vector combined method for speaker recognition on multi-variability environments
    Reyes-Diaz, Flavio J.
    Hernandez-Sierra, Gabriel
    de Lara, Jose R. Calvo
    [J]. INTERNATIONAL JOURNAL OF SPEECH TECHNOLOGY, 2021, 24 (02) : 409 - 418
  • [6] DNN i-vector Speaker Verification with Short, Text-constrained Test Utterances
    Zhong, Jinghua
    Hu, Wenping
    Soong, Frank
    Meng, Helen
    [J]. 18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION, 2017, : 1507 - 1511
  • [7] An I-Vector Backend for Speaker Verification
    Kenny, Patrick
    Stafylakis, Themos
    Alam, Jahangir
    Kockmann, Marcel
    [J]. 16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 2307 - 2311
  • [8] Improved i-Vector Representation for Speaker Diarization
    Xu, Yan
    McLoughlin, Ian
    Song, Yan
    Wu, Kui
    [J]. CIRCUITS SYSTEMS AND SIGNAL PROCESSING, 2016, 35 (09) : 3393 - 3404
  • [9] Simplification of I-Vector Extraction for Speaker Identification
    XU Longting
    YANG Zhen
    SUN Linhui
    [J]. Chinese Journal of Electronics, 2016, 25 (06) : 1121 - 1126
  • [10] Improved i-Vector Representation for Speaker Diarization
    Yan Xu
    Ian McLoughlin
    Yan Song
    Kui Wu
    [J]. Circuits, Systems, and Signal Processing, 2016, 35 : 3393 - 3404