TOWARDS NOISE-ROBUST SPEAKER RECOGNITION USING PROBABILISTIC LINEAR DISCRIMINANT ANALYSIS

被引:0
|
作者
Lei, Yun
Burget, Lukas
Ferrer, Luciana
Graciarena, Martin
Scheffer, Nicolas
机构
关键词
Speaker Recognition; noise; robustness; i-vector; PLDA;
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
This work addresses the problem of speaker verification where additive noise is present in the enrollment and testing utterances. We show how the current state-of-the-art framework can be effectively used to mitigate this effect. We first look at the degradation a standard speaker verification system is subjected to when presented with noisy speech waveforms. We designed and generated a corpus with noisy conditions, based on the NIST SRE 2008 and 2010 data, built using open-source tools and freely available noise samples. We then show how adding noisy training data in the current i-vector-based approach followed by probabilistic linear discriminant analysis (PLDA) can bring significant gains in accuracy at various signal-to-noise ratio (SNR) levels. We demonstrate that this improvement is not feature-specific as we present positive results for three disparate sets of features: standard mel frequency cepstral coefficients, prosodic polynomial coefficients and maximum likelihood linear regression (MLLR) transforms.
引用
收藏
页码:4253 / 4256
页数:4
相关论文
共 50 条
  • [41] Frame decorrelation for noise-robust speech recognition
    Jung, HY
    Kim, DY
    Un, CK
    [J]. ELECTRONICS LETTERS, 1996, 32 (13) : 1163 - 1164
  • [42] Frame decorrelation for noise-robust speech recognition
    Korea Advanced Inst of Science and, Technology, Taejon, Korea, Republic of
    [J]. Electron Lett, 13 (1163-1164):
  • [43] Software Entity Recognition with Noise-Robust Learning
    Tai Nguyen
    Di, Yifeng
    Lee, Joohan
    Chen, Muhao
    Zhang, Tianyi
    [J]. 2023 38TH IEEE/ACM INTERNATIONAL CONFERENCE ON AUTOMATED SOFTWARE ENGINEERING, ASE, 2023, : 484 - 496
  • [44] Extended VTS for Noise-Robust Speech Recognition
    van Dalen, Rogier C.
    Gales, Mark J. F.
    [J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2011, 19 (04): : 733 - 743
  • [45] Probabilistic Linear Discriminant Analysis with Bottleneck Features for Speech Recognition
    Lu, Liang
    Renals, Steve
    [J]. 15TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2014), VOLS 1-4, 2014, : 910 - 914
  • [46] Noise-robust parameter estimation of linear systems
    Emara-Shabaik, HE
    [J]. JOURNAL OF VIBRATION AND CONTROL, 2000, 6 (05) : 727 - 740
  • [47] Generative Adversarial Networks based X-vector Augmentation for Robust Probabilistic Linear Discriminant Analysis in Speaker Verification
    Yang, Yexin
    Wang, Shuai
    Sun, Man
    Qian, Yanmin
    Yu, Kai
    [J]. 2018 11TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING (ISCSLP), 2018, : 205 - 209
  • [48] Noise-robust acoustic signature recognition using nonlinear Hebbian learning
    Lu, Bing
    Dibazar, Alireza
    Berger, Theodore W.
    [J]. NEURAL NETWORKS, 2010, 23 (10) : 1252 - 1263
  • [49] Robust Speaker Identification Using Multimodal Discriminant Analysis with Kernels
    Kim, Min-Seok
    Yang, Il-Ho
    Yu, Ha-Jin
    [J]. ICTAI: 2009 21ST INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE, 2009, : 319 - 322
  • [50] Noisy Constrained Maximum-Likelihood Linear Regression for Noise-Robust Speech Recognition
    Kim, D. K.
    Gales, M. J. F.
    [J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2011, 19 (02): : 315 - 325