TOWARDS NOISE-ROBUST SPEAKER RECOGNITION USING PROBABILISTIC LINEAR DISCRIMINANT ANALYSIS

Cited by: 0
Authors
Lei, Yun
Burget, Lukas
Ferrer, Luciana
Graciarena, Martin
Scheffer, Nicolas
Keywords
Speaker Recognition; noise; robustness; i-vector; PLDA;
DOI
Not available
Chinese Library Classification (CLC) number
O42 [Acoustics];
Discipline classification code
070206 ; 082403 ;
Abstract
This work addresses the problem of speaker verification where additive noise is present in the enrollment and testing utterances. We show how the current state-of-the-art framework can be effectively used to mitigate this effect. We first look at the degradation a standard speaker verification system is subjected to when presented with noisy speech waveforms. We designed and generated a corpus with noisy conditions, based on the NIST SRE 2008 and 2010 data, built using open-source tools and freely available noise samples. We then show how adding noisy training data in the current i-vector-based approach followed by probabilistic linear discriminant analysis (PLDA) can bring significant gains in accuracy at various signal-to-noise ratio (SNR) levels. We demonstrate that this improvement is not feature-specific as we present positive results for three disparate sets of features: standard mel frequency cepstral coefficients, prosodic polynomial coefficients and maximum likelihood linear regression (MLLR) transforms.
Pages: 4253-4256
Page count: 4
Related papers
  • [1] Curriculum Learning based Probabilistic Linear Discriminant Analysis for Noise Robust Speaker Recognition
    Ranjan, Shivesh
    Misra, Abhinav
    Hansen, John H. L.
    [J]. 18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION, 2017, : 3717 - 3721
  • [2] Speaker Recognition Using Sparse Probabilistic Linear Discriminant Analysis
    Yang, Hai
    Xu, Yunfei
    Zhao, Qinwei
    Zhou, Ruohua
    Yan, Yonghong
    [J]. IEICE TRANSACTIONS ON FUNDAMENTALS OF ELECTRONICS COMMUNICATIONS AND COMPUTER SCIENCES, 2013, E96A (10) : 1938 - 1945
  • [3] Noise-Robust Speaker Recognition Based on Morphological Component Analysis
    He, Yongjun
    Chen, Chen
    Han, Jiqing
    [J]. 16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 3001 - 3005
  • [5] Fuzzy restricted Boltzmann machine based probabilistic linear discriminant analysis for noise-robust text-dependent speaker verification on short utterances
    Yoon, Sung-Hyun
    Koh, Min-Sung
    Yu, Ha-Jin
    [J]. IAENG International Journal of Computer Science, 2020, 47 (03) : 468 - 480
  • [6] A Noise-Robust System for NIST 2012 Speaker Recognition Evaluation
    Ferrer, Luciana
    McLaren, Mitchell
    Scheffer, Nicolas
    Lei, Yun
    Graciarena, Martin
    Mitra, Vikramjit
    [J]. 14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5, 2013, : 1980 - 1984
  • [7] Noise-robust feature based on sparse representation for speaker recognition
    Qi, Hongzhuo
    [J]. Metallurgical and Mining Industry, 2015, 7 (04) : 64 - 69
  • [8] Towards Using Reservoir Computing Networks for Noise-Robust Image Recognition
    Jalalvand, Azarakhsh
    De Neve, Wesley
    Van de Walle, Rik
    Martens, Jean-Pierre
    [J]. 2016 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2016, : 1666 - 1672
  • [9] Noise-robust speaker recognition using subband likelihoods and reliable-feature selection
    Kim, Sungtak
    Ji, Mikyong
    Kim, Hoirin
    [J]. ETRI JOURNAL, 2008, 30 (01) : 89 - 100
  • [10] Probabilistic Linear Discriminant Analysis for Robust Speaker Identification in Co-channel Speech
    Shokouhi, Navid
    Hansen, John H. L.
    [J]. 16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 3016 - 3020