Maximum Likelihood Linear Dimension Reduction of Heteroscedastic Feature for Robust Speaker Recognition

Citations: 0
Authors
Shon, Suwon [1 ]
Mun, Seongkyu [1 ]
Han, David K. [2 ]
Ko, Hanseok [1 ]
Affiliations
[1] Korea Univ, Sch Elect Engn, Seoul, South Korea
[2] Off Naval Res, Arlington, VA USA
Keywords
DOI
Not available
Chinese Library Classification number
TB8 [Photographic technology];
Discipline classification code
0804;
Abstract
This paper analyzes heteroscedasticity in i-vectors for robust speaker recognition systems in forensics and surveillance. Linear Discriminant Analysis (LDA), a widely used linear dimension reduction technique, assumes that classes are homoscedastic, that is, that they share the same covariance. This paper instead assumes that general speech utterances contain both homoscedastic and heteroscedastic elements. We validate this assumption through several analyses and also demonstrate that dimension reduction using principal components is feasible. To handle the presence of both heteroscedastic and homoscedastic elements effectively, we propose a fusion approach that applies both LDA and Heteroscedastic-LDA (HLDA). Experiments are conducted to show its effectiveness and to compare it with other methods, using the telephone database of the National Institute of Standards and Technology (NIST) Speaker Recognition Evaluation (SRE) 2010 extended.
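The homoscedasticity assumption mentioned in the abstract can be illustrated with a minimal NumPy sketch. This is not the authors' method and it does not implement HLDA or the proposed fusion: it only builds a classical LDA projection and a simple, hypothetical `heteroscedasticity_score` (mean relative Frobenius distance between each class covariance and the average class covariance). The synthetic 4-dimensional data, the function names, and the score itself are assumptions made for illustration.

```python
import numpy as np


def lda_projection(X, y, n_components):
    """Classical LDA basis: leading eigenvectors of inv(Sw) @ Sb."""
    overall_mean = X.mean(axis=0)
    dim = X.shape[1]
    Sw = np.zeros((dim, dim))  # pooled within-class scatter
    Sb = np.zeros((dim, dim))  # between-class scatter
    for label in np.unique(y):
        Xc = X[y == label]
        centered = Xc - Xc.mean(axis=0)
        Sw += centered.T @ centered
        diff = (Xc.mean(axis=0) - overall_mean)[:, None]
        Sb += len(Xc) * (diff @ diff.T)
    # inv(Sw) @ Sb is not symmetric, so use the general eigensolver
    eigvals, eigvecs = np.linalg.eig(np.linalg.solve(Sw, Sb))
    order = np.argsort(eigvals.real)[::-1][:n_components]
    return eigvecs[:, order].real


def heteroscedasticity_score(X, y):
    """Hypothetical measure: 0 when all class covariances are identical."""
    covs = [np.cov(X[y == label], rowvar=False) for label in np.unique(y)]
    pooled = np.mean(covs, axis=0)
    return float(np.mean([np.linalg.norm(S - pooled) / np.linalg.norm(pooled)
                          for S in covs]))


# Synthetic stand-in for speaker classes (assumed data, not i-vectors)
rng = np.random.default_rng(0)
y = np.repeat([0, 1, 2], 200)
means = np.array([[0, 0, 0, 0], [3, 0, 0, 0], [0, 3, 0, 0]], dtype=float)
noise = rng.normal(size=(600, 4))
X_homo = means[y] + noise  # every class shares one covariance
X_hetero = means[y] + noise * np.array([1.0, 2.0, 3.0])[y][:, None]  # per-class scale

W = lda_projection(X_homo, y, n_components=2)
score_homo = heteroscedasticity_score(X_homo, y)
score_hetero = heteroscedasticity_score(X_hetero, y)
print(W.shape, score_homo, score_hetero)
```

With three classes the between-class scatter has rank at most two, which is why the sketch keeps two LDA components. The score for the per-class-scaled data should come out clearly larger than for the shared-covariance data, which is the situation where a single pooled covariance, as LDA uses, fits poorly.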
Pages: 5
Related papers
50 in total
  • [1] Maximum likelihood linear programming data fusion for speaker recognition
    Monte-Moreno, Enric
    Chetouani, Mohamed
    Faundez-Zanuy, Marcos
    Sole-Casals, Jordi
    [J]. SPEECH COMMUNICATION, 2009, 51 (09) : 820 - 830
  • [2] Robust endpoint detection based on feature weighted likelihood and dimension reduction
    Wang, Huanliang
    Han, Jiqing
    Li, Haifeng
    [J]. Shengxue Xuebao/Acta Acustica, 2007, 32 (01) : 62 - 68
  • [3] Robust maximum likelihood training of heteroscedastic probabilistic neural networks
    Yang, ZR
    Chen, S
    [J]. NEURAL NETWORKS, 1998, 11 (04) : 739 - 747
  • [4] Speaker verification using speaker model synthesis and feature mapping based on maximum-likelihood linear regression
    Chen, Cunbao
    Zhao, Li
    Zou, Cairong
    [J]. Shengxue Xuebao/Acta Acustica, 2011, 36 (01) : 81 - 87
  • [5] Maximum likelihood and maximum a posteriori adaptation for distributed speaker recognition systems
    Sit, CH
    Mak, MW
    Kung, SY
    [J]. BIOMETRIC AUTHENTICATION, PROCEEDINGS, 2004, 3072 : 640 - 647
  • [6] Regularized Feature-Based Maximum Likelihood Linear Regression for Speech Recognition
    Omar, Mohamed Kamal
    [J]. INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4, 2007 : 2456 - 2459
  • [7] The existence of maximum likelihood estimates in heteroscedastic linear models with censored data
    Wang, Yu
    Li, Jihong
    Yang, Liu
    [J]. DYNAMICS OF CONTINUOUS DISCRETE AND IMPULSIVE SYSTEMS-SERIES B-APPLICATIONS & ALGORITHMS, 2007, 14 : 416 - 420
  • [8] New speaker recognition feature using correlation dimension
    Seo, J
    Hong, S
    Gu, J
    Kim, M
    Baek, I
    Kwon, Y
    Lee, K
    Yang, S
    [J]. ISIE 2001: IEEE INTERNATIONAL SYMPOSIUM ON INDUSTRIAL ELECTRONICS PROCEEDINGS, VOLS I-III, 2001 : 505 - 507
  • [9] Environment adaptation for robust speaker verification by cascading maximum likelihood linear regression and reinforced learning
    Yiu, K. K.
    Mak, M. W.
    Kung, S. Y.
    [J]. COMPUTER SPEECH AND LANGUAGE, 2007, 21 (02) : 231 - 246
  • [10] Robust maximum likelihood estimation in the linear model
    Calafiore, G
    El Ghaoui, L
    [J]. AUTOMATICA, 2001, 37 (04) : 573 - 580