Maximum Likelihood Linear Dimension Reduction of Heteroscedastic Feature for Robust Speaker Recognition

Cited by: 0
Authors
Shon, Suwon [1 ]
Mun, Seongkyu [1 ]
Han, David K. [2 ]
Ko, Hanseok [1 ]
Affiliations
[1] Korea Univ, Sch Elect Engn, Seoul, South Korea
[2] Off Naval Res, Arlington, VA USA
Keywords
DOI
Not available
Chinese Library Classification
TB8 [Photographic technology];
Discipline classification code
0804;
Abstract
This paper analyzes heteroscedasticity in i-vectors for robust speaker recognition in forensics and surveillance systems. Linear Discriminant Analysis (LDA), a widely used linear dimension reduction technique, assumes that classes are homoscedastic, that is, that they share the same covariance. In this paper it is assumed that general speech utterances contain both homoscedastic and heteroscedastic elements. We show the validity of this assumption through several analyses and also demonstrate that dimension reduction using principal components is feasible. To handle the presence of both heteroscedastic and homoscedastic elements effectively, we propose a fusion approach that applies both LDA and Heteroscedastic LDA (HLDA). Experiments are conducted to show its effectiveness and to compare it with other methods using the telephone database of the National Institute of Standards and Technology (NIST) Speaker Recognition Evaluation (SRE) 2010 extended.
Pages: 5
Related papers
50 in total
  • [31] Maximum-likelihood stochastic matching approach to non-linear equalization for robust speech recognition
    Surendran, AC
    Lee, CH
    Rahim, M
    ICSLP 96 - FOURTH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, VOLS 1-4, 1996, : 1836 - 1839
  • [32] Maximum-Likelihood Linear Transformation for Unsupervised Domain Adaptation in Speaker Verification
    Misra, Abhinav
    Hansen, John H. L.
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2018, 26 (09) : 1549 - 1558
  • [33] Exploration of Feature Reduction of MFCC Spectral Features in Speaker Recognition
    Kumar, Mohit
    Katti, Sachin
    Das, Pradip K.
    ADVANCED COMPUTING AND COMMUNICATION TECHNOLOGIES, 2016, 452 : 151 - 159
  • [34] A linear dimension reduction technique for face recognition
    Anjum, MA
    Javed, MY
    Basit, A
    SAM '05: Proceedings of the 2005 International Conference on Security and Management, 2005, : 524 - 528
  • [35] Manifold learning based speaker dependent dimension reduction for robust text independent speaker verification
    Zabihzadeh D.
    Moattar M.H.
    Zabihzadeh, D. (d.zabihzadeh@gmail.com), 1600, Kluwer Academic Publishers (17): : 271 - 280
  • [36] FUSION OF HYPERSPECTRAL AND LIDAR DATA BASED ON DIMENSION REDUCTION AND MAXIMUM LIKELIHOOD
    Abbasi, B.
    Arefi, H.
    Bigdeli, B.
    Motagh, M.
    Roessner, S.
    36TH INTERNATIONAL SYMPOSIUM ON REMOTE SENSING OF ENVIRONMENT, 2015, 47 (W3): : 569 - 573
  • [37] REGULARIZED CONSTRAINED MAXIMUM LIKELIHOOD LINEAR REGRESSION FOR SPEECH RECOGNITION
    Ghalehjegh, Sina Hamidi
    Rose, Richard C.
    2014 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2014,
  • [38] Maximum Conditional Likelihood Linear Regression and Maximum A Posteriori for Hidden Conditional Random Fields speaker adaptation
    Sung, Yun-Hsuan
    Boulis, Constantinos
    Jurafsky, Dan
    2008 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-12, 2008, : 4293 - +
  • [39] Robust Speaker Recognition Using Improved GFCC and Adaptive Feature Selection
    Zhang, Xingyu
    Zou, Xia
    Sun, Meng
    Wu, Penglong
    SECURITY WITH INTELLIGENT COMPUTING AND BIG-DATA SERVICES, 2020, 895 : 159 - 169
  • [40] Multitaper Based MFCC Feature Extraction for Robust Speaker Recognition System
    Bharath, K. P.
    Kumar, Rajesh M.
    2019 INNOVATIONS IN POWER AND ADVANCED COMPUTING TECHNOLOGIES (I-PACT), 2019,