Intrinsic Variation Robust Speaker Verification based on Sparse Representation

被引:0
|
作者
Nie, Yi [1 ]
Xu, Mingxing [1 ]
Xianyu, Haishu [1 ]
机构
[1] Tsinghua Univ, Tsinghua Natl Lab Informat Sci & Technol TNList, Dept Comp Sci & Technol, Key Lab Pervas Comp,Minist Educ, Beijing 100084, Peoples R China
关键词
speaker verification; speaking style; intrinsic variation; sparse representation; K-SVD;
D O I
暂无
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Intrinsic variation is one of the major factors that aggravate performance of speaker verification system dramatically. In this paper, we focus on alleviating influence caused by intrinsic variation using sparse representation. Because the over-complete dictionary increases the flexibility and the ability to adapt to variable data in signal representation, we expect redundancy of the dictionary could benefit addressing the implicit properties of intrinsic variation within each speaker. Both exemplar dictionary and learned dictionary are evaluated on an intrinsic variation corpus and compared with GMM-UBM, Joint Factor Analysis (JFA) and i-vector systems. In our system, we choose the K-SVD algorithm, generalization of K-means algorithm to learn dictionary with Singular Value Decomposition (SVD). The experiment results show that the two sparse representation systems achieve higher accuracy than GMM-UBM, JFA and i-vector systems consistently, especially outperform GMM-UBM respectively by 37.17% and 41.55%. We also find that the K-SVD based sparse representation system has almost the best performance, which achieve an average Error Equal Rate (EER) of 14.23%.
引用
收藏
页数:4
相关论文
共 50 条
  • [1] Robust speaker verification based on max pooling of sparse representation
    Wang, Wei
    Han, Jiqing
    Zheng, Tieran
    Zheng, Guibin
    Han, J. (jqhan@hit.edu.cn), 1600, Computer Society of the Republic of China (24): : 56 - 65
  • [3] A robust feature based on sparse representation for speaker recognition
    Xie, Yining
    Huang, Jinjie
    Wang, Xinlei
    Journal of Computational Information Systems, 2013, 9 (09): : 3553 - 3561
  • [4] SPEAKER VERIFICATION USING SPARSE REPRESENTATION CLASSIFICATION
    Kua, Jia Min Karen
    Ambikairajah, Eliathamby
    Epps, Julien
    Togneri, Roberto
    2011 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2011, : 4548 - 4551
  • [5] A robust sparse auditory feature for speaker verification
    Han, J. (jqhan@hit.edu.cn), 1600, Binary Information Press, P.O. Box 162, Bethel, CT 06801-0162, United States (09):
  • [6] Frame level sparse representation classification for speaker verification
    Hasheminejad, Mohammad
    Farsi, Hassan
    MULTIMEDIA TOOLS AND APPLICATIONS, 2017, 76 (20) : 21211 - 21224
  • [7] Auditory Sparse Representation for Robust Speaker Recognition Based on Tensor Structure
    Qiang Wu
    Liqing Zhang
    EURASIP Journal on Audio, Speech, and Music Processing, 2008
  • [8] Noise-robust feature based on sparse representation for speaker recognition
    Qi, Hongzhuo
    Metallurgical and Mining Industry, 2015, 7 (04): : 64 - 69
  • [9] Auditory Sparse Representation for Robust Speaker Recognition Based on Tensor Structure
    Wu, Qiang
    Zhang, Liqing
    EURASIP JOURNAL ON AUDIO SPEECH AND MUSIC PROCESSING, 2008, 2008 (1)
  • [10] Frame level sparse representation classification for speaker verification
    Mohammad Hasheminejad
    Hassan Farsi
    Multimedia Tools and Applications, 2017, 76 : 21211 - 21224