Intrinsic Variation Robust Speaker Verification based on Sparse Representation

被引:0
|
作者
Nie, Yi [1 ]
Xu, Mingxing [1 ]
Xianyu, Haishu [1 ]
机构
[1] Tsinghua Univ, Tsinghua Natl Lab Informat Sci & Technol TNList, Dept Comp Sci & Technol, Key Lab Pervas Comp,Minist Educ, Beijing 100084, Peoples R China
关键词
speaker verification; speaking style; intrinsic variation; sparse representation; K-SVD;
D O I
暂无
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Intrinsic variation is one of the major factors that aggravate performance of speaker verification system dramatically. In this paper, we focus on alleviating influence caused by intrinsic variation using sparse representation. Because the over-complete dictionary increases the flexibility and the ability to adapt to variable data in signal representation, we expect redundancy of the dictionary could benefit addressing the implicit properties of intrinsic variation within each speaker. Both exemplar dictionary and learned dictionary are evaluated on an intrinsic variation corpus and compared with GMM-UBM, Joint Factor Analysis (JFA) and i-vector systems. In our system, we choose the K-SVD algorithm, generalization of K-means algorithm to learn dictionary with Singular Value Decomposition (SVD). The experiment results show that the two sparse representation systems achieve higher accuracy than GMM-UBM, JFA and i-vector systems consistently, especially outperform GMM-UBM respectively by 37.17% and 41.55%. We also find that the K-SVD based sparse representation system has almost the best performance, which achieve an average Error Equal Rate (EER) of 14.23%.
引用
收藏
页数:4
相关论文
共 50 条
  • [21] Robust speaker identification and verification
    Wang, Jia-Ching
    Yang, Chung-Hsien
    Wang, Jhing-Fa
    Lee, Hsiao-Ping
    IEEE COMPUTATIONAL INTELLIGENCE MAGAZINE, 2007, 2 (02) : 52 - 59
  • [22] Image Dehazing Based on Robust Sparse Representation
    Huang, Shuying
    Wu, Donglei
    Yang, Yong
    Zhu, Haijun
    IEEE ACCESS, 2018, 6 : 53907 - 53917
  • [23] Robust object tracking based on sparse representation
    Zhang, Shengping
    Yao, Hongxun
    Sun, Xin
    Liu, Shaohui
    VISUAL COMMUNICATIONS AND IMAGE PROCESSING 2010, 2010, 7744
  • [24] Speaker adaptations in sparse training data for improved speaker verification
    Ahn, S
    Ko, H
    ELECTRONICS LETTERS, 2000, 36 (04) : 371 - 373
  • [25] Comparison of Two Kinds of Speaker Location Representation for SVM-based Speaker Verification
    Zhao, Xianyu
    Dong, Yuan
    Yang, Hao
    Zhao, Jian
    Lu, Liang
    Wang, Haila
    INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4, 2007, : 1001 - +
  • [26] Robust model for speaker verification against session-dependent utterance variation
    Matsui, T
    Aikawa, K
    PROCEEDINGS OF THE 1998 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-6, 1998, : 117 - 120
  • [27] SPEAKER VERIFICATION WITH THE MIXTURE OF GAUSSIAN FACTOR ANALYSIS BASED REPRESENTATION
    Li, Ming
    2015 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP), 2015, : 4679 - 4683
  • [28] Speaker Identification Based on Robust Sparse Coding with Limited Data
    Wang, Taolin
    Cheng, Jian
    2012 5TH INTERNATIONAL CONGRESS ON IMAGE AND SIGNAL PROCESSING (CISP), 2012, : 1611 - 1614
  • [29] TOWARDS ROBUST SPEAKER VERIFICATION WITH TARGET SPEAKER ENHANCEMENT
    Zhang, Chunlei
    Yu, Meng
    Weng, Chao
    Yu, Dong
    2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 6693 - 6697
  • [30] SPARSE-BASED AUDITORY MODEL FOR ROBUST SPEAKER RECOGNITION
    You, Datao
    Han, Jiqing
    Zheng, Tieran
    Zheng, Guibin
    INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2012, 26 (07)