Intrinsic Variation Robust Speaker Verification based on Sparse Representation

Cited by: 0

Authors:

Nie, Yi [1]

Xu, Mingxing [1]

Xianyu, Haishu [1]

Affiliation:

[1] Tsinghua Univ, Tsinghua Natl Lab Informat Sci & Technol TNList, Dept Comp Sci & Technol, Key Lab Pervas Comp,Minist Educ, Beijing 100084, Peoples R China

Source:

2014 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA) | 2014

Keywords:

speaker verification; speaking style; intrinsic variation; sparse representation; K-SVD;

DOI:

Not available

Chinese Library Classification (CLC) number:

TP39 [Computer applications];

Subject classification code:

081203 ; 0835 ;

Abstract:

Intrinsic variation is one of the major factors that dramatically degrade the performance of speaker verification systems. In this paper, we focus on alleviating the influence of intrinsic variation using sparse representation. Because an over-complete dictionary increases flexibility and the ability to adapt to variable data in signal representation, we expect that the redundancy of the dictionary can help capture the implicit properties of intrinsic variation within each speaker. Both an exemplar dictionary and a learned dictionary are evaluated on an intrinsic variation corpus and compared with GMM-UBM, Joint Factor Analysis (JFA), and i-vector systems. To learn the dictionary, we choose the K-SVD algorithm, a generalization of the K-means algorithm that uses Singular Value Decomposition (SVD). The experimental results show that the two sparse representation systems consistently achieve higher accuracy than the GMM-UBM, JFA, and i-vector systems; in particular, they outperform GMM-UBM by 37.17% and 41.55%, respectively. We also find that the K-SVD based sparse representation system has almost the best performance, achieving an average Equal Error Rate (EER) of 14.23%.
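The abstract reports results as an Equal Error Rate (EER), the operating point at which the false acceptance rate equals the false rejection rate. As a rough illustration of how this metric is commonly computed from verification scores, here is a minimal sketch. It is not taken from the paper: the function name `equal_error_rate`, the threshold sweep over observed scores, and the toy scores are all assumptions made for this example.

```python
def equal_error_rate(target_scores, impostor_scores):
    """Approximate the EER by sweeping a threshold over all observed scores.

    Hypothetical helper for illustration; a trial is accepted when its
    score is greater than or equal to the threshold.
    Returns (eer, threshold).
    """
    thresholds = sorted(set(target_scores) | set(impostor_scores))
    best = None  # (gap, eer estimate, threshold)
    for t in thresholds:
        # False acceptance rate: impostor trials that are accepted.
        far = sum(s >= t for s in impostor_scores) / len(impostor_scores)
        # False rejection rate: target trials that are rejected.
        frr = sum(s < t for s in target_scores) / len(target_scores)
        gap = abs(far - frr)
        # Keep the threshold where the two error rates are closest.
        if best is None or gap < best[0]:
            best = (gap, (far + frr) / 2, t)
    return best[1], best[2]


# Toy scores, invented purely for this example.
targets = [0.9, 0.8, 0.7, 0.4]
impostors = [0.6, 0.3, 0.2, 0.1]
eer, threshold = equal_error_rate(targets, impostors)
print(f"EER = {eer:.2%} at threshold {threshold}")  # EER = 25.00% at threshold 0.6
```

With finitely many trials the two error rates may never be exactly equal, so this sketch reports their average at the threshold where they are closest. Evaluation toolkits often interpolate between operating points instead.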

Pages: 4
