Comparison of text-independent speaker recognition methods on telephone speech with acoustic mismatch

被引：0

作者：

vanVuuren, S

机构：

来源：

ICSLP 96 - FOURTH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, VOLS 1-4 | 1996年

关键词：

D O I：

暂无

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

We compare speaker recognition performance of Vector Quantization (VQ), Gaussian Mixture Modeling (GMM) and the Arithmetic Harmonic Sphericity measure (AHS) in adverse telephone speech conditions. The aim is to address the question: how do multimodal VQ and GMM typically compare to the simpler unimodal AHS for matched and mismatched training and testing environments. We study identification (dosed set) and verification errors on a new multi-environment database. We consider LPC and PLP features as well as their RASTA derivatives. We conclude that RASTA processing can remove redundancies from the features. We affirm that even when we use channel and noise compensation schemes speaker recognition errors remain high when there is acoustic mismatch.

引用

页码：1788 / 1791

页数：4

共 50 条

[21] TEXT-INDEPENDENT SPEAKER RECOGNITION USING NEURAL NETWORKS
HATTORI, H
[J]. IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 1993, E76D (03) : 345 - 351
[22] Research on text-independent speaker recognition methods using wavelet neural network
Bai, Ying
Zhao, Zhen-Dong
Qi, Yin-Cheng
Wang, Bin
Guo, Jian-Yong
[J]. Dianzi Yu Xinxi Xuebao/Journal of Electronics and Information Technology, 2006, 28 (06): : 1036 - 1039
[23] A study of variational method for text-independent speaker recognition
He, Liang
Tian, Yao
Liu, Yi
Dong, Fang
Zhang, WeiQiang
Liu, Jia
[J]. 2016 10TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING (ISCSLP), 2016,
[24] PCA/LDA Approach for Text-Independent Speaker Recognition
Ge, Zhenhao
Sharma, Sudhendu R.
Smith, Mark J. T.
[J]. INDEPENDENT COMPONENT ANALYSES, COMPRESSIVE SAMPLING, WAVELETS, NEURAL NET, BIOSYSTEMS, AND NANOENGINEERING X, 2012, 8401
[25] AN ACOUSTIC SEGMENT MODEL APPROACH TO INCORPORATING TEMPORAL INFORMATION INTO SPEAKER MODELING FOR TEXT-INDEPENDENT SPEAKER RECOGNITION
Tsao, Yu
Sun, Hanwu
Li, Haizhou
Lee, Chin-Hui
[J]. 2010 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2010, : 4422 - 4425
[26] A discriminative training approach for text-independent speaker recognition
Hong, QY
Kwong, S
[J]. SIGNAL PROCESSING, 2005, 85 (07) : 1449 - 1463
[27] Exploring discriminative learning for text-independent speaker recognition
Liu, Ming
Zhang, Zhengyou
Hasegawa-Johnson, Mark
Huang, Thomas S.
[J]. 2007 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, VOLS 1-5, 2007, : 56 - 59
[28] Text-independent speaker recognition using graph matching
Hautamaki, Ville
Kinnunen, Tomi
Franti, Pasi
[J]. PATTERN RECOGNITION LETTERS, 2008, 29 (09) : 1427 - 1432
[29] I-MATRIX FOR TEXT-INDEPENDENT SPEAKER RECOGNITION
He, Liang
Liu, Jia
[J]. 2013 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2013, : 7194 - 7198
[30] Searching through a speech memory for text-independent speaker verification
Petrovska-Delacrétaz, D
El Hannani, A
Chollet, G
[J]. AUDIO-BASED AND VIDEO-BASED BIOMETRIC PERSON AUTHENTICATION, PROCEEDINGS, 2003, 2688 : 95 - 103

← 1 2 3 4 5 →