Comparison of text-independent speaker recognition methods on telephone speech with acoustic mismatch

被引:0
|
作者
vanVuuren, S
机构
关键词
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
We compare speaker recognition performance of Vector Quantization (VQ), Gaussian Mixture Modeling (GMM) and the Arithmetic Harmonic Sphericity measure (AHS) in adverse telephone speech conditions. The aim is to address the question: how do multimodal VQ and GMM typically compare to the simpler unimodal AHS for matched and mismatched training and testing environments. We study identification (dosed set) and verification errors on a new multi-environment database. We consider LPC and PLP features as well as their RASTA derivatives. We conclude that RASTA processing can remove redundancies from the features. We affirm that even when we use channel and noise compensation schemes speaker recognition errors remain high when there is acoustic mismatch.
引用
收藏
页码:1788 / 1791
页数:4
相关论文
共 50 条
  • [22] Research on text-independent speaker recognition methods using wavelet neural network
    Bai, Ying
    Zhao, Zhen-Dong
    Qi, Yin-Cheng
    Wang, Bin
    Guo, Jian-Yong
    [J]. Dianzi Yu Xinxi Xuebao/Journal of Electronics and Information Technology, 2006, 28 (06): : 1036 - 1039
  • [23] A study of variational method for text-independent speaker recognition
    He, Liang
    Tian, Yao
    Liu, Yi
    Dong, Fang
    Zhang, WeiQiang
    Liu, Jia
    [J]. 2016 10TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING (ISCSLP), 2016,
  • [24] PCA/LDA Approach for Text-Independent Speaker Recognition
    Ge, Zhenhao
    Sharma, Sudhendu R.
    Smith, Mark J. T.
    [J]. INDEPENDENT COMPONENT ANALYSES, COMPRESSIVE SAMPLING, WAVELETS, NEURAL NET, BIOSYSTEMS, AND NANOENGINEERING X, 2012, 8401
  • [25] AN ACOUSTIC SEGMENT MODEL APPROACH TO INCORPORATING TEMPORAL INFORMATION INTO SPEAKER MODELING FOR TEXT-INDEPENDENT SPEAKER RECOGNITION
    Tsao, Yu
    Sun, Hanwu
    Li, Haizhou
    Lee, Chin-Hui
    [J]. 2010 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2010, : 4422 - 4425
  • [26] A discriminative training approach for text-independent speaker recognition
    Hong, QY
    Kwong, S
    [J]. SIGNAL PROCESSING, 2005, 85 (07) : 1449 - 1463
  • [27] Exploring discriminative learning for text-independent speaker recognition
    Liu, Ming
    Zhang, Zhengyou
    Hasegawa-Johnson, Mark
    Huang, Thomas S.
    [J]. 2007 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, VOLS 1-5, 2007, : 56 - 59
  • [28] Text-independent speaker recognition using graph matching
    Hautamaki, Ville
    Kinnunen, Tomi
    Franti, Pasi
    [J]. PATTERN RECOGNITION LETTERS, 2008, 29 (09) : 1427 - 1432
  • [29] I-MATRIX FOR TEXT-INDEPENDENT SPEAKER RECOGNITION
    He, Liang
    Liu, Jia
    [J]. 2013 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2013, : 7194 - 7198
  • [30] Searching through a speech memory for text-independent speaker verification
    Petrovska-Delacrétaz, D
    El Hannani, A
    Chollet, G
    [J]. AUDIO-BASED AND VIDEO-BASED BIOMETRIC PERSON AUTHENTICATION, PROCEEDINGS, 2003, 2688 : 95 - 103