Robust speaker identification system based on wavelet transform and Gaussian mixture model

被引:0
|
作者
Chen, WC [1 ]
Hsieh, CT
Lai, E
机构
[1] St Johns & St Marys Inst Technol, Dept Elect Engn, Taipei, Taiwan
[2] Tamkang Univ, Dept Elect Engn, Taipei, Taiwan
来源
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper presents an effective method for improving the performance of a speaker identification system. Based on the multiresolution property of the wavelet transform, the input speech signal is decomposed into various frequency bands in order not to spread noise distortions over the entire feature space. The linear predictive cepstral coefficients (LPCCs) of each band are calculated. Furthermore, the cepstral mean normalization technique is applied to all computed features. We use feature recombination and likelihood recombination methods to evaluate the task of the text-independent speaker identification. The feature recombination scheme combines the cepstral coefficients of each band to form a single feature vector used to train the Gaussian mixture model (GMM). The likelihood recombination scheme combines the likelihood scores of independent GMM for each band. Experimental results show that both proposed methods outperform the GMM model using full-band LPCCs and mel-frequency cepstral coefficients (MFCCs) in both clean and noisy environments.
引用
收藏
页码:263 / 271
页数:9
相关论文
共 50 条
  • [1] Robust speaker identification system based on wavelet transform and Gaussian mixture model
    Hsieh, CT
    Lai, E
    Wang, YC
    JOURNAL OF INFORMATION SCIENCE AND ENGINEERING, 2003, 19 (02) : 267 - 282
  • [2] A robust speaker identification system based on wavelet transform
    Hsieh, CT
    Wang, YC
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2001, E84D (07): : 839 - 846
  • [3] Speaker identification research based on gaussian mixture model
    Chunguang, Han
    Hua, Li
    Jia, Ding
    2007 International Symposium on Computer Science & Technology, Proceedings, 2007, : 702 - 705
  • [4] Robust speech features based on wavelet transform with application to speaker identification
    Hsieh, CT
    Lai, E
    Wang, YC
    IEE PROCEEDINGS-VISION IMAGE AND SIGNAL PROCESSING, 2002, 149 (02): : 108 - 114
  • [5] Speaker Identification Wavelet Transform Based Method
    Daqrouq, Khaled
    Al-Sawalmeh, Wael
    Al-Qawasmi, Abdel-Rahman
    Abu-Isbeib, Ibrahim N.
    2008 5TH INTERNATIONAL MULTI-CONFERENCE ON SYSTEMS, SIGNALS AND DEVICES, VOLS 1 AND 2, 2008, : 698 - 702
  • [6] An efficient scoring algorithm for Gaussian mixture model based speaker identification
    Pellom, BL
    Hansen, JHL
    IEEE SIGNAL PROCESSING LETTERS, 1998, 5 (11) : 281 - 284
  • [7] Distributed genetic algorithm for Gaussian mixture model based speaker identification
    Lung, SY
    PATTERN RECOGNITION, 2003, 36 (10) : 2479 - 2481
  • [8] Optimization of Gaussian mixture model parameters for speaker identification
    Hong, QY
    Kwong, S
    Wang, HL
    GENETIC AND EVOLUTIONARY COMPUTATION GECCO 2004 , PT 2, PROCEEDINGS, 2004, 3103 : 1310 - 1311
  • [9] Individual dimension Gaussian mixture model for speaker identification
    Wang, C
    Hou, LM
    Fang, Y
    ADVANCES IN BIOMETRIC PERSON AUTHENTICATION, PROCEEDINGS, 2005, 3781 : 172 - 179
  • [10] Speaker identification using hybrid Karhunen-Loeve transform and Gaussian mixture model approach
    Chen, CCT
    Chen, CT
    Hou, CK
    PATTERN RECOGNITION, 2004, 37 (05) : 1073 - 1075