Robust speaker identification system based on wavelet transform and Gaussian mixture model

被引：0

作者：

Chen, WC ^{[1
]}

Hsieh, CT

Lai, E

机构：

[1] St Johns & St Marys Inst Technol, Dept Elect Engn, Taipei, Taiwan

[2] Tamkang Univ, Dept Elect Engn, Taipei, Taiwan

来源：

NATURAL LANGUAGE PROCESSING - IJCNLP 2004 | 2005年 / 3248卷

关键词：

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

This paper presents an effective method for improving the performance of a speaker identification system. Based on the multiresolution property of the wavelet transform, the input speech signal is decomposed into various frequency bands in order not to spread noise distortions over the entire feature space. The linear predictive cepstral coefficients (LPCCs) of each band are calculated. Furthermore, the cepstral mean normalization technique is applied to all computed features. We use feature recombination and likelihood recombination methods to evaluate the task of the text-independent speaker identification. The feature recombination scheme combines the cepstral coefficients of each band to form a single feature vector used to train the Gaussian mixture model (GMM). The likelihood recombination scheme combines the likelihood scores of independent GMM for each band. Experimental results show that both proposed methods outperform the GMM model using full-band LPCCs and mel-frequency cepstral coefficients (MFCCs) in both clean and noisy environments.

引用

页码：263 / 271

页数：9

共 50 条

[21] A genetic algorithm based method for optimisation of Gaussian mixture model parameters for speaker identification
Mashao, DJ
Tsai, CT
6TH WORLD MULTICONFERENCE ON SYSTEMICS, CYBERNETICS AND INFORMATICS, VOL III, PROCEEDINGS: IMAGE, ACOUSTIC, SPEECH AND SIGNAL PROCESSING I, 2002, : 254 - 258
[22] Enhancing the Performance of Gaussian Mixture Model-Based Text Independent Speaker Identification
M.A. El-Gamal
M.F. Abu El-Yazeed
M.M.H. El Ayadi
International Journal of Speech Technology, 2005, 8 (1) : 93 - 103
[23] Multistage Speaker Feature Tracking Identification System Based on Continuous and Discrete Wavelet Transform
Al-Sawalmeh, Wael
Daqrouq, Khaled
Al-Qawasmi, Abdel-Rahman
MUSP '06: PROCEEDINGS OF THE 9TH WSEAS INTERNATIONAL CONFERENCE ON MULTIMEDIA SYSTEMS AND SIGNAL PROCESSING, 2009, : 30 - +
[24] Enhancing the Performance of Gaussian Mixture Model-Based Text Independent Speaker Identification
El-Gamal, M. A.
Abu El-Yazeed, M. F.
El Ayadi, M. M. H.
INTERNATIONAL JOURNAL OF SPEECH TECHNOLOGY, 2005, 8 (01) : 93 - 103
[25] Research on the Parameter Optimal Algorithm of Gaussian Mixture Model in Speaker Identification
Ding, Hui
Tang, Zhenmin
Li, Yanping
PROCEEDINGS OF THE 2009 CHINESE CONFERENCE ON PATTERN RECOGNITION AND THE FIRST CJK JOINT WORKSHOP ON PATTERN RECOGNITION, VOLS 1 AND 2, 2009, : 639 - +
[26] SPEAKER IDENTIFICATION AND VERIFICATION USING GAUSSIAN MIXTURE SPEAKER MODELS
REYNOLDS, DA
SPEECH COMMUNICATION, 1995, 17 (1-2) : 91 - 108
[27] Improving Speaker Identification System Using Discrete Wavelet Transform and AWGN
Maged, Heba
AbouEl-Farag, Ahmed
Mesbah, Saleh
2014 5TH IEEE INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING AND SERVICE SCIENCE (ICSESS), 2014, : 1171 - 1176
[28] Parameter optimization for Gaussian mixture model and its application in speaker identification
1600, ICIC Express Letters Office (07):
[29] Discrete Wavelet Transform-Based Gaussian Mixture Model for Remote Sensing Image Compression
Xiang, Shao
Liang, Qiaokang
Fang, Leyuan
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2023, 61
[30] Multiple-Antenna Cooperative Spectrum Sensing Based on the Wavelet Transform and Gaussian Mixture Model
Zhang, Shunchao
Wang, Yonghua
Yuan, Hantao
Wan, Pin
Zhang, Yongwei
SENSORS, 2019, 19 (18)

← 1 2 3 4 5 →