A GMM-based handset selector for channel mismatch compensation with applications to speaker identification

被引：0

作者：

Yiu, KK ^{[1
]}

Mak, MW ^{[1
]}

Kung, SY ^{[1
]}

机构：

[1] Hong Kong Polytech Univ, Ctr Multimedia Signal Proc, Dept Elect & Informat Engn, Hong Kong, Hong Kong, Peoples R China

来源：

ADVANCES IN MUTLIMEDIA INFORMATION PROCESSING - PCM 2001, PROCEEDINGS | 2001年 / 2195卷

关键词：

D O I：

暂无

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

In telephone-based speaker identification, variation in handset characteristics can introduce severe speech variability even for speech uttered by the same speaker. This paper proposes a method to compensate the variation in handset characteristics, In the method, a number of Gaussian mixture models axe independently trained to identify the most likely handset given a test utterance. The identified handset is used to select a compensation vector from a set of pre-computed vectors, where the pre-computed vectors axe the average frame-by-frame differences between the clean and distorted utterances. The clean features are then recovered by subtracting the selected compensation vector from the distorted vectors. Experimental results based on 138 speakers of the YOHO and telephone YOHO corpora show that the proposed approach is computationally efficient and is able to increase the accuracy from 17% (without compensation) to 85% (with compensation).

引用

下载

页码：1132 / 1137

页数：6

共 50 条

[1] FPGA Implementation for GMM-Based Speaker Identification
EhKan, Phaklen
Allen, Timothy
Quigley, Steven F.
INTERNATIONAL JOURNAL OF RECONFIGURABLE COMPUTING, 2011, 2011
[2] A GMM-Based Speaker Identification System on FPGA
Kan, Phak Len Eh
Allen, Tim
Quigley, Steven F.
RECONFIGURABLE COMPUTING: ARCHITECTURES, TOOLS AND APPLICATIONS, 2010, 5992 : 358 - 363
[3] An Improved GMM-based Clustering Algorithm for Efficient Speaker Identification
Lin, Wenyong
PROCEEDINGS OF 2015 4TH INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND NETWORK TECHNOLOGY (ICCSNT 2015), 2015, : 1490 - 1493
[4] Speaker and session variability in GMM-based speaker verification
Kenny, Patrick
Boulianne, Gilles
Ouellet, Pierre
Dumouchel, Pierre
IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2007, 15 (04): : 1448 - 1460
[5] Experimental Study on GMM-Based Speaker Recognition
Ye, Wenxing
Wu, Dapeng
Nucci, Antonio
MOBILE MULTIMEDIA/IMAGE PROCESSING, SECURITY, AND APPLICATIONS 2010, 2010, 7708
[6] Quantization for adapted GMM-based speaker verification
Tseng, Ivy H.
Verscheure, Olivier
Turaga, Deepak S.
Chaudhari, Upendra V.
2006 IEEE International Conference on Acoustics, Speech and Signal Processing, Vols 1-13, 2006, : 653 - 656
[7] Maximum Likelihood A Priori Knowledge Interpolation-Based Handset Mismatch Compensation for Robust Speaker Identification
廖元甫
庄智显
杨智合
Tsinghua Science and Technology, 2008, (04) : 528 - 532
[8] Maximum Likelihood A Priori Knowledge Interpolation-Based Handset Mismatch Compensation for Robust Speaker Identification
Department of Electronic Engineering, Taipei University of Technology, Taipei, 106, Taiwan
不详
Tsinghua Sci. Tech., 2008, 4 (528-532): : 528 - 532
[9] A GMM-based Probabilistic Sequence Kernel for Speaker Verification
Lee, Kong-Aik
You, Changhuai
Li, Haizhou
Kinnunen, Tomi
INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4, 2007, : 1553 - 1556
[10] Evaluation of GMM-based Features for SVM Speaker Verification
Liu, Minghui
Huang, Zhongwei
2008 7TH WORLD CONGRESS ON INTELLIGENT CONTROL AND AUTOMATION, VOLS 1-23, 2008, : 5027 - 5030

← 1 2 3 4 5 →