A GMM-based handset selector for channel mismatch compensation with applications to speaker identification

Cited by: 0
Authors
Yiu, KK [1 ]
Mak, MW [1 ]
Kung, SY [1 ]
Affiliation
[1] Hong Kong Polytech Univ, Ctr Multimedia Signal Proc, Dept Elect & Informat Engn, Hong Kong, Hong Kong, Peoples R China
Keywords
DOI
Not available
Chinese Library Classification
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
In telephone-based speaker identification, variation in handset characteristics can introduce severe speech variability, even for speech uttered by the same speaker. This paper proposes a method to compensate for the variation in handset characteristics. In the method, a number of Gaussian mixture models (GMMs) are independently trained to identify the most likely handset given a test utterance. The identified handset is used to select a compensation vector from a set of pre-computed vectors, where the pre-computed vectors are the average frame-by-frame differences between the clean and distorted utterances. The clean features are then recovered by subtracting the selected compensation vector from the distorted vectors. Experimental results based on 138 speakers of the YOHO and telephone YOHO corpora show that the proposed approach is computationally efficient and is able to increase the identification accuracy from 17% (without compensation) to 85% (with compensation).
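The pipeline described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes diagonal-covariance GMMs whose parameters are already trained, it assumes the compensation vector is the mean of (distorted minus clean) so that subtracting it recovers the clean features, and all function names, handset labels, and toy numbers are hypothetical.

```python
import math

def gmm_log_likelihood(frames, weights, means, variances):
    """Average per-frame log-likelihood under a diagonal-covariance GMM."""
    total = 0.0
    for x in frames:
        comp_logs = []
        for w, mu, var in zip(weights, means, variances):
            log_p = math.log(w)
            for xi, mi, vi in zip(x, mu, var):
                log_p += -0.5 * (math.log(2 * math.pi * vi) + (xi - mi) ** 2 / vi)
            comp_logs.append(log_p)
        # log-sum-exp over mixture components for numerical stability
        m = max(comp_logs)
        total += m + math.log(sum(math.exp(c - m) for c in comp_logs))
    return total / len(frames)

def select_handset(frames, handset_gmms):
    """Return the label of the handset GMM that scores the utterance highest."""
    return max(handset_gmms,
               key=lambda name: gmm_log_likelihood(frames, *handset_gmms[name]))

def compensation_vector(clean_frames, distorted_frames):
    """Mean frame-by-frame difference (distorted - clean); sign is an assumption."""
    n = len(clean_frames)
    dim = len(clean_frames[0])
    return [sum(d[k] - c[k] for c, d in zip(clean_frames, distorted_frames)) / n
            for k in range(dim)]

def compensate(frames, vector):
    """Subtract the selected compensation vector from every distorted frame."""
    return [[xi - vi for xi, vi in zip(x, vector)] for x in frames]

# Toy 2-D example with two hypothetical handsets "A" and "B".
# Each entry is (weights, means, variances) of an already-trained GMM.
handset_gmms = {
    "A": ([0.5, 0.5], [[1.0, 0.0], [1.5, 0.5]], [[0.1, 0.1], [0.1, 0.1]]),
    "B": ([0.5, 0.5], [[-1.0, 0.0], [-1.5, -0.5]], [[0.1, 0.1], [0.1, 0.1]]),
}
# Pre-computed compensation vectors, one per handset (toy values).
vectors = {"A": [1.0, 0.0], "B": [-1.0, 0.0]}

test_frames = [[1.1, 0.1], [1.4, 0.4]]
handset = select_handset(test_frames, handset_gmms)
recovered = compensate(test_frames, vectors[handset])
print(handset, recovered)
```

In this sketch the handset decision is made once per utterance from the average log-likelihood, and the compensation itself is a single vector subtraction per frame, which is consistent with the abstract's claim that the approach is computationally cheap.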
Pages: 1132-1137
Number of pages: 6