Two-stage model-based feature compensation for robust speech recognition

被引:0
|
作者
Haifeng Shen
Gang Liu
Jun Guo
机构
[1] Beijing University of Posts and Telecommunications,
来源
Computing | 2012年 / 94卷
关键词
Batch-EM algorithm; Model-based feature compensation; Robust speech recognition; 68T10;
D O I
暂无
中图分类号
学科分类号
摘要
This paper presents a combination approach to robust speech recognition by using two-stage model-based feature compensation. Gaussian mixture model (GMM)-based and hidden Markov model (HMM)-based compensation approaches are combined together and conducted sequentially in the multiple-decoding recognition system. The clean speech is firstly modeled as a GMM in the initial pass, and then modeled as a HMM generated from the initial pass in the following passes, respectively. The environment parameter estimation on these two modeling strategies are formulated both under maximum a posteriori (MAP) criterion. Experimental result shows that a significant improvement is achieved compared to European Telecommunications Standards Institute (ETSI) advanced compensation approach, GMM-based feature compensation approach, HMM-based feature compensation approach, and acoustic model compensation approach.
引用
收藏
页码:1 / 20
页数:19
相关论文
共 50 条
  • [1] Two-stage model-based feature compensation for robust speech recognition
    Shen, Haifeng
    Liu, Gang
    Guo, Jun
    [J]. COMPUTING, 2012, 94 (01) : 1 - 20
  • [2] Model-based feature compensation for robust speech recognition
    Shen, Haifeng
    Li, Qunxia
    Guo, Jun
    Liu, Gang
    [J]. FUNDAMENTA INFORMATICAE, 2006, 72 (04) : 529 - 539
  • [3] Predictive model-based compensation schemes for robust speech recognition
    Gales, MJF
    [J]. SPEECH COMMUNICATION, 1998, 25 (1-3) : 49 - 74
  • [4] Two-domain feature compensation for robust speech recognition
    Shen, HF
    Liu, G
    Guo, J
    Li, QX
    [J]. ADVANCES IN NEURAL NETWORKS - ISNN 2005, PT 2, PROCEEDINGS, 2005, 3497 : 351 - 356
  • [5] On stochastic feature and model compensation approaches to robust speech recognition
    Lee, CH
    [J]. SPEECH COMMUNICATION, 1998, 25 (1-3) : 29 - 47
  • [6] Two-Stage System for Robust Neutral/Lombard Speech Recognition
    Boril, Hynek
    Fousek, Petr
    Hoege, Harald
    [J]. INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4, 2007, : 2936 - +
  • [7] VTS feature compensation based on two-layer GMM structure for robust speech recognition
    Zhou, Lin
    Li, Haijing
    Chen, Ying
    Wu, Zhenyang
    Lu, Yong
    [J]. 2016 8TH INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS & SIGNAL PROCESSING (WCSP), 2016,
  • [8] A robust two-stage face recognition system with localisation error compensation
    Su, Ching-Yao
    Yang, Jar-Ferr
    [J]. IET COMPUTER VISION, 2014, 8 (06) : 690 - 700
  • [9] Model-Based Feature Enhancement for Reverberant Speech Recognition
    Krueger, Alexander
    Haeb-Umbach, Reinhold
    [J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2010, 18 (07): : 1692 - 1707
  • [10] Model-based feature enhancement for noisy speech recognition
    Couvreur, C
    Van hamme, H
    [J]. 2000 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, PROCEEDINGS, VOLS I-VI, 2000, : 1719 - 1722