Two-stage model-based feature compensation for robust speech recognition

被引：0

作者：

Haifeng Shen

Gang Liu

Jun Guo

机构：

[1] Beijing University of Posts and Telecommunications,

来源：

Computing | 2012年 / 94卷

关键词：

Batch-EM algorithm; Model-based feature compensation; Robust speech recognition; 68T10;

D O I：

暂无

中图分类号：

学科分类号：

摘要：

This paper presents a combination approach to robust speech recognition by using two-stage model-based feature compensation. Gaussian mixture model (GMM)-based and hidden Markov model (HMM)-based compensation approaches are combined together and conducted sequentially in the multiple-decoding recognition system. The clean speech is firstly modeled as a GMM in the initial pass, and then modeled as a HMM generated from the initial pass in the following passes, respectively. The environment parameter estimation on these two modeling strategies are formulated both under maximum a posteriori (MAP) criterion. Experimental result shows that a significant improvement is achieved compared to European Telecommunications Standards Institute (ETSI) advanced compensation approach, GMM-based feature compensation approach, HMM-based feature compensation approach, and acoustic model compensation approach.

引用

页码：1 / 20

页数：19

共 50 条

[1] Two-stage model-based feature compensation for robust speech recognition
Shen, Haifeng
Liu, Gang
Guo, Jun
[J]. COMPUTING, 2012, 94 (01) : 1 - 20
[2] Model-based feature compensation for robust speech recognition
Shen, Haifeng
Li, Qunxia
Guo, Jun
Liu, Gang
[J]. FUNDAMENTA INFORMATICAE, 2006, 72 (04) : 529 - 539
[3] Predictive model-based compensation schemes for robust speech recognition
Gales, MJF
[J]. SPEECH COMMUNICATION, 1998, 25 (1-3) : 49 - 74
[4] Two-domain feature compensation for robust speech recognition
Shen, HF
Liu, G
Guo, J
Li, QX
[J]. ADVANCES IN NEURAL NETWORKS - ISNN 2005, PT 2, PROCEEDINGS, 2005, 3497 : 351 - 356
[5] On stochastic feature and model compensation approaches to robust speech recognition
Lee, CH
[J]. SPEECH COMMUNICATION, 1998, 25 (1-3) : 29 - 47
[6] Two-Stage System for Robust Neutral/Lombard Speech Recognition
Boril, Hynek
Fousek, Petr
Hoege, Harald
[J]. INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4, 2007, : 2936 - +
[7] VTS feature compensation based on two-layer GMM structure for robust speech recognition
Zhou, Lin
Li, Haijing
Chen, Ying
Wu, Zhenyang
Lu, Yong
[J]. 2016 8TH INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS & SIGNAL PROCESSING (WCSP), 2016,
[8] A robust two-stage face recognition system with localisation error compensation
Su, Ching-Yao
Yang, Jar-Ferr
[J]. IET COMPUTER VISION, 2014, 8 (06) : 690 - 700
[9] Model-Based Feature Enhancement for Reverberant Speech Recognition
Krueger, Alexander
Haeb-Umbach, Reinhold
[J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2010, 18 (07): : 1692 - 1707
[10] Model-based feature enhancement for noisy speech recognition
Couvreur, C
Van hamme, H
[J]. 2000 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, PROCEEDINGS, VOLS I-VI, 2000, : 1719 - 1722

← 1 2 3 4 5 →