Word graph based feature enhancement for noisy speech recognition

被引：0

作者：

Yan, Zhi-Jie ^{[1
]}

Soong, Frank K. ^{[2
]}

Wang, Ren-Hua ^{[1
]}

机构：

[1] Univ Sci & Technol China, iFlytek Speech Lab, Hefei 230027, Peoples R China

[2] Microsoft Res Asia, Beijing 100080, Peoples R China

来源：

2007 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL IV, PTS 1-3 | 2007年

关键词：

speech recognition; robustness; speech enhancement;

D O I：

暂无

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

This paper presents a word graph based feature enhancement method for robust speech recognition in noise. The approach uses signal processing based speech enhancement as a starting point, and then performs Wiener filtering to remove residual noise. During the process, a decoded word graph is used to directly guide the feature enhancement with respect to the HMM for recognition, so that the enhanced feature can match the clean speech model better in the acoustic space. The proposed word graph based feature enhancement method was tested on the Aurora 2 database. Experimental results show that an improved recognition performance can be obtained comparing with conventional signal processing based and GMM based feature enhancement methods. With signal processing based Weighted Noise Estimation and GMM based method, the relative error rate reductions are 35.44% and 42.58%, respectively. The proposed word graph based method improves the performance further, and a relative error rate reduction of 57.89% is obtained.

引用

页码：373 / +

页数：2

共 50 条

[1] Model-based feature enhancement for noisy speech recognition
Couvreur, C
Van hamme, H
[J]. 2000 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, PROCEEDINGS, VOLS I-VI, 2000, : 1719 - 1722
[2] Noisy speech recognition based on speech enhancement
Wang, Xia
Tang, Hongmei
Zhao, Xiaoqun
[J]. SNPD 2007: EIGHTH ACIS INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING, ARTIFICIAL INTELLIGENCE, NETWORKING, AND PARALLEL/DISTRIBUTED COMPUTING, VOL 3, PROCEEDINGS, 2007, : 713 - +
[3] Selective Acoustic Feature Enhancement for Speech Emotion Recognition With Noisy Speech
Leem, Seong-Gyun
Fulford, Daniel
Onnela, Jukka-Pekka
Gard, David
Busso, Carlos
[J]. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2024, 32 : 917 - 929
[4] Speech enhancement method based on feature compensation gain for effective speech recognition in noisy environments
Bae, Ara
Kim, Wooil
[J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF KOREA, 2019, 38 (01): : 51 - 55
[5] On the effectiveness of speech enhancement to a proposed speech recognition process that applied to noisy isolated-word recognition
Liu, Lih-Cherng
Lu, Ching-Ta
Tsai, Ho-Hsuan
[J]. PROCEEDINGS OF 2008 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-7, 2008, : 3310 - +
[6] Feature weighting in noisy speech recognition
Huang, KC
Juang, YT
[J]. ELECTRONICS LETTERS, 2003, 39 (12) : 938 - 939
[7] Recognition of Noisy Speech: A Comparative Survey of Robust Model Architecture and Feature Enhancement
Schuller, Bjoern
Woellmer, Martin
Moosmayr, Tobias
Rigoll, Gerhard
[J]. EURASIP JOURNAL ON AUDIO SPEECH AND MUSIC PROCESSING, 2009,
[8] Recognition of Noisy Speech: A Comparative Survey of Robust Model Architecture and Feature Enhancement
Björn Schuller
Martin Wöllmer
Tobias Moosmayr
Gerhard Rigoll
[J]. EURASIP Journal on Audio, Speech, and Music Processing, 2009
[9] Speech enhancement applied to speech recognition in noisy environments
[J]. Xu, Y.F., 2001, Press of Tsinghua University (41):
[10] Robust recognition of noisy speech using speech enhancement
Xu, YF
Zhang, JJ
Yao, KS
Cao, ZG
Ma, ZX
[J]. 2000 5TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING PROCEEDINGS, VOLS I-III, 2000, : 734 - 737

← 1 2 3 4 5 →