Cepstral statistics compensation and normalization using online pseudo stereo codebooks for robust speech recognition in additive noise environments

被引：1

作者：

Hung, Jeih-Weih ^{[1
]}

机构：

[1] Natl Chi Nan Univ, Dept Elect Engn, Nantou County, Taiwan

来源：

IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS | 2008年 / E91D卷 / 02期

关键词：

cepstral statistics compensation; pseudo stereo codebooks; linear least squares; quadratic least squares;

D O I：

10.1093/ietisy/e91-d.2.296

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

This paper proposes several cepstral statistics compensation and normalization algorithms which alleviate the effect of additive noise on cepstral features for speech recognition. The algorithms are simple yet efficient noise reduction techniques that use online-constructed pseudo-stereo codebooks to evaluate the statistics in both clean and noisy environments. The process yields transformations for both clean speech cepstra and noise-corrupted speech cepstra, or for noise-corrupted speech cepstra only, so that the statistics of the transformed speech cepstra are similar for both environments. Experimental results show that these codebook-based algorithms can provide significant performance gains compared to results obtained by using conventional utterance-based normalization approaches. The proposed codebook-based cesptral mean and variance normalization (C-CMVN), linear least squares (LLS) and quadratic least squares (QLS) outperform utterance-based CMVN (U-CMVN) by 26.03%, 22.72% and 27.48%, respectively, in relative word error rate reduction for experiments conducted on Test Set A of the Aurora-2 digit database.

引用

页码：296 / 311

页数：16

共 38 条

[21] A Noise Robust Speech Recognition Method Using Model Compensation Based on Speech Enhancement
Shen, Guanghu
Jung, Ho-Youl
Chung, Hyun-Yeol
JOURNAL OF THE ACOUSTICAL SOCIETY OF KOREA, 2008, 27 (04): : 191 - 199
[22] Noise Robust Speech Recognition Based on Noise-Adapted HMMs Using Speech Feature Compensation
Chung, Yong-Joo
2013 INTERNATIONAL CONFERENCE ON ADVANCED COMPUTER SCIENCE APPLICATIONS AND TECHNOLOGIES (ACSAT), 2014, : 132 - 135
[23] Robust speech recognition by model adaptation and normalization using pre-observed noise
Kobashikawa, Satoshi
Takahashi, Satoshi
IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2008, E91D (03): : 422 - 429
[24] Noise robust speech recognition using delta-cepstrum normalization and channel selection
Obuchi, Yasunari
ELECTRONICS AND COMMUNICATIONS IN JAPAN PART II-ELECTRONICS, 2006, 89 (07): : 9 - 20
[25] ROBUST SPEECH RECOGNITION IN ADDITIVE AND CONVOLUTIONAL NOISE USING PARALLEL MODEL COMBINATION
GALES, MJF
YOUNG, SJ
COMPUTER SPEECH AND LANGUAGE, 1995, 9 (04): : 289 - 307
[26] Robust speech recognition using a cepstral minimum-mean-square-error-motivated noise suppressor
Yu, Dong
Deng, Li
Droppo, Jasha
Wu, Jian
Gong, Yian
Acero, Alex
IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2008, 16 (05): : 1061 - 1070
[27] Robust Speech Recognition for Similar Pronunciation Phrases Using MMSE under Noise Environments
Watanabe, Masumi
Tsutsui, Hiroshi
Miyanaga, Yoshikazu
2013 13TH INTERNATIONAL SYMPOSIUM ON COMMUNICATIONS AND INFORMATION TECHNOLOGIES (ISCIT): COMMUNICATION AND INFORMATION TECHNOLOGY FOR NEW LIFE STYLE BEYOND THE CLOUD, 2013, : 802 - 807
[28] Nonlinear Compensation Using the Gauss-Newton Method for Noise-Robust Speech Recognition
Zhao, Yong
Juang, Biing-Hwang
IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2012, 20 (08): : 2191 - 2206
[29] Noise Robust Speech Recognition Using Parallel Model Compensation and Voice Activity Detection Methods
Hizlisoy, Serhat
Tufekci, Zekeriya
2016 5TH INTERNATIONAL CONFERENCE ON ELECTRONIC DEVICES, SYSTEMS AND APPLICATIONS (ICEDSA), 2016,
[30] Recursive noise estimation using iterative stochastic approximation for stereo-based robust speech recognition
Li, D
Droppo, J
Acero, A
ASRU 2001: IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING, CONFERENCE PROCEEDINGS, 2001, : 81 - 84

← 1 2 3 4 →