Cepstral statistics compensation and normalization using online pseudo stereo codebooks for robust speech recognition in additive noise environments

被引:1
|
作者
Hung, Jeih-Weih [1 ]
机构
[1] Natl Chi Nan Univ, Dept Elect Engn, Nantou County, Taiwan
关键词
cepstral statistics compensation; pseudo stereo codebooks; linear least squares; quadratic least squares;
D O I
10.1093/ietisy/e91-d.2.296
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper proposes several cepstral statistics compensation and normalization algorithms which alleviate the effect of additive noise on cepstral features for speech recognition. The algorithms are simple yet efficient noise reduction techniques that use online-constructed pseudo-stereo codebooks to evaluate the statistics in both clean and noisy environments. The process yields transformations for both clean speech cepstra and noise-corrupted speech cepstra, or for noise-corrupted speech cepstra only, so that the statistics of the transformed speech cepstra are similar for both environments. Experimental results show that these codebook-based algorithms can provide significant performance gains compared to results obtained by using conventional utterance-based normalization approaches. The proposed codebook-based cesptral mean and variance normalization (C-CMVN), linear least squares (LLS) and quadratic least squares (QLS) outperform utterance-based CMVN (U-CMVN) by 26.03%, 22.72% and 27.48%, respectively, in relative word error rate reduction for experiments conducted on Test Set A of the Aurora-2 digit database.
引用
收藏
页码:296 / 311
页数:16
相关论文
共 38 条
  • [21] A Noise Robust Speech Recognition Method Using Model Compensation Based on Speech Enhancement
    Shen, Guanghu
    Jung, Ho-Youl
    Chung, Hyun-Yeol
    JOURNAL OF THE ACOUSTICAL SOCIETY OF KOREA, 2008, 27 (04): : 191 - 199
  • [22] Noise Robust Speech Recognition Based on Noise-Adapted HMMs Using Speech Feature Compensation
    Chung, Yong-Joo
    2013 INTERNATIONAL CONFERENCE ON ADVANCED COMPUTER SCIENCE APPLICATIONS AND TECHNOLOGIES (ACSAT), 2014, : 132 - 135
  • [23] Robust speech recognition by model adaptation and normalization using pre-observed noise
    Kobashikawa, Satoshi
    Takahashi, Satoshi
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2008, E91D (03): : 422 - 429
  • [24] Noise robust speech recognition using delta-cepstrum normalization and channel selection
    Obuchi, Yasunari
    ELECTRONICS AND COMMUNICATIONS IN JAPAN PART II-ELECTRONICS, 2006, 89 (07): : 9 - 20
  • [25] ROBUST SPEECH RECOGNITION IN ADDITIVE AND CONVOLUTIONAL NOISE USING PARALLEL MODEL COMBINATION
    GALES, MJF
    YOUNG, SJ
    COMPUTER SPEECH AND LANGUAGE, 1995, 9 (04): : 289 - 307
  • [26] Robust speech recognition using a cepstral minimum-mean-square-error-motivated noise suppressor
    Yu, Dong
    Deng, Li
    Droppo, Jasha
    Wu, Jian
    Gong, Yian
    Acero, Alex
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2008, 16 (05): : 1061 - 1070
  • [27] Robust Speech Recognition for Similar Pronunciation Phrases Using MMSE under Noise Environments
    Watanabe, Masumi
    Tsutsui, Hiroshi
    Miyanaga, Yoshikazu
    2013 13TH INTERNATIONAL SYMPOSIUM ON COMMUNICATIONS AND INFORMATION TECHNOLOGIES (ISCIT): COMMUNICATION AND INFORMATION TECHNOLOGY FOR NEW LIFE STYLE BEYOND THE CLOUD, 2013, : 802 - 807
  • [28] Nonlinear Compensation Using the Gauss-Newton Method for Noise-Robust Speech Recognition
    Zhao, Yong
    Juang, Biing-Hwang
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2012, 20 (08): : 2191 - 2206
  • [29] Noise Robust Speech Recognition Using Parallel Model Compensation and Voice Activity Detection Methods
    Hizlisoy, Serhat
    Tufekci, Zekeriya
    2016 5TH INTERNATIONAL CONFERENCE ON ELECTRONIC DEVICES, SYSTEMS AND APPLICATIONS (ICEDSA), 2016,
  • [30] Recursive noise estimation using iterative stochastic approximation for stereo-based robust speech recognition
    Li, D
    Droppo, J
    Acero, A
    ASRU 2001: IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING, CONFERENCE PROCEEDINGS, 2001, : 81 - 84