Speech recognition based on HMM decomposition and composition method with a microphone array in noisy reverberant environments

Cited by: 0
Authors
Miki, K
Nishiura, T
Nakamura, S
Shikano, K
Affiliations
[1] Nara Inst Sci & Technol, Grad Sch Informat Sci, Ikoma 6300101, Japan
[2] ATR Spoken Language Translat Res Labs, Kyoto 6190288, Japan
Keywords
hands-free; microphone array; HMM decomposition and composition; noisy and echo environment; speech recognition;
DOI
10.1002/ecjb.10068
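Chinese Library Classification
TM [Electrical Engineering]; TN [Electronic Technology, Communication Technology];
Subject classification codes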
0808 ; 0809 ;
Abstract
Handling background noise and echo (reverberation) is very important for enabling systems such as automated robots to recognize distant speech in a real environment. Several schemes have been proposed to address this problem, including model adaptation schemes such as HMM decomposition and composition, microphone array (beamformer) signal processing, and spectral subtraction. In particular, model adaptation is very effective for speech recognition in a noisy environment, and its recognition performance increases with the signal-to-noise ratio (SNR). This paper attempts to improve recognition performance in a low-SNR environment by receiving speech at a high SNR with a microphone array before applying HMM decomposition and composition. Speech recognition experiments conducted in a noisy environment in an acoustic laboratory show that, when the SNR at a single microphone is 0 dB, the proposed method improves the recognition rate by about 25% compared with using either microphone array signal processing or HMM decomposition and composition alone. In addition, the proposed method shows recognition performance comparable to that obtained when cepstrum mean normalization and spectral subtraction with an optimal coefficient are applied to the speech after microphone array processing. © 2002 Wiley Periodicals, Inc.
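As a rough illustration of why a microphone array front end can raise the SNR before model adaptation, the following is a minimal sketch of a delay-and-sum beamformer. It is not the authors' implementation: the function names, the known integer sample delays, the sinusoidal "speech" signal, and the independent Gaussian noise at each microphone are all assumptions made for illustration, and reverberation is not modeled. Under these assumptions, averaging M aligned channels is expected to improve the SNR by roughly 10·log10(M) dB.

```python
import math
import random


def delay_and_sum(channels, delays):
    """Align each channel by its integer sample delay, then average them."""
    n = min(len(ch) - d for ch, d in zip(channels, delays))
    return [
        sum(ch[d + t] for ch, d in zip(channels, delays)) / len(channels)
        for t in range(n)
    ]


def snr_db(reference, observed):
    """SNR in dB of `observed` relative to the clean `reference`."""
    p_sig = sum(s * s for s in reference)
    p_err = sum((x - s) ** 2 for s, x in zip(reference, observed))
    return 10.0 * math.log10(p_sig / p_err)


def simulate(num_mics=8, num_samples=4000, seed=0):
    """Return (single-microphone SNR, beamformed SNR) in dB for a toy setup."""
    rng = random.Random(seed)
    # Toy stand-in for speech: a unit-amplitude sine, average power 0.5.
    clean = [math.sin(2 * math.pi * 0.01 * t) for t in range(num_samples)]
    # Noise power 0.5 per microphone gives about 0 dB SNR on one channel.
    sigma = math.sqrt(0.5)
    # Hypothetical geometry: microphone m receives the source m samples late.
    delays = list(range(num_mics))
    channels = []
    for d in delays:
        delayed = [0.0] * d + clean
        channels.append([x + rng.gauss(0.0, sigma) for x in delayed])
    beamformed = delay_and_sum(channels, delays)
    reference = clean[:len(beamformed)]
    single = channels[0][:len(beamformed)]
    return snr_db(reference, single), snr_db(reference, beamformed)


if __name__ == "__main__":
    single_snr, beam_snr = simulate()
    print(f"single microphone: {single_snr:.1f} dB")
    print(f"delay-and-sum:     {beam_snr:.1f} dB")
```

In a real room the delays would have to be estimated from the talker direction, and correlated noise and reverberation would reduce the gain below this idealized figure.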
Pages: 13-22
Number of pages: 10