Speech recognition based on HMM decomposition and composition method with a microphone array in noisy reverberant environments

被引:0
|
作者
Miki, K
Nishiura, T
Nakamura, S
Shikano, K
机构
[1] Nara Inst Sci & Technol, Grad Sch Informat Sci, Ikoma 6300101, Japan
[2] ATR Spoken Language Translat Res Labs, Kyoto 6190288, Japan
关键词
hands-free; microphone array; HMM decomposition and composition; noisy and echo environment; speech recognition;
D O I
10.1002/ecjb.10068
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Handling background noise or echo (reverberation) etc. is very important for having an automated robot etc. recognize remote speech in a real environment. As effective schemes for handling this problem, noise reducing schemes such as model adaptation schemes including HMM decomposition and composition or microphone array (beam-former) signal processing, spectral subtraction, etc. have been proposed. In particular, a model adaptation scheme is very effective for speech recognition in a noisy environment and its recognition performance increases in proportion to the signal-to-noise ratio (SNR). In this paper, improving the recognition performance in a low-SNR environment by receiving speech at a high SNR using a: microphone array before HMM decomposition and composition is attempted. The results of speech recognition experiments conducted in a noisy environment in an acoustic laboratory show an improvement in the recognition rate of about 25% by the proposed method for the case in which the SNR in a single microphone is 0 dB, As compared with the cases of using microphone array signal processing, HMM decomposition and composition. alone. In addition, the proposed method shows recognition performance comparable to the case of using cepstrum mean normalization and spectral subtraction performed with an optimal coefficient given to the speech after microphone array processing. (C) 2002 Wiley Periodicals, Inc.
引用
收藏
页码:13 / 22
页数:10
相关论文
共 50 条
  • [1] Experiments of speech recognition in a noisy and reverberant environment using a microphone array and HMM adaptation
    Giuliani, D
    Omologo, M
    Svaizer, P
    ICSLP 96 - FOURTH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, VOLS 1-4, 1996, : 1329 - 1332
  • [2] Model Adaptation by HMM Decomposition and Composition in Noisy Reverberant Environments
    Takiguchi, Tetsuya
    Nakamura, Satoshi
    Shikano, Kiyohiro
    1600, John Wiley and Sons Inc. (31): : 4 - 6
  • [3] DISTANT SPEECH RECOGNITION IN REVERBERANT NOISY CONDITIONS EMPLOYING A MICROPHONE ARRAY
    Morales-Cordovilla, Juan A.
    Hagmueller, Martin
    Pessentheiner, Hannes
    Kubin, Gernot
    2014 PROCEEDINGS OF THE 22ND EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2014, : 2380 - 2384
  • [4] Model adaptation based on HMM decomposition for reverberant speech recognition
    Takiguchi, T
    Nakamura, S
    Huo, Q
    Shikano, K
    1997 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I - V: VOL I: PLENARY, EXPERT SUMMARIES, SPECIAL, AUDIO, UNDERWATER ACOUSTICS, VLSI; VOL II: SPEECH PROCESSING; VOL III: SPEECH PROCESSING, DIGITAL SIGNAL PROCESSING; VOL IV: MULTIDIMENSIONAL SIGNAL PROCESSING, NEURAL NETWORKS - VOL V: STATISTICAL SIGNAL AND ARRAY PROCESSING, APPLICATIONS, 1997, : 827 - 830
  • [5] Speech Emotion Recognition in Noisy and Reverberant Environments
    Heracleous, Panikos
    Yasuda, Keiji
    Sugaya, Fumiaki
    Yoneyama, Akio
    Hashimoto, Masayuki
    2017 SEVENTH INTERNATIONAL CONFERENCE ON AFFECTIVE COMPUTING AND INTELLIGENT INTERACTION (ACII), 2017, : 262 - 266
  • [6] HMM-Based Multipitch Tracking for Noisy and Reverberant Speech
    Jin, Zhaozhang
    Wang, DeLiang
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2011, 19 (05): : 1091 - 1102
  • [7] SPEECH RECOGNITION IN NOISY ENVIRONMENTS WITH THE AID OF MICROPHONE ARRAYS
    VANCOMPERNOLLE, D
    MA, W
    XIE, F
    VANDIEST, M
    SPEECH COMMUNICATION, 1990, 9 (5-6) : 433 - 442
  • [8] HMM adaptation and microphone array processing for distant speech recognition
    Kleban, J
    Gong, YF
    2000 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, PROCEEDINGS, VOLS I-VI, 2000, : 1411 - 1414
  • [9] Two-Stage Enhancement of Noisy and Reverberant Microphone Array Speech for Automatic Speech Recognition Systems Trained with Only Clean Speech
    Wang, Quandong
    Wang, Sicheng
    Ge, Fengpei
    Han, Chang Woo
    Lee, Jaewon
    Guo, Lianghao
    Lee, Chin-Hui
    2018 11TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING (ISCSLP), 2018, : 21 - 25
  • [10] Subband parameter optimization of microphone arrays for speech recognition in reverberant environments
    Seltzer, ML
    Stern, RM
    2003 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL I, PROCEEDINGS: SPEECH PROCESSING I, 2003, : 408 - 411