Speech recognition based on HMM decomposition and composition method with a microphone array in noisy reverberant environments

Cited by: 0
Authors
Miki, K
Nishiura, T
Nakamura, S
Shikano, K
Affiliations
[1] Nara Inst Sci & Technol, Grad Sch Informat Sci, Ikoma 6300101, Japan
[2] ATR Spoken Language Translat Res Labs, Kyoto 6190288, Japan
Keywords
hands-free; microphone array; HMM decomposition and composition; noisy and echo environment; speech recognition;
DOI
10.1002/ecjb.10068
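Chinese Library Classification
TM [Electrical Engineering]; TN [Electronic Technology, Communication Technology];
Subject classification code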
0808 ; 0809 ;
Abstract
Handling background noise and echo (reverberation) is essential if an automated robot or similar system is to recognize distant speech in a real environment. Several noise-handling schemes have been proposed for this problem, including model adaptation schemes such as HMM decomposition and composition, microphone array (beamformer) signal processing, and spectral subtraction. Model adaptation in particular is very effective for speech recognition in noisy environments, and its recognition performance improves as the signal-to-noise ratio (SNR) increases. This paper therefore attempts to improve recognition performance in low-SNR environments by first receiving the speech at a high SNR with a microphone array and then applying HMM decomposition and composition. Speech recognition experiments conducted in a noisy environment in an acoustic laboratory show that, when the SNR at a single microphone is 0 dB, the proposed method improves the recognition rate by about 25% compared with using either microphone array signal processing or HMM decomposition and composition alone. In addition, the proposed method achieves recognition performance comparable to that obtained by applying cepstrum mean normalization and spectral subtraction, with an optimal coefficient, to the speech after microphone array processing. (C) 2002 Wiley Periodicals, Inc.
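The abstract's idea of receiving speech at a higher SNR with a microphone array can be illustrated with a minimal sketch. The abstract does not say which beamformer the authors used, so the delay-and-sum design below is an assumption, as are the integer sample delays, the synthetic sine "speech", and the independent Gaussian noise at each microphone. Real ambient noise and reverberation are partly correlated across microphones, so the gain in practice is typically smaller than in this idealized case.

```python
import math
import random

def delay_and_sum(channels, delays):
    """Align each channel by its integer sample delay, then average them."""
    n = min(len(ch) - d for ch, d in zip(channels, delays))
    return [sum(ch[d + t] for ch, d in zip(channels, delays)) / len(channels)
            for t in range(n)]

def snr_db(clean, observed):
    """SNR in dB of `observed`, measured against the known clean signal."""
    sig = sum(s * s for s in clean)
    err = sum((o - s) ** 2 for s, o in zip(clean, observed))
    return 10.0 * math.log10(sig / err)

random.seed(0)
n_samples, n_mics = 4000, 8

# Synthetic stand-in for speech: a unit-amplitude sine (average power 0.5).
clean = [math.sin(2 * math.pi * 0.01 * t) for t in range(n_samples)]
# Noise power equal to signal power, i.e. about 0 dB SNR at each microphone.
noise_std = math.sqrt(0.5)

# Hypothetical arrival delays in samples, assumed known to the beamformer.
delays = list(range(n_mics))
channels = []
for d in delays:
    delayed = [0.0] * d + clean
    # Assumption: noise is independent from microphone to microphone.
    channels.append([x + random.gauss(0.0, noise_std) for x in delayed])

out = delay_and_sum(channels, delays)
single_snr = snr_db(clean, channels[0][:n_samples])
array_snr = snr_db(clean[:len(out)], out)
print(f"single microphone: {single_snr:.1f} dB, array output: {array_snr:.1f} dB")
```

Under these assumptions the array output gains roughly 10 log10(n_mics) dB over a single microphone. That gain is the kind of SNR improvement that a model adaptation stage such as HMM decomposition and composition could then build on.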
Pages: 13-22
Page count: 10
Related papers
50 in total
  • [31] TDOA ESTIMATION OF SPEECH SOURCE IN NOISY REVERBERANT ENVIRONMENTS
    Bu, Suliang
    Zhao, Tuo
    Zhao, Yunxin
    2022 IEEE SPOKEN LANGUAGE TECHNOLOGY WORKSHOP, SLT, 2022, : 1059 - 1066
  • [32] Source Separation in Noisy and Reverberant Environment using Miniature Microphone Array
    Li, Shuo
    Stanacevic, Milutin
    CONFERENCE RECORD OF THE 2014 FORTY-EIGHTH ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS & COMPUTERS, 2014, : 446 - 449
  • [33] A microphone array processing technique for speech enhancement in a reverberant space
    Liu, QG
    Champagne, B
    Kabal, P
    SPEECH COMMUNICATION, 1996, 18 (04) : 317 - 334
  • [34] A Posterior Approach for Microphone Array Based Speech Recognition
    Wang, Dong
    Himawan, Ivan
    Frankel, Joe
    King, Simon
    INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5, 2008, : 996 - 999
  • [35] Speech improvement in noisy reverberant environments using virtual microphones along with proposed array geometry
    Sadeghi, Mohammad Ebrahim
    Sheikhzadeh, Hamid
    Emadi, Mohammad Javad
    EURASIP JOURNAL ON ADVANCES IN SIGNAL PROCESSING, 2022, 2022 (01)
  • [36] Speech improvement in noisy reverberant environments using virtual microphones along with proposed array geometry
    Mohammad Ebrahim Sadeghi
    Hamid Sheikhzadeh
    Mohammad Javad Emadi
    EURASIP Journal on Advances in Signal Processing, 2022
  • [37] ROBUST SPEECH RECOGNITION IN UNKNOWN REVERBERANT AND NOISY CONDITIONS
    Hsiao, Roger
    Ma, Jeff
    Hartmann, William
    Karafiat, Martin
    Grezl, Frantisek
    Burget, Lukas
    Szoke, Igor
    Cernocky, Jan Honza
    Watanabe, Shinji
    Chen, Zhuo
    Mallidi, Sri Harish
    Hermansky, Hynek
    Tsakalidis, Stavros
    Schwartz, Richard
    2015 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING (ASRU), 2015, : 533 - 538
  • [38] Techniques for robust speech recognition in noisy and reverberant conditions
    Brown, GJ
    Palomäki, KJ
    SPEECH SEPARATION BY HUMANS AND MACHINES, 2005, : 213 - 220
  • [39] A PROGRESSIVE ENHANCEMENT METHOD FOR NOISY AND REVERBERANT SPEECH
    Shu, Xiaofeng
    Zhou, Yi
    Cao, Yin
    2018 IEEE 23RD INTERNATIONAL CONFERENCE ON DIGITAL SIGNAL PROCESSING (DSP), 2018,
  • [40] SPEECH RECOGNITION IN A NOISY AND REVERBERANT ENVIRONMENT WITH AND WITHOUT EARMUFFS
    PEKKARINEN, E
    VILJANEN, V
    SALMIVALLI, A
    SUONPAA, J
    AUDIOLOGY, 1990, 29 (05) : 286 - 293