An evaluation of adaptive beamformer based on average speech spectrum for noisy speech recognition

被引:0
|
作者
Nishiura, T [1 ]
Nakayama, M [1 ]
Nakamura, S [1 ]
机构
[1] ATR, Spoken Language Translat Res Labs, Kyoto 6190288, Japan
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Distant-talking speech recognition in noisy environments is indispensable for self-moving robots or tele-conference systems. However, background noise and room reverberations seriously degrade the sound-capture quality in real acoustic environments. A microphone array is an ideal candidate as an effective method for capturing distant-talking speech. AMNOR (Adaptive Microphone-array for NOise Reduction) was proposed as an adaptive beamformer for capturing the desired distant signals in noisy environments by Kaneda et al. Although the AMNOR has been proven effective, it can be further improved if we know the spectrum characteristics of the desired distant signals in advance. Therefore, we regarded speech as a desired distant signal and designed an AMNOR based on the average speech spectrum. In this paper, we particularly focused on the performance of AMNOR based on the average speech spectrum for distant-talking speech capture and recognition. As a result of evaluation experiments in real acoustic environments, we confirmed that the ASR (Automatic Speech Recognition) performance was improved 5 - 10% by using an AMNOR based on the average speech spectrum in noisy environments. In addition, the proposed AMNOR provides better noise reduction performance than that of conventional AMNOR.
引用
收藏
页码:209 / 212
页数:4
相关论文
共 50 条
  • [42] Speech enhancement method based on feature compensation gain for effective speech recognition in noisy environments
    Bae, Ara
    Kim, Wooil
    JOURNAL OF THE ACOUSTICAL SOCIETY OF KOREA, 2019, 38 (01): : 51 - 55
  • [43] An effective cluster-based model for robust speech detection and speech recognition in noisy environments
    Górriz, J.M.
    Ramírez, J.
    Segura, J.C.
    Puntonet, C.G.
    Journal of the Acoustical Society of America, 2006, 120 (01): : 470 - 481
  • [44] MAP-based perceptual modeling for noisy speech recognition
    Sher, Yung-Ji
    Chen, Yeou-Jiunn
    Chiu, Yu-Hsien
    Chung, Kao-Chi
    Wu, Chung-Hsien
    JOURNAL OF INFORMATION SCIENCE AND ENGINEERING, 2006, 22 (05) : 999 - 1013
  • [45] Weighted method for noisy speech recognition based on loudness property
    Jiang, W.J.
    Lin, Y.R.
    Wei, G.
    Moshi Shibie yu Rengong Zhineng/Pattern Recognition and Artificial Intelligence, 2001, 14 (02):
  • [46] MAP-based perceptual modeling for noisy speech recognition
    Institute of Biomedical Engineering, National Cheng Kung University, Tainan, 701, Taiwan
    不详
    不详
    不详
    不详
    J. Inf. Sci. Eng., 2006, 5 (999-1013):
  • [47] Model-based feature enhancement for noisy speech recognition
    Couvreur, C
    Van hamme, H
    2000 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, PROCEEDINGS, VOLS I-VI, 2000, : 1719 - 1722
  • [48] Power Spectrum Difference Teager Energy Features for Speech Recognition in Noisy Environment
    Nehe, N. S.
    Holambe, R. S.
    IEEE REGION 10 COLLOQUIUM AND THIRD INTERNATIONAL CONFERENCE ON INDUSTRIAL AND INFORMATION SYSTEMS, VOLS 1 AND 2, 2008, : 223 - 227
  • [49] Selective Acoustic Feature Enhancement for Speech Emotion Recognition With Noisy Speech
    Leem, Seong-Gyun
    Fulford, Daniel
    Onnela, Jukka-Pekka
    Gard, David
    Busso, Carlos
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2024, 32 : 917 - 929
  • [50] Speech Enhancement and Recognition of Compressed Speech Signal in Noisy Reverberant Conditions
    Suman, Maloji
    Khan, Habibulla
    Latha, M. Madhavi
    Kumari, Devarakonda Aruna
    PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON INFORMATION SYSTEMS DESIGN AND INTELLIGENT APPLICATIONS 2012 (INDIA 2012), 2012, 132 : 379 - +