An evaluation of adaptive beamformer based on average speech spectrum for noisy speech recognition

被引:0
|
作者
Nishiura, T [1 ]
Nakayama, M [1 ]
Nakamura, S [1 ]
机构
[1] ATR Spoken Language Translat Res Labs, Kyoto 6190288, Japan
关键词
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Distant-talking speech recognition in noisy environments is indispensable for self-moving robots or tele-conference systems. However, background noise and room reverberations seriously degrade the sound-capture quality in real acoustic environments. A microphone array is an ideal candidate as an effective method for capturing distant-talking speech. AMNOR (Adaptive Microphone-array for NOise Reduction) was proposed as an adaptive beamformer for capturing the desired distant signals in noisy environments by Kaneda et al. Although the AMNOR has been proven effective, it can be further improved if we know the spectrum characteristics of the desired distant signals in advance. Therefore, we regarded speech as a desired distant signal and designed an AMNOR based on the average speech spectrum. In this paper, we particularly focused on the performance of AMNOR based on the average speech spectrum for distant-talking speech capture and recognition. As a result of evaluation experiments in real acoustic environments, we confirmed that the ASR (Automatic Speech Recognition) performance was improved 5 - 10% by using an AMNOR based on the average speech spectrum in noisy environments. In addition, the proposed AMNOR provides better noise reduction performance than that of conventional AMNOR.
引用
收藏
页码:668 / 671
页数:4
相关论文
共 50 条
  • [21] Noisy Speech Recognition Based On RBF Neural Network
    Yan Gang
    Kong Haidong
    Yu Yang
    Zheng Xiaoxia
    [J]. ADVANCED MATERIALS AND INFORMATION TECHNOLOGY PROCESSING, PTS 1-3, 2011, 271-273 : 597 - 602
  • [22] EVALUATION OF ADAPTIVE SPEECH CODERS UNDER NOISY CHANNEL CONDITIONS
    SCAGLIOLA, C
    [J]. BELL SYSTEM TECHNICAL JOURNAL, 1979, 58 (06): : 1369 - 1394
  • [23] COMPARISON OF DIFFERENT SPEECH ENHANCEMENT METHODS ON RECOGNITION OF NOISY SPEECH
    AHMED, MS
    ALMARZOUG, AM
    [J]. ARABIAN JOURNAL FOR SCIENCE AND ENGINEERING, 1994, 19 (01): : 45 - 56
  • [24] Adaptive time segmentation of noisy speech for improved speech enhancement
    Hendriks, RC
    Heusdens, R
    Jensen, J
    [J]. 2005 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1-5: SPEECH PROCESSING, 2005, : 153 - 156
  • [25] Continuous Kannada Noisy Speech Recognition
    Pasha, Nadeem
    Roopa, S.
    [J]. 2018 INTERNATIONAL CONFERENCE ON RECENT INNOVATIONS IN ELECTRICAL, ELECTRONICS & COMMUNICATION ENGINEERING (ICRIEECE 2018), 2018, : 857 - 861
  • [26] Emotion recognition from noisy speech
    You, Mingyu
    Chen, Chun
    Bu, Jiajun
    Liu, Jia
    Tao, Jianhua
    [J]. 2006 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO - ICME 2006, VOLS 1-5, PROCEEDINGS, 2006, : 1653 - +
  • [27] Feature weighting in noisy speech recognition
    Huang, KC
    Juang, YT
    [J]. ELECTRONICS LETTERS, 2003, 39 (12) : 938 - 939
  • [28] A new noisy speech recognition method
    Zhao, XQ
    Wang, J
    [J]. International Symposium on Communications and Information Technologies 2005, Vols 1 and 2, Proceedings, 2005, : 282 - 286
  • [29] PROBLEMS AND SOLUTIONS FOR NOISY SPEECH RECOGNITION
    HATON, JP
    [J]. JOURNAL DE PHYSIQUE IV, 1994, 4 (C5): : 439 - 448
  • [30] SPEECH RECOGNITION IN NOISY ENVIRONMENTS - A SURVEY
    GONG, YF
    [J]. SPEECH COMMUNICATION, 1995, 16 (03) : 261 - 291