A noise-robust speech input interface for information kiosk terminals

被引:1
|
作者
Ida, M [1 ]
Mori, H
Nakamura, S
Shikano, K
机构
[1] OMRON Co, Informat Technol Res Ctr, Kyoto 6008530, Japan
[2] Nara Inst Sci & Technol, Grad Sch Informat Sci, Ikoma 6300101, Japan
关键词
speech recognition; noise reduction; microphone array; spectral subtraction;
D O I
10.1002/ecjb.20135
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
In social amenity apparatus such as information terminals or oil-line shopping terminals and ticket vending machines, more natural and smoother communication between machines and humans is made possible by incorporating speech recognition, thus rapidly enhancing operation and convenience. In this paper, we will describe the trial production of an information kiosk terminal equipped with speech recognition and the noise reduction technique adopted as a speech input interface. In general, the aforementioned system equipment is installed in public spaces such as stations. Therefore, robust speech recognition performance is required even in the worst noise environments. For the realization of this antinoise capability, we have used noise reduction based on both a microphone array and spectral subtraction (SS). Experiments with evaluation data recorded in real environments indicate that a recognition performance of over 90% in isolated-word recognition of 216 words uttered by all unspecified speaker is achieved by the combined use of a 32-channel microphone array and SS. (C) 2004 Wiley Periodicals, Inc.
引用
收藏
页码:51 / 61
页数:11
相关论文
共 50 条
  • [21] A speech emphasis method for noise-robust speech recognition by using repetitive phrase
    Hirai, Takanori
    Kuroiwa, Shingo
    Tsuge, Satoru
    Ren, Fuji
    Fattah, Mohamed Abdel
    [J]. 2006 10TH INTERNATIONAL CONFERENCE ON COMMUNICATION TECHNOLOGY, VOLS 1 AND 2, PROCEEDINGS, 2006, : 1269 - +
  • [22] An improved algorithm for noise-robust sparse linear prediction of speech
    Zhou, Bin
    Zou, Xia
    Zhang, Xiongwei
    [J]. Shengxue Xuebao/Acta Acustica, 2014, 39 (05): : 655 - 662
  • [23] NOISE-ROBUST DETECTION OF PEAK-CLIPPING IN DECODED SPEECH
    Eaton, James
    Naylor, Patrick A.
    [J]. 2014 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2014,
  • [24] MULTI-TASK AUTOENCODER FOR NOISE-ROBUST SPEECH RECOGNITION
    Zhang, Haoyi
    Liu, Conggui
    Inoue, Nakamasa
    Shinoda, Koichi
    [J]. 2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2018, : 5599 - 5603
  • [25] Empirical Mode Decomposition For Noise-Robust Automatic Speech Recognition
    Wu, Kuo-Hao
    Chen, Chia-Ping
    [J]. 11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 3 AND 4, 2010, : 2074 - 2077
  • [26] INCORPORATING MASK MODELLING FOR NOISE-ROBUST AUTOMATIC SPEECH RECOGNITION
    Koekueer, Muenevver
    Jancovic, Peter
    [J]. 2009 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1- 8, PROCEEDINGS, 2009, : 3929 - 3932
  • [27] Noise-Robust Speech Recognition Based on RBF Neural Network
    Hou, Xuemei
    [J]. HIGH PERFORMANCE STRUCTURES AND MATERIALS ENGINEERING, PTS 1 AND 2, 2011, 217-218 : 413 - 418
  • [28] On the temporal decorrelation of feature parameters for noise-robust speech recognition
    Jung, HY
    Lee, SY
    [J]. IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2000, 8 (04): : 407 - 416
  • [29] Deep Maxout Networks Applied to Noise-Robust Speech Recognition
    de-la-Calle-Silos, F.
    Gallardo-Antolin, A.
    Pelaez-Moreno, C.
    [J]. ADVANCES IN SPEECH AND LANGUAGE TECHNOLOGIES FOR IBERIAN LANGUAGES, IBERSPEECH 2014, 2014, 8854 : 109 - 118
  • [30] An improved algorithm for noise-robust sparse linear prediction of speech
    ZHOU Bin
    ZOU Xia
    ZHANG Xiongwei
    [J]. Chinese Journal of Acoustics, 2015, 34 (01) : 84 - 95