Hands-free speech recognition and communication on PDAs using microphone array technology

Cited by: 0
Authors
Herbordt, W [1 ]
Horiuchi, T [1 ]
Fujimoto, M [1 ]
Jitsuhiro, T [1 ]
Nakamura, S [1 ]
Affiliation
[1] ATR, Spoken Language Commun Res Labs, Kyoto 6190288, Japan
Keywords
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
In this paper, a personal digital assistant (PDA) for hands-free speech recognition and communication with a microphone array mounted on the PDA is presented. An outlier-robust generalized sidelobe canceller (RGSC) and a minimum mean-squared error (MMSE) estimator for log Mel-spectral energy coefficients using a Gaussian mixture model (GMM) for clean speech are implemented in real time and evaluated for speech recognition on a small experimental multichannel database. It is shown that the joint system of beamformer and single-channel noise suppression substantially improves the noise robustness of a large-vocabulary speech recognizer, so that more than 91% word accuracy is obtained down to SNR = 5 dB.
Pages: 302 - 307
Number of pages: 6
Related papers
50 entries in total
  • [1] Use of Microphone Array and Model Adaptation for Hands-Free Speech Acquisition and Recognition
    Jen-Tzung Chien
    Jain-Ray Lai
    [J]. Journal of VLSI signal processing systems for signal, image and video technology, 2004, 36 : 141 - 151
  • [2] Use of microphone array and model adaptation for hands-free speech acquisition and recognition
    Chien, JT
    Lai, JR
    [J]. JOURNAL OF VLSI SIGNAL PROCESSING SYSTEMS FOR SIGNAL IMAGE AND VIDEO TECHNOLOGY, 2004, 36 (2-3) : 141 - 151
  • [3] Speech recognizer-based microphone array processing for robust hands-free speech recognition
    Seltzer, ML
    Raj, B
    Stern, RM
    [J]. 2002 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-IV, PROCEEDINGS, 2002, : 897 - 900
  • [4] Hands-free speech recognition based on 3-D viterbi search using a microphone array
    Yamada, T
    Nakamura, S
    Shikano, K
    [J]. PROCEEDINGS OF THE 1998 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-6, 1998, : 245 - 248
  • [5] Adaptive parameter compensation for robust hands-free speech recognition using a dual beamforming microphone array
    McCowan, IA
    Sridharan, S
    [J]. PROCEEDINGS OF 2001 INTERNATIONAL SYMPOSIUM ON INTELLIGENT MULTIMEDIA, VIDEO AND SPEECH PROCESSING, 2001, : 547 - 550
  • [6] Microphone Array Beampattern Characterization for Hands-free Speech Applications
    Taghizadeh, Mohammad J.
    Garner, Philip N.
    Bourlard, Herve
    [J]. 2012 IEEE 7TH SENSOR ARRAY AND MULTICHANNEL SIGNAL PROCESSING WORKSHOP (SAM), 2012, : 465 - 468
  • [7] A microphone array solution for duplex hands-free communication systems
    Nordholm, Sven Erik
    Low, Siow Yong
    Dam, Hai Quang
    [J]. ICCE: 2007 DIGEST OF TECHNICAL PAPERS INTERNATIONAL CONFERENCE ON CONSUMER ELECTRONICS, 2007, : 7 - 8
  • [8] Microphone array systems for hands-free telecommunication
    Elko, GW
    [J]. SPEECH COMMUNICATION, 1996, 20 (3-4) : 229 - 240
  • [9] Bridging the gap: Towards a unified framework for hands-free speech recognition using microphone arrays
    Seltzer, Michael L.
    [J]. 2008 HANDS-FREE SPEECH COMMUNICATION AND MICROPHONE ARRAYS, 2008, : 105 - 108
  • [10] Noise robust hands-free speech recognition using microphone array and Kalman filter as front-end system of conversational TV
    Fujimoto, M
    Ariki, Y
    [J]. PROCEEDINGS OF THE 2002 IEEE WORKSHOP ON MULTIMEDIA SIGNAL PROCESSING, 2002, : 268 - 271