Use of microphone array and model adaptation for hands-free speech acquisition and recognition

被引:3
|
作者
Chien, JT [1 ]
Lai, JR [1 ]
机构
[1] Natl Cheng Kung Univ, Dept Comp Sci & Informat Engn, Tainan 70101, Taiwan
关键词
microphone array; delay-and-sum beamformer; coherence measure; model adaptation; speech enhancement; speech recognition;
D O I
10.1023/B:VLSI.0000015093.07192.eb
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper presents a combined microphone array and model adaptation algorithm for hands-free speech recognition. Our purpose is to remove the inconvenience of using head-mounted/hand-holding microphone in conventional speech recognizer. To improve the speech quality with car noise interference, a linear microphone array is applied and acted as robust acquisition system. A time-domain coherence measure (TDCM) is applied to reliably estimate the time delay for speech signals collected by different microphones. The estimated delay is adopted in a delay-and-sum beamformer for speech enhancement. Further, we adapt the speech hidden Markov models to get close to the acoustic conditions of the enhanced test speech for robust speech recognition. In acquisition and recognition experiments using connected Chinese digits, we found that TDCM can effectively estimate the time delay. The increase in the speech sampling rate is helpful to determine the time delay. Incorporating the model adaptation scheme significantly reduces the recognition errors with moderate computation overhead.
引用
收藏
页码:141 / 151
页数:11
相关论文
共 50 条
  • [1] Use of Microphone Array and Model Adaptation for Hands-Free Speech Acquisition and Recognition
    Jen-Tzung Chien
    Jain-Ray Lai
    Journal of VLSI signal processing systems for signal, image and video technology, 2004, 36 : 141 - 151
  • [2] Hands-free speech recognition and communication on PDAS using microphone array technology
    Herbordt, W
    Horiuchi, T
    Fujimoto, M
    Jitsuhiro, T
    Nakamura, S
    2005 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING (ASRU), 2005, : 302 - 307
  • [3] Speech recognizer-based microphone array processing for robust hands-free speech recognition
    Seltzer, ML
    Raj, B
    Stern, RM
    2002 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-IV, PROCEEDINGS, 2002, : 897 - 900
  • [4] Microphone Array Beampattern Characterization for Hands-free Speech Applications
    Taghizadeh, Mohammad J.
    Garner, Philip N.
    Bourlard, Herve
    2012 IEEE 7TH SENSOR ARRAY AND MULTICHANNEL SIGNAL PROCESSING WORKSHOP (SAM), 2012, : 465 - 468
  • [5] Hands-free speech recognition based on 3-D viterbi search using a microphone array
    Yamada, T
    Nakamura, S
    Shikano, K
    PROCEEDINGS OF THE 1998 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-6, 1998, : 245 - 248
  • [6] Microphone array systems for hands-free telecommunication
    Elko, GW
    SPEECH COMMUNICATION, 1996, 20 (3-4) : 229 - 240
  • [7] Adaptive parameter compensation for robust hands-free speech recognition using a dual beamforming microphone array
    McCowan, IA
    Sridharan, S
    PROCEEDINGS OF 2001 INTERNATIONAL SYMPOSIUM ON INTELLIGENT MULTIMEDIA, VIDEO AND SPEECH PROCESSING, 2001, : 547 - 550
  • [8] On the joint use of noise reduction and MLLR adaptation for in-car hands-free speech recognition
    Matassoni, M
    Omologo, M
    Santarelli, A
    Svaizer, P
    2002 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-IV, PROCEEDINGS, 2002, : 289 - 292
  • [9] A microphone array solution for duplex hands-free communication systems
    Nordholm, Sven Erik
    Low, Siow Yong
    Dam, Hai Quang
    ICCE: 2007 DIGEST OF TECHNICAL PAPERS INTERNATIONAL CONFERENCE ON CONSUMER ELECTRONICS, 2007, : 7 - 8
  • [10] Fast dereverberation for hands-free speech recognition
    Gomez, Randy
    Even, Jani
    Saruwatari, Hiroshi
    Shikano, Kiyohiro
    2008 HANDS-FREE SPEECH COMMUNICATION AND MICROPHONE ARRAYS, 2008, : 141 - +