Use of microphone array and model adaptation for hands-free speech acquisition and recognition

Cited by: 3

Authors:

Chien, JT [1]

Lai, JR [1]

Affiliations:

[1] Natl Cheng Kung Univ, Dept Comp Sci & Informat Engn, Tainan 70101, Taiwan

Source:

JOURNAL OF VLSI SIGNAL PROCESSING SYSTEMS FOR SIGNAL IMAGE AND VIDEO TECHNOLOGY | 2004 / Vol. 36 / Issue 2-3

Keywords:

microphone array; delay-and-sum beamformer; coherence measure; model adaptation; speech enhancement; speech recognition;

DOI:

10.1023/B:VLSI.0000015093.07192.eb

Chinese Library Classification (CLC) number:

TP [Automation Technology, Computer Technology];

Discipline classification code:

0812 ;

Abstract:

This paper presents a combined microphone array and model adaptation algorithm for hands-free speech recognition. Our aim is to remove the inconvenience of using a head-mounted or hand-held microphone with a conventional speech recognizer. To improve speech quality under car noise interference, a linear microphone array is used as a robust acquisition system. A time-domain coherence measure (TDCM) is applied to reliably estimate the time delay between speech signals collected by different microphones. The estimated delay is then used in a delay-and-sum beamformer for speech enhancement. Further, we adapt the speech hidden Markov models toward the acoustic conditions of the enhanced test speech for robust speech recognition. In acquisition and recognition experiments using connected Chinese digits, we found that TDCM can effectively estimate the time delay and that increasing the speech sampling rate helps in determining the time delay. Incorporating the model adaptation scheme significantly reduces recognition errors with moderate computational overhead.
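
The two acquisition steps named in the abstract, time-delay estimation followed by delay-and-sum beamforming, can be illustrated with a minimal sketch. It is not the authors' method: the record does not define TDCM, so a plain normalized cross-correlation stands in for it here. The function names `estimate_delay` and `delay_and_sum`, the `max_lag` parameter, and the restriction to integer-sample delays are all assumptions made for illustration.

```python
import numpy as np


def estimate_delay(ref, sig, max_lag):
    """Return the integer lag (in samples) at which `sig` best matches `ref`.

    A positive result means `sig` arrives later than `ref`.
    Assumption: normalized cross-correlation is used as a stand-in for
    the paper's TDCM, which this record does not specify.
    """
    ref = np.asarray(ref, dtype=float)
    sig = np.asarray(sig, dtype=float)
    n = min(len(ref), len(sig))
    best_lag, best_score = 0, -np.inf
    for lag in range(-max_lag, max_lag + 1):
        # Compare only the samples that overlap at this lag.
        if lag >= 0:
            a, b = ref[:n - lag], sig[lag:n]
        else:
            a, b = ref[-lag:n], sig[:n + lag]
        denom = np.linalg.norm(a) * np.linalg.norm(b)
        score = float(np.dot(a, b) / denom) if denom > 0 else 0.0
        if score > best_score:
            best_lag, best_score = lag, score
    return best_lag


def delay_and_sum(channels, delays):
    """Advance each channel by its delay and average the aligned signals."""
    n = min(len(c) for c in channels)
    min_d, max_d = min(delays), max(delays)
    # Keep only the samples that are valid for every channel after shifting.
    length = n - (max_d - min_d)
    aligned = [
        np.asarray(c, dtype=float)[d - min_d:d - min_d + length]
        for c, d in zip(channels, delays)
    ]
    return np.mean(aligned, axis=0)
```

A typical use would estimate each microphone's delay against one reference channel, for example `delays = [estimate_delay(channels[0], c, 10) for c in channels]`, and then pass the result to `delay_and_sum(channels, delays)`. Because this sketch resolves delays only to whole samples, a higher sampling rate gives a finer delay grid, which is consistent with the abstract's remark that increasing the sampling rate helps in determining the time delay.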

Pages: 141-151

Page count: 11
