Use of microphone array and model adaptation for hands-free speech acquisition and recognition

被引:3
|
作者
Chien, JT [1 ]
Lai, JR [1 ]
机构
[1] Natl Cheng Kung Univ, Dept Comp Sci & Informat Engn, Tainan 70101, Taiwan
关键词
microphone array; delay-and-sum beamformer; coherence measure; model adaptation; speech enhancement; speech recognition;
D O I
10.1023/B:VLSI.0000015093.07192.eb
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper presents a combined microphone array and model adaptation algorithm for hands-free speech recognition. Our purpose is to remove the inconvenience of using head-mounted/hand-holding microphone in conventional speech recognizer. To improve the speech quality with car noise interference, a linear microphone array is applied and acted as robust acquisition system. A time-domain coherence measure (TDCM) is applied to reliably estimate the time delay for speech signals collected by different microphones. The estimated delay is adopted in a delay-and-sum beamformer for speech enhancement. Further, we adapt the speech hidden Markov models to get close to the acoustic conditions of the enhanced test speech for robust speech recognition. In acquisition and recognition experiments using connected Chinese digits, we found that TDCM can effectively estimate the time delay. The increase in the speech sampling rate is helpful to determine the time delay. Incorporating the model adaptation scheme significantly reduces the recognition errors with moderate computation overhead.
引用
收藏
页码:141 / 151
页数:11
相关论文
共 50 条
  • [31] Achieving a hands-free computer interface using voice recognition and speech synthesis
    Evans, JR
    Tjoland, WA
    Allred, LG
    IEEE AEROSPACE AND ELECTRONIC SYSTEMS MAGAZINE, 2000, 15 (01) : 14 - 16
  • [32] Transforming HMMs for speaker-independent hands-free speech recognition in the car
    Gong, Y.
    Godfrey, John J.
    ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, 1999, 1 : 297 - 300
  • [33] Transforming HMMs for speaker-independent hands-free speech recognition in the car
    Gong, Y
    Godfrey, JJ
    ICASSP '99: 1999 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, PROCEEDINGS VOLS I-VI, 1999, : 297 - 300
  • [34] Using Eye Contact and Contextual Speech Recognition for Hands-Free Surgical Charting
    Lepinski, G. Julian
    Vertegaal, Roel
    2008 2ND INTERNATIONAL CONFERENCE ON PERVASIVE COMPUTING TECHNOLOGIES FOR HEALTHCARE, 2008, : 111 - 112
  • [35] Microphone array system for speech recognition
    Kiyohara, K
    Kaneda, Y
    Takahashi, S
    Nomura, H
    Kojima, J
    1997 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I - V: VOL I: PLENARY, EXPERT SUMMARIES, SPECIAL, AUDIO, UNDERWATER ACOUSTICS, VLSI; VOL II: SPEECH PROCESSING; VOL III: SPEECH PROCESSING, DIGITAL SIGNAL PROCESSING; VOL IV: MULTIDIMENSIONAL SIGNAL PROCESSING, NEURAL NETWORKS, 1997, : 215 - 218
  • [36] A New HMM Adaptation Approach for the Case of a Hands-free Speech Input in Reverberant Rooms
    Hirsch, Hans-Guenter
    Finster, Harald
    INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5, 2006, : 781 - 784
  • [37] Frame-synchronous noise compensation for hands-free speech recognition in car environments
    Chien, JT
    Lin, MS
    IEE PROCEEDINGS-VISION IMAGE AND SIGNAL PROCESSING, 2000, 147 (06): : 508 - 515
  • [38] Clinical use of a neck brace to improve hands-free speech in laryngectomized patients
    Dirven, Richard
    Kooijman, Piet G. C.
    Wouters, Yannick
    Marres, Henri A. M.
    LARYNGOSCOPE, 2012, 122 (06): : 1267 - 1272
  • [39] Target Acquisition by a Hands-free Wireless Tilt Mouse
    Blackmon, Ferrol R.
    Weeks, Michael
    2009 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN AND CYBERNETICS (SMC 2009), VOLS 1-9, 2009, : 33 - 38
  • [40] Speech and Hands-free Interaction: Myths, Challenges, and Opportunities
    Munteanu, Cosmin
    Penn, Gerald
    CHI 2018: EXTENDED ABSTRACTS OF THE 2018 CHI CONFERENCE ON HUMAN FACTORS IN COMPUTING SYSTEMS, 2018,