A New HMM Adaptation Approach for the Case of a Hands-free Speech Input in Reverberant Rooms

被引:0
|
作者
Hirsch, Hans-Guenter [1 ]
Finster, Harald [1 ]
机构
[1] Niederrhein Univ Appl Sci, Krefeld, Germany
关键词
robust speech recognition; HMM adaptation; hands-free speech input; reverberation;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
A new method is presented for adapting the HMMs of a speech recognition system to the condition of a hands-free speech input in a room environment. The reverberation in a room usually has a bad effect on the performance of a recognition system. Reverberation causes an artificial extension of acoustic excitations what gets visible as so called reverberation tail when looking at the envelope of the short-term energy over the whole frequency range or in subbands. The approach is based on the assumption that the acoustic excitation of a speech segment, as modeled by an HMM state, will be seen as attenuated versions at successive HMM states. Adding this attenuated excitations in the spectral domain at each HMM state leads to a considerable improvement of the recognition performance. Furthermore a new approach is presented to adapt the Delta parameters that are usually taken as additional acoustic features. The efficiency of both new techniques has been proved by some experiments on isolated and connected word recognition with the TIDigits speech data base.
引用
收藏
页码:781 / 784
页数:4
相关论文
共 50 条
  • [1] Hands-free speech recognition using a filtered clean corpus and incremental HMM adaptation
    Matassoni, M
    Omologo, M
    Giuliani, D
    2000 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, PROCEEDINGS, VOLS I-VI, 2000, : 1407 - 1410
  • [2] Training of HMM with filtered speech material for hands-free recognition
    ITC-IRST, Trento, Italy
    ICASSP IEEE Int Conf Acoust Speech Signal Process Proc, (449-452):
  • [3] Training of HMM with filtered speech material for hands-free recognition
    Giuliani, D
    Matassoni, M
    Omologo, M
    Svaizer, P
    ICASSP '99: 1999 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, PROCEEDINGS VOLS I-VI, 1999, : 449 - 452
  • [4] Experiments of HMM adaptation for hands-free connected digit recognition
    Giuliani, D
    Matassoni, M
    Omologo, M
    Svaizer, P
    PROCEEDINGS OF THE 1998 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-6, 1998, : 473 - 476
  • [5] IMPROVED HANDS-FREE AUTOMATIC SPEECH RECOGNITION IN REVERBERANT ENVIRONMENT CONDITION
    Gomez, Randy
    Nakamura, Keisuke
    Mizumoto, Takeshi
    Nakadai, Kazuhiro
    2014 4TH JOINT WORKSHOP ON HANDS-FREE SPEECH COMMUNICATION AND MICROPHONE ARRAYS (HSCMA), 2014, : 67 - 71
  • [6] Speech enhancement for hands-free terminals
    Grbic, N
    Nordholm, S
    Johansson, A
    ISPA 2001: PROCEEDINGS OF THE 2ND INTERNATIONAL SYMPOSIUM ON IMAGE AND SIGNAL PROCESSING AND ANALYSIS, 2001, : 435 - 440
  • [7] Use of microphone array and model adaptation for hands-free speech acquisition and recognition
    Chien, JT
    Lai, JR
    JOURNAL OF VLSI SIGNAL PROCESSING SYSTEMS FOR SIGNAL IMAGE AND VIDEO TECHNOLOGY, 2004, 36 (2-3): : 141 - 151
  • [8] Use of Microphone Array and Model Adaptation for Hands-Free Speech Acquisition and Recognition
    Jen-Tzung Chien
    Jain-Ray Lai
    Journal of VLSI signal processing systems for signal, image and video technology, 2004, 36 : 141 - 151
  • [9] Model adaptation based on HMM decomposition for reverberant speech recognition
    Takiguchi, T
    Nakamura, S
    Huo, Q
    Shikano, K
    1997 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I - V: VOL I: PLENARY, EXPERT SUMMARIES, SPECIAL, AUDIO, UNDERWATER ACOUSTICS, VLSI; VOL II: SPEECH PROCESSING; VOL III: SPEECH PROCESSING, DIGITAL SIGNAL PROCESSING; VOL IV: MULTIDIMENSIONAL SIGNAL PROCESSING, NEURAL NETWORKS - VOL V: STATISTICAL SIGNAL AND ARRAY PROCESSING, APPLICATIONS, 1997, : 827 - 830
  • [10] A two-microphone approach for speech enhancement in hands-free communications
    Jeannes, RLB
    Faucon, G
    Ayad, B
    1996 INTERNATIONAL CONFERENCE ON COMMUNICATION TECHNOLOGY, VOLUMES 1 AND 2 - PROCEEDINGS, 1996, : 424 - 427