A New HMM Adaptation Approach for the Case of a Hands-free Speech Input in Reverberant Rooms

Cited by: 0
Authors
Hirsch, Hans-Guenter [1]
Finster, Harald [1]
Affiliations
[1] Niederrhein Univ Appl Sci, Krefeld, Germany
Keywords
robust speech recognition; HMM adaptation; hands-free speech input; reverberation
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
A new method is presented for adapting the HMMs of a speech recognition system to hands-free speech input in a room environment. Reverberation in a room usually degrades the performance of a recognition system. It artificially extends acoustic excitations, which becomes visible as a so-called reverberation tail in the envelope of the short-term energy, either over the whole frequency range or in subbands. The approach is based on the assumption that the acoustic excitation of a speech segment, as modeled by an HMM state, appears in attenuated form at the successive HMM states. Adding these attenuated excitations in the spectral domain at each HMM state leads to a considerable improvement in recognition performance. Furthermore, a new approach is presented for adapting the Delta parameters that are usually used as additional acoustic features. The effectiveness of both new techniques has been demonstrated in experiments on isolated and connected word recognition with the TIDigits speech database.
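The core idea of the abstract, adding attenuated copies of earlier excitations to each HMM state in the spectral domain, can be illustrated with a minimal sketch. This is not the authors' implementation. It assumes that each state is described by a mean power spectrum in the linear domain, that each state has a known average duration in frames, and that energy decays exponentially by 60 dB over a reverberation time `t60`. The function name `adapt_state_spectra` and all of its parameters are hypothetical.

```python
def adapt_state_spectra(state_spectra, state_durations, t60, frame_shift=0.01):
    """Add attenuated copies of earlier states' power spectra to each state.

    state_spectra   : list of per-state mean power spectra (linear domain)
    state_durations : assumed average duration of each state, in frames
    t60             : assumed reverberation time in seconds (60 dB decay)
    frame_shift     : assumed frame shift in seconds
    """
    adapted = []
    for i, spectrum in enumerate(state_spectra):
        total = list(spectrum)  # start from the clean state spectrum
        delay = 0.0
        # Walk backwards through the preceding states, accumulating the delay
        # between the start of state j and the start of state i.
        for j in range(i - 1, -1, -1):
            delay += state_durations[j] * frame_shift
            # Assumed exponential decay: 60 dB power drop after t60 seconds.
            gain = 10.0 ** (-6.0 * delay / t60)
            for k, value in enumerate(state_spectra[j]):
                total[k] += gain * value
        adapted.append(total)
    return adapted


# Hypothetical two-state model with two spectral bins per state.
example = adapt_state_spectra([[1.0, 2.0], [0.0, 0.0]], [10, 10], t60=0.6)
```

In this sketch the first state is unchanged, while the second state receives the first state's spectrum attenuated according to the assumed decay. A real system would map the adapted spectra back to the cepstral features used by the recognizer, a step that is omitted here.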
Pages: 781-784
Number of pages: 4