A New HMM Adaptation Approach for the Case of a Hands-free Speech Input in Reverberant Rooms

被引:0
|
作者
Hirsch, Hans-Guenter [1 ]
Finster, Harald [1 ]
机构
[1] Niederrhein Univ Appl Sci, Krefeld, Germany
关键词
robust speech recognition; HMM adaptation; hands-free speech input; reverberation;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
A new method is presented for adapting the HMMs of a speech recognition system to the condition of a hands-free speech input in a room environment. The reverberation in a room usually has a bad effect on the performance of a recognition system. Reverberation causes an artificial extension of acoustic excitations what gets visible as so called reverberation tail when looking at the envelope of the short-term energy over the whole frequency range or in subbands. The approach is based on the assumption that the acoustic excitation of a speech segment, as modeled by an HMM state, will be seen as attenuated versions at successive HMM states. Adding this attenuated excitations in the spectral domain at each HMM state leads to a considerable improvement of the recognition performance. Furthermore a new approach is presented to adapt the Delta parameters that are usually taken as additional acoustic features. The efficiency of both new techniques has been proved by some experiments on isolated and connected word recognition with the TIDigits speech data base.
引用
收藏
页码:781 / 784
页数:4
相关论文
共 50 条
  • [31] GazePuffer : Hands-Free Input Method Leveraging Puff Cheeks for VR
    Lai, Yunfei
    Sun, Minghui
    Li, Zhuofeng
    2024 IEEE CONFERENCE ON VIRTUAL REALITY AND 3D USER INTERFACES, VR 2024, 2024, : 331 - 341
  • [32] Usability of a Hands-Free Voice Input Interface for Ecological Momentary Assessment
    Adaimi, Rebecca
    Ho, Ka Tai
    Thomaz, Edison
    2020 IEEE INTERNATIONAL CONFERENCE ON PERVASIVE COMPUTING AND COMMUNICATIONS WORKSHOPS (PERCOM WORKSHOPS), 2020,
  • [33] EyeExpress: Expanding Hands-free Input Vocabulary using Eye Expressions
    Ku, Pin-Sung
    Wu, Te-Yen
    Chen, Mike Y.
    ADJUNCT PUBLICATION OF THE 31ST ANNUAL ACM SYMPOSIUM ON USER INTERFACE SOFTWARE AND TECHNOLOGY (UIST'18 ADJUNCT), 2018, : 126 - 127
  • [34] HANDS-FREE SPEECH RECOGNITION CHALLENGE FOR REAL-WORLD SPEECH DIALOGUE SYSTEMS
    Saruwatari, Hiroshi
    Kawanami, Hiromichi
    Takeuchi, Shota
    Takahashi, Yu
    Cincarek, Tobias
    Shikano, Kiyohiro
    2009 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1- 8, PROCEEDINGS, 2009, : 3729 - 3732
  • [35] Adaptive pitch-based speech detection for hands-free applications
    Abu-El-Quran, AR
    Goubran, RA
    2005 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1-5: SPEECH PROCESSING, 2005, : 305 - 308
  • [36] Angular Region-wise Speech Enhancement for Hands-free Speakerphone
    Hioka, Yusuke
    Furuya, Ken'ichi
    Kobayashi, Kazunori
    Sakauchi, Sumitaka
    Haneda, Yoichi
    IEEE TRANSACTIONS ON CONSUMER ELECTRONICS, 2012, 58 (04) : 1403 - 1410
  • [37] Energy-based speech enhancement technique for hands-free communication
    Rahmani, M.
    Yousefian, N.
    Akbari, A.
    ELECTRONICS LETTERS, 2009, 45 (01) : 85 - 86
  • [38] The Hands-Free speech in post laryngectomy voice rehabilitation with tracheosophageal voice
    Serra, Agostino
    Grillo, Calogero
    Nane, Sebastiano
    Ferlito, Salvatore
    Martines, Anna Maria
    Grillo, Caterina
    Cocuzza, Salvatore
    ACTA MEDICA MEDITERRANEA, 2010, 26 (02): : 97 - 100
  • [39] Sector-based detection for hands-free speech enhancement in cars
    Lathoud, Guillaume
    Bourgeois, Julien
    Freudenberger, Juergen
    EURASIP JOURNAL ON APPLIED SIGNAL PROCESSING, 2006, 2006 (1)
  • [40] Sector-Based Detection for Hands-Free Speech Enhancement in Cars
    Guillaume Lathoud
    Julien Bourgeois
    Jürgen Freudenberger
    EURASIP Journal on Advances in Signal Processing, 2006