A New HMM Adaptation Approach for the Case of a Hands-free Speech Input in Reverberant Rooms

Cited by: 0

Authors:

Hirsch, Hans-Guenter [1]

Finster, Harald [1]

Affiliations:

[1] Niederrhein University of Applied Sciences, Krefeld, Germany

Source:

INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5 | 2006

Keywords:

robust speech recognition; HMM adaptation; hands-free speech input; reverberation

DOI:

Not available

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory]

Discipline classification codes:

081104; 0812; 0835; 1405

Abstract:

A new method is presented for adapting the HMMs of a speech recognition system to hands-free speech input in a room environment. Reverberation in a room usually degrades the performance of a recognition system. It artificially extends acoustic excitations, which becomes visible as a so-called reverberation tail in the envelope of the short-term energy, either over the whole frequency range or in subbands. The approach is based on the assumption that the acoustic excitation of a speech segment, as modeled by an HMM state, appears in attenuated form at the successive HMM states. Adding these attenuated excitations in the spectral domain at each HMM state leads to a considerable improvement in recognition performance. Furthermore, a new approach is presented for adapting the Delta parameters that are usually used as additional acoustic features. The effectiveness of both new techniques has been demonstrated in experiments on isolated and connected word recognition with the TIDigits speech database.
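The core idea of the abstract, adding attenuated copies of earlier states' excitations to each state in the spectral domain, might be sketched as follows. This is only an illustration under stated assumptions, not the authors' algorithm: the abstract gives no formulas, so the exponential energy decay derived from a reverberation time `t60_s`, the use of average state durations as delays, and the function name `adapt_state_spectra` are all hypothetical choices made here.

```python
def adapt_state_spectra(state_spectra, state_durations_s, t60_s):
    """Add attenuated copies of earlier states' power spectra to each state.

    state_spectra: per-state mean power spectra (linear domain), ordered
        left to right as in a left-to-right HMM (assumed representation).
    state_durations_s: assumed average duration of each state in seconds.
    t60_s: assumed reverberation time; energy decays by 60 dB over t60_s.
    """
    if len(state_spectra) != len(state_durations_s):
        raise ValueError("one duration is needed per state")
    if t60_s <= 0:
        raise ValueError("t60_s must be positive")

    adapted = []
    for i, own in enumerate(state_spectra):
        total = list(own)
        delay = 0.0
        # Walk backwards through the preceding states, accumulating the
        # time that has passed since each earlier state was excited.
        for j in range(i - 1, -1, -1):
            delay += state_durations_s[j]
            # A 60 dB energy decay over t60_s gives a power gain of
            # 10 ** (-6 * delay / t60_s) after `delay` seconds.
            gain = 10.0 ** (-6.0 * delay / t60_s)
            total = [t + gain * s for t, s in zip(total, state_spectra[j])]
        adapted.append(total)
    return adapted


if __name__ == "__main__":
    # Two states, two subbands; the second state is silent on its own,
    # so everything it contains after adaptation is reverberation tail.
    spectra = [[1.0, 2.0], [0.0, 0.0]]
    print(adapt_state_spectra(spectra, [0.1, 0.1], t60_s=0.6))
```

The addition is done on linear power spectra because attenuated energy contributions superimpose additively there; models stored as cepstral or log-spectral means would first have to be mapped back to that domain, a step this sketch leaves out.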

Pages: 781-784

Page count: 4
