A New HMM Adaptation Approach for the Case of a Hands-free Speech Input in Reverberant Rooms

被引：0

作者：

Hirsch, Hans-Guenter ^{[1
]}

Finster, Harald ^{[1
]}

机构：

[1] Niederrhein Univ Appl Sci, Krefeld, Germany

来源：

INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5 | 2006年

关键词：

robust speech recognition; HMM adaptation; hands-free speech input; reverberation;

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

A new method is presented for adapting the HMMs of a speech recognition system to the condition of a hands-free speech input in a room environment. The reverberation in a room usually has a bad effect on the performance of a recognition system. Reverberation causes an artificial extension of acoustic excitations what gets visible as so called reverberation tail when looking at the envelope of the short-term energy over the whole frequency range or in subbands. The approach is based on the assumption that the acoustic excitation of a speech segment, as modeled by an HMM state, will be seen as attenuated versions at successive HMM states. Adding this attenuated excitations in the spectral domain at each HMM state leads to a considerable improvement of the recognition performance. Furthermore a new approach is presented to adapt the Delta parameters that are usually taken as additional acoustic features. The efficiency of both new techniques has been proved by some experiments on isolated and connected word recognition with the TIDigits speech data base.

引用

页码：781 / 784

页数：4

共 50 条

[1] Hands-free speech recognition using a filtered clean corpus and incremental HMM adaptation
Matassoni, M
Omologo, M
Giuliani, D
2000 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, PROCEEDINGS, VOLS I-VI, 2000, : 1407 - 1410
[2] Training of HMM with filtered speech material for hands-free recognition
ITC-IRST, Trento, Italy
ICASSP IEEE Int Conf Acoust Speech Signal Process Proc, (449-452):
[3] Training of HMM with filtered speech material for hands-free recognition
Giuliani, D
Matassoni, M
Omologo, M
Svaizer, P
ICASSP '99: 1999 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, PROCEEDINGS VOLS I-VI, 1999, : 449 - 452
[4] Experiments of HMM adaptation for hands-free connected digit recognition
Giuliani, D
Matassoni, M
Omologo, M
Svaizer, P
PROCEEDINGS OF THE 1998 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-6, 1998, : 473 - 476
[5] IMPROVED HANDS-FREE AUTOMATIC SPEECH RECOGNITION IN REVERBERANT ENVIRONMENT CONDITION
Gomez, Randy
Nakamura, Keisuke
Mizumoto, Takeshi
Nakadai, Kazuhiro
2014 4TH JOINT WORKSHOP ON HANDS-FREE SPEECH COMMUNICATION AND MICROPHONE ARRAYS (HSCMA), 2014, : 67 - 71
[6] Speech enhancement for hands-free terminals
Grbic, N
Nordholm, S
Johansson, A
ISPA 2001: PROCEEDINGS OF THE 2ND INTERNATIONAL SYMPOSIUM ON IMAGE AND SIGNAL PROCESSING AND ANALYSIS, 2001, : 435 - 440
[7] Use of microphone array and model adaptation for hands-free speech acquisition and recognition
Chien, JT
Lai, JR
JOURNAL OF VLSI SIGNAL PROCESSING SYSTEMS FOR SIGNAL IMAGE AND VIDEO TECHNOLOGY, 2004, 36 (2-3): : 141 - 151
[8] Use of Microphone Array and Model Adaptation for Hands-Free Speech Acquisition and Recognition
Jen-Tzung Chien
Jain-Ray Lai
Journal of VLSI signal processing systems for signal, image and video technology, 2004, 36 : 141 - 151
[9] Model adaptation based on HMM decomposition for reverberant speech recognition
Takiguchi, T
Nakamura, S
Huo, Q
Shikano, K
1997 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I - V: VOL I: PLENARY, EXPERT SUMMARIES, SPECIAL, AUDIO, UNDERWATER ACOUSTICS, VLSI; VOL II: SPEECH PROCESSING; VOL III: SPEECH PROCESSING, DIGITAL SIGNAL PROCESSING; VOL IV: MULTIDIMENSIONAL SIGNAL PROCESSING, NEURAL NETWORKS - VOL V: STATISTICAL SIGNAL AND ARRAY PROCESSING, APPLICATIONS, 1997, : 827 - 830
[10] A two-microphone approach for speech enhancement in hands-free communications
Jeannes, RLB
Faucon, G
Ayad, B
1996 INTERNATIONAL CONFERENCE ON COMMUNICATION TECHNOLOGY, VOLUMES 1 AND 2 - PROCEEDINGS, 1996, : 424 - 427

← 1 2 3 4 5 →