A probabilistic union model with automatic order selection for noisy speech recognition

被引：11

作者：

Jančovič, P. ^{[1
]}

Ming, J. ^{[1
]}

机构：

[1] School of Computer Science, Queen's University of Belfast, Belfast BT7 INN, United Kingdom

来源：

| 1641年 / Acoustical Society of America卷 / 110期

关键词：

Acoustic noise - Algorithms - Markov processes - Statistical methods;

D O I：

10.1121/1.1387083

中图分类号：

学科分类号：

摘要：

A critical issue in exploiting the potential of the sub-band-based approach to robust speech recognition is the method of combining the sub-band observations, for selecting the bands unaffected by noise. A new method for this purpose, i.e., the probabilistic union model, was recently introduced. This model has been shown to be capable of dealing with band-limited corruption, requiring no knowledge about the band position and statistical distribution of the noise. A parameter within the model, which we call its order, gives the best results when it equals the number of noisy bands. Since this information may not be available in practice, in this paper we introduce an automatic algorithm for selecting the order, based on the state duration pattern generated by the hidden Markov model (HMM). The algorithm has been tested on the TIDIGITS database corrupted by various types of additive band-limited noise with unknown noisy bands. The results have shown that the union model equipped with the new algorithm can achieve a recognition performance similar to that achieved when the number of noisy bands is known. The results show a very significant improvement over the traditional full-band model, without requiring prior information on either the position or the number of noisy bands. The principle of the algorithm for selecting the order based on state duration may also be applied to other sub-band combination methods. © 2001 Acoustical Society of America.

引用

共 50 条

[21] Feature Selection Algorithms for Automatic Speech Recognition
Kalamani, M.
Valarmathy, S.
Poonkuzhali, C.
Catherine, J. N.
2014 INTERNATIONAL CONFERENCE ON COMPUTER COMMUNICATION AND INFORMATICS (ICCCI), 2014,
[22] SPEECH RECOGNITION WITH NO SPEECH OR WITH NOISY SPEECH
Krishna, Gautam
Co Tran
Yu, Jianguo
Tewfik, Ahmed H.
2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 1090 - 1094
[23] Automatic model selection for Probabilistic PCA
Lopez-Rubio, Ezequiel
Ortiz-de-Lazcano-Lobato, Juan Miguel
Lopez-Rodriguez, Domingo
Vargas-Gonzalez, Maria del Carmen
COMPUTATIONAL AND AMBIENT INTELLIGENCE, 2007, 4507 : 127 - +
[24] Robust noisy speech recognition with adaptive frequency bank selection
Tian, Y
Wu, J
Wang, ZY
Lu, DJ
FOURTH IEEE INTERNATIONAL CONFERENCE ON MULTIMODAL INTERFACES, PROCEEDINGS, 2002, : 75 - 80
[25] Syllable-based automatic Arabic speech recognition in noisy enviroment
Azmi, Mohamed M.
Tolba, Hesham
2008 INTERNATIONAL CONFERENCE ON AUDIO, LANGUAGE AND IMAGE PROCESSING, VOLS 1 AND 2, PROCEEDINGS, 2008, : 1436 - 1441
[26] 2-DPsychoacoustic Modeling for Automatic Speech Recognition in Noisy Environment
Desai, Sampreeta
Khandekar, Prasad D.
Raut, Ketan J.
2016 CONFERENCE ON ADVANCES IN SIGNAL PROCESSING (CASP), 2016, : 129 - 132
[27] Automatic speech/speaker recognition in noisy environments using wavelet transform
Alkhaldi, W
Fakhr, W
Hamdy, N
2002 45TH MIDWEST SYMPOSIUM ON CIRCUITS AND SYSTEMS, VOL I, CONFERENCE PROCEEDINGS, 2002, : 463 - 466
[28] The usage of wavelet packet transformation in automatic noisy speech recognition systems
Kotnik, B
Kacic, Z
Horvat, B
IEEE REGION 8 EUROCON 2003, VOL B, PROCEEDINGS: COMPUTER AS A TOOL, 2003, : 131 - 134
[29] Multi-model approach for noisy speech recognition
Electron Lett, 1 (30-32):
[30] Multi-model approach for noisy speech recognition
Guan, CT
Leung, SH
Lau, WH
ELECTRONICS LETTERS, 1998, 34 (01) : 30 - 32

← 1 2 3 4 5 →