A probabilistic union model with automatic order selection for noisy speech recognition

被引:11
|
作者
Jančovič, P. [1 ]
Ming, J. [1 ]
机构
[1] School of Computer Science, Queen's University of Belfast, Belfast BT7 INN, United Kingdom
来源
| 1641年 / Acoustical Society of America卷 / 110期
关键词
Acoustic noise - Algorithms - Markov processes - Statistical methods;
D O I
10.1121/1.1387083
中图分类号
学科分类号
摘要
A critical issue in exploiting the potential of the sub-band-based approach to robust speech recognition is the method of combining the sub-band observations, for selecting the bands unaffected by noise. A new method for this purpose, i.e., the probabilistic union model, was recently introduced. This model has been shown to be capable of dealing with band-limited corruption, requiring no knowledge about the band position and statistical distribution of the noise. A parameter within the model, which we call its order, gives the best results when it equals the number of noisy bands. Since this information may not be available in practice, in this paper we introduce an automatic algorithm for selecting the order, based on the state duration pattern generated by the hidden Markov model (HMM). The algorithm has been tested on the TIDIGITS database corrupted by various types of additive band-limited noise with unknown noisy bands. The results have shown that the union model equipped with the new algorithm can achieve a recognition performance similar to that achieved when the number of noisy bands is known. The results show a very significant improvement over the traditional full-band model, without requiring prior information on either the position or the number of noisy bands. The principle of the algorithm for selecting the order based on state duration may also be applied to other sub-band combination methods. © 2001 Acoustical Society of America.
引用
收藏
相关论文
共 50 条
  • [21] Feature Selection Algorithms for Automatic Speech Recognition
    Kalamani, M.
    Valarmathy, S.
    Poonkuzhali, C.
    Catherine, J. N.
    2014 INTERNATIONAL CONFERENCE ON COMPUTER COMMUNICATION AND INFORMATICS (ICCCI), 2014,
  • [22] SPEECH RECOGNITION WITH NO SPEECH OR WITH NOISY SPEECH
    Krishna, Gautam
    Co Tran
    Yu, Jianguo
    Tewfik, Ahmed H.
    2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 1090 - 1094
  • [23] Automatic model selection for Probabilistic PCA
    Lopez-Rubio, Ezequiel
    Ortiz-de-Lazcano-Lobato, Juan Miguel
    Lopez-Rodriguez, Domingo
    Vargas-Gonzalez, Maria del Carmen
    COMPUTATIONAL AND AMBIENT INTELLIGENCE, 2007, 4507 : 127 - +
  • [24] Robust noisy speech recognition with adaptive frequency bank selection
    Tian, Y
    Wu, J
    Wang, ZY
    Lu, DJ
    FOURTH IEEE INTERNATIONAL CONFERENCE ON MULTIMODAL INTERFACES, PROCEEDINGS, 2002, : 75 - 80
  • [25] Syllable-based automatic Arabic speech recognition in noisy enviroment
    Azmi, Mohamed M.
    Tolba, Hesham
    2008 INTERNATIONAL CONFERENCE ON AUDIO, LANGUAGE AND IMAGE PROCESSING, VOLS 1 AND 2, PROCEEDINGS, 2008, : 1436 - 1441
  • [26] 2-DPsychoacoustic Modeling for Automatic Speech Recognition in Noisy Environment
    Desai, Sampreeta
    Khandekar, Prasad D.
    Raut, Ketan J.
    2016 CONFERENCE ON ADVANCES IN SIGNAL PROCESSING (CASP), 2016, : 129 - 132
  • [27] Automatic speech/speaker recognition in noisy environments using wavelet transform
    Alkhaldi, W
    Fakhr, W
    Hamdy, N
    2002 45TH MIDWEST SYMPOSIUM ON CIRCUITS AND SYSTEMS, VOL I, CONFERENCE PROCEEDINGS, 2002, : 463 - 466
  • [28] The usage of wavelet packet transformation in automatic noisy speech recognition systems
    Kotnik, B
    Kacic, Z
    Horvat, B
    IEEE REGION 8 EUROCON 2003, VOL B, PROCEEDINGS: COMPUTER AS A TOOL, 2003, : 131 - 134
  • [29] Multi-model approach for noisy speech recognition
    Electron Lett, 1 (30-32):
  • [30] Multi-model approach for noisy speech recognition
    Guan, CT
    Leung, SH
    Lau, WH
    ELECTRONICS LETTERS, 1998, 34 (01) : 30 - 32