A probabilistic union model with automatic order selection for noisy speech recognition

被引:11
|
作者
Jančovič, P. [1 ]
Ming, J. [1 ]
机构
[1] School of Computer Science, Queen's University of Belfast, Belfast BT7 INN, United Kingdom
来源
| 1641年 / Acoustical Society of America卷 / 110期
关键词
Acoustic noise - Algorithms - Markov processes - Statistical methods;
D O I
10.1121/1.1387083
中图分类号
学科分类号
摘要
A critical issue in exploiting the potential of the sub-band-based approach to robust speech recognition is the method of combining the sub-band observations, for selecting the bands unaffected by noise. A new method for this purpose, i.e., the probabilistic union model, was recently introduced. This model has been shown to be capable of dealing with band-limited corruption, requiring no knowledge about the band position and statistical distribution of the noise. A parameter within the model, which we call its order, gives the best results when it equals the number of noisy bands. Since this information may not be available in practice, in this paper we introduce an automatic algorithm for selecting the order, based on the state duration pattern generated by the hidden Markov model (HMM). The algorithm has been tested on the TIDIGITS database corrupted by various types of additive band-limited noise with unknown noisy bands. The results have shown that the union model equipped with the new algorithm can achieve a recognition performance similar to that achieved when the number of noisy bands is known. The results show a very significant improvement over the traditional full-band model, without requiring prior information on either the position or the number of noisy bands. The principle of the algorithm for selecting the order based on state duration may also be applied to other sub-band combination methods. © 2001 Acoustical Society of America.
引用
收藏
相关论文
共 50 条
  • [1] A probabilistic union model with automatic order selection for noisy speech recognition
    Jancovic, P
    Ming, J
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2001, 110 (03): : 1641 - 1648
  • [2] Optimal Automatic Speech Recognition System Selection for Noisy Environments
    Tachioka, Yuuki
    Narita, Tomohiro
    2016 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA), 2016,
  • [3] AUTOMATIC SPEECH RECOGNITION IN A NOISY AUTOMOTIVE ENVIRONMENT
    WILPON, JG
    RABINER, LR
    DEMARCO, D
    SHIPLEY, KL
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1987, 81 : S94 - S94
  • [4] Temporal feature selection for noisy speech recognition
    Department of Computer Science and Software Engineering, Université Laval, Quebec
    QC
    G1V 0A6, Canada
    Lect. Notes Comput. Sci., (155-166):
  • [5] Temporal Feature Selection for Noisy Speech Recognition
    Trottier, Ludovic
    Chaib-draa, Brahim
    Giguere, Philippe
    ADVANCES IN ARTIFICIAL INTELLIGENCE (AI 2015), 2015, 9091 : 155 - 166
  • [6] Subspace Modeling and Selection for Noisy Speech Recognition
    Chien, Jen-Tzung
    Ting, Chuan-Wei
    INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5, 2006, : 789 - 792
  • [7] Blind Model Selection for Automatic Speech Recognition in Reverberant Environments
    Laurent Couvreur
    Christophe Couvreur
    Journal of VLSI signal processing systems for signal, image and video technology, 2004, 36 : 189 - 203
  • [8] Blind model selection for automatic speech recognition in reverberant environments
    Couvreur, L
    Couvreur, C
    JOURNAL OF VLSI SIGNAL PROCESSING SYSTEMS FOR SIGNAL IMAGE AND VIDEO TECHNOLOGY, 2004, 36 (2-3): : 189 - 203
  • [9] Perceptual features for automatic speech recognition in noisy environments
    Haque, Serajul
    Togneri, Roberto
    Zaknich, Anthony
    SPEECH COMMUNICATION, 2009, 51 (01) : 58 - 75
  • [10] Improved Noisy Student Training for Automatic Speech Recognition
    Park, Daniel S.
    Zhang, Yu
    Jia, Ye
    Han, Wei
    Chiu, Chung-Cheng
    Li, Bo
    Wu, Yonghui
    Le, Quoc, V
    INTERSPEECH 2020, 2020, : 2817 - 2821