Factorial Speech Processing Models for Noise-Robust Automatic Speech Recognition

Cited by: 0
Authors
Khademian, Mahdi [1]
Homayounpour, Mohammad Mehdi [1]
Affiliations
[1] Amirkabir Univ Technol, LIMP, Tehran, Iran
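Keywords
factorial models of speech processing; state-conditional observation distribution; weighted stereo sampling; two-dimensional Viterbi algorithm
DOI
Not available
Chinese Library Classification
TM [Electrical Engineering]; TN [Electronic Technology, Communication Technology]
Subject classification codes
0808; 0809
Abstract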
This paper presents an introduction to factorial speech processing models for noise-robust automatic speech processing tasks. Factorial models try to use more noise information than other robustness techniques do, in order to better model speech, noise, and the way they are combined generatively. Because factorial models were not completely successful in noise-robust speech processing applications, although they had significant achievements in other speech processing areas in the past, we decided to reconsider them and evaluate their effects on the Aurora 2 task. In addition to the Aurora noises, two more regular noises, Helicopter and Locomotive engine noise, are examined in our experiments. The experiments show that these models are successful when faced with destructive noises, and that they also yield unexpected improvements for non-regular, non-stationary noises such as Babble.
Pages: 637 - 642
Number of pages: 6