A Bayesian prediction approach to robust speech recognition and online environmental learning

被引:5
|
作者
Chien, JT [1 ]
机构
[1] Natl Cheng Kung Univ, Dept Comp Sci & Informat Engn, Tainan 70101, Taiwan
关键词
Bayesian predictive classification (BPC); online unsupervised learning; speaker adaptation; speech recognition; hidden Markov model;
D O I
10.1016/S0167-6393(01)00032-2
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
A robust speech recognizer is developed to tackle the inevitable mismatch between training and testing environments. Because the realistic environments are uncertain and nonstationary, it is necessary to characterize the uncertainty of speech hidden Markov models (HMMs) for recognition and trace the uncertainty incrementally to catch the newest environmental statistics. In this paper, we develop a new Bayesian predictive classification (BPC) for robust decision and online environmental learning. The BPC decision is adequately established by modeling the uncertainties of both the HMM mean rector and precision matrix using a conjugate prior density. The frame-based predictive distributions using multivariate t distributions and approximate Gaussian distributions are herein exploited. After the recognition, the prior density is pooled with the likelihood of the Current test sentence to generate the reproducible prior density. The hyperparameters of the prior density are accordingly adjusted to meet the newest environments and apply for the recognition of upcoming data. As a result, an efficient online unsupervised learning strategy is developed for HMM-based speech recognition without needing adaptation data. In the experiments, the proposed approach is significantly better than conventional plug-in maximum a posteriori (MAP) decision on the recognition of connected Chinese digits in hands-free car environments. This approach is economical in computation. (C) 2002 Elsevier Science B.V. All rights reserved.
引用
下载
收藏
页码:321 / 334
页数:14
相关论文
共 50 条
  • [41] A Bayesian Approach to Robust Inverse Reinforcement Learning
    Wei, Ran
    Zeng, Siliang
    Li, Chenliang
    Garcia, Alfredo
    McDonald, Anthony
    Hong, Mingyi
    CONFERENCE ON ROBOT LEARNING, VOL 229, 2023, 229
  • [42] An Online Learning Approach for Trend Recognition
    Paulk, David
    8TH ACM INTERNATIONAL CONFERENCE ON PERVASIVE TECHNOLOGIES RELATED TO ASSISTIVE ENVIRONMENTS (PETRA 2015), 2015,
  • [43] A maximum likelihood approach to unsupervised online adaptation of stochastic vector mapping function for robust speech recognition
    Zhu, Donglai
    Huo, Qiang
    2007 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL IV, PTS 1-3, 2007, : 773 - +
  • [44] Approach of features with confident weight for robust speech recognition
    Ge Lingnan
    Shirai, Katsuhiko
    Kurematsu, Akira
    ACOUSTICAL SCIENCE AND TECHNOLOGY, 2011, 32 (03) : 92 - 99
  • [45] Robust speech recognition using a noise rejection approach
    Khan, E
    Levinson, R
    IEEE INTERNATIONAL JOINT SYMPOSIA ON INTELLIGENCE AND SYSTEMS - PROCEEDINGS, 1998, : 326 - 335
  • [46] A perceptual masking approach for noise robust speech recognition
    Hari Krishna Maganti
    Marco Matassoni
    EURASIP Journal on Audio, Speech, and Music Processing, 2012
  • [47] An Integrated Approach to Robust Speaker Identification and Speech Recognition
    Kwan, C.
    Yin, J.
    Ayhan, B.
    Chu, S.
    Liu, X.
    Puckett, K.
    Zhao, Y.
    Ho, K. C.
    Kruger, M.
    Sityar, I.
    2008 IEEE INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, VOLS 1-8, 2008, : 1635 - +
  • [48] A Minimax Classification Approach with Application to Robust Speech Recognition
    Merhav, Neri
    Lee, Chin-Hui
    IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 1993, 1 (01): : 90 - 100
  • [49] A Bayesian approach to robust beamforming with embedding environmental uncertainty
    Zhao H.-F.
    Gong X.-Y.
    Harbin Gongcheng Daxue Xuebao/Journal of Harbin Engineering University, 2010, 31 (07): : 951 - 957
  • [50] Approach of feature with confident weight for robust speech recognition
    Ge, YB
    Song, J
    Ge, LN
    Shirai, K
    2004 IEEE 6TH WORKSHOP ON MULTIMEDIA SIGNAL PROCESSING, 2004, : 11 - 14