Maximum likelihood joint estimation of channel and noise for robust speech recognition

Cited by: 0
Author
Zhao, YX [1]
Affiliation
[1] Univ Missouri, Dept Comp Sci & Comp Engn, Columbia, MO 65211 USA
Keywords
DOI
Not available
Chinese Library Classification number
O42 [Acoustics];
Discipline classification codes
070206 ; 082403 ;
Abstract
An EM algorithm is formulated in the DFT domain for joint estimation of parameters of distortion channel and additive noise from online degraded speech, and the posterior estimates of short-time speech power spectra are obtained at the convergence of the EM algorithm. Any speech features derivable from power spectra can then be approximately estimated by minimum mean-squared error estimation. Experiments were performed on speaker-independent continuous speech recognition using as features the perceptually based linear prediction cepstral coefficients, energy, and temporal regression coefficients. Speech data were taken from the TIMIT database and were degraded by a distortion channel and colored noise at various SNR levels. Experimental results indicate that the proposed technique leads to convergent identification of channel and noise and significantly improved recognition accuracy.
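The abstract does not give the update equations, so the following is only a minimal sketch of the general idea, not the paper's algorithm. It assumes a single DFT bin in which each observed coefficient is y_t = z_t + n_t, where the channel-filtered speech z_t is complex Gaussian with variance g * S[t], the noise n_t is complex Gaussian with variance noise_var, and the prior speech power S[t] is known. The names `em_gain_noise`, `g`, `noise_var`, and `S` are hypothetical, as is the synthetic data. EM alternates a Wiener-style posterior step with closed-form updates of the channel power gain and noise variance, then returns a posterior estimate of the speech power spectrum.

```python
import numpy as np


def em_gain_noise(Y, S, n_iter=200, g0=1.0, noise0=1.0):
    """Toy EM for one DFT bin (an illustrative assumption, not the paper's method).

    Model: Y[t] = z[t] + n[t], z[t] ~ CN(0, g * S[t]), n[t] ~ CN(0, noise_var).
    Returns the channel power gain g, the noise variance, and the posterior
    mean of the clean speech power E[|x_t|^2 | Y[t]] with z = h * x, |h|^2 = g.
    """
    P = np.abs(Y) ** 2  # observed short-time power in this bin
    g, noise_var = float(g0), float(noise0)
    for _ in range(n_iter):
        # E-step: posterior second moments of the filtered speech and the noise.
        w = g * S / (g * S + noise_var)      # Wiener gain per frame
        post_var = w * noise_var             # posterior variance of z[t] (and n[t])
        Ez2 = w ** 2 * P + post_var          # E[|z_t|^2 | Y[t]]
        En2 = (1.0 - w) ** 2 * P + post_var  # E[|n_t|^2 | Y[t]]
        # M-step: closed-form maximizers of the expected complete-data likelihood.
        g = float(np.mean(Ez2 / S))
        noise_var = float(np.mean(En2))
    # Posterior speech power at the final parameter values.
    w = g * S / (g * S + noise_var)
    speech_power = (w ** 2 * P + w * noise_var) / g
    return g, noise_var, speech_power


def complex_normal(rng, var, size):
    """Draw circular complex Gaussian samples with the given variance."""
    return np.sqrt(var / 2.0) * (rng.standard_normal(size) + 1j * rng.standard_normal(size))


# Synthetic check with made-up parameters.
rng = np.random.default_rng(0)
T = 20000
g_true, noise_true = 0.5, 0.2
S = 10.0 ** rng.uniform(-2.0, 2.0, T)  # widely varying prior speech power
Y = complex_normal(rng, g_true * S, T) + complex_normal(rng, noise_true, T)

g_hat, noise_hat, speech_power = em_gain_noise(Y, S)
print("estimated gain:", g_hat, "estimated noise variance:", noise_hat)
```

The prior speech power has to vary across frames for the gain and the noise variance to be separately identifiable in this toy model; with a constant S[t], only the sum g * S + noise_var could be recovered.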
Pages: 1109-1112
Number of pages: 4