Environment compensation based on maximum a posteriori estimation for improved speech recognition

被引:0
|
作者
Shen, HF [1 ]
Guo, J
Liu, G
Huang, PM
Li, QX
机构
[1] Beijing Univ Posts & Telecommun, Beijing 100876, Peoples R China
[2] Univ Sci & Technol Beijing, Beijing 100083, Peoples R China
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we describe environment compensation approach based on MAP (maximum a posteriori) estimation assuming that the noise can be modeled as a single Gaussian distribution. It employs the prior information of the noise to deal with environmental variabilities. The acoustic-distorted environment model in the cepstral domain is approximated by the truncated first-order vector Taylor series(VTS) expansion and the clean speech is trained by using Self-Organizing Map (SOM) neural network with the assumption that the speech can be well represented as the multivariate diagonal Gaussian mixtures model (GMM). With the reasonable environment model approximation and effective clustering for the clean model, the noise is well refined using batch-EM algorithm under MAP criterion. Experiment with large vocabulary speaker-independent continuous speech recognition shows that this approach achieves considerable improvement on recognition performance.
引用
收藏
页码:854 / 862
页数:9
相关论文
共 50 条
  • [1] Mixed environment compensation based on maximum a posteriori estimation for robust speech recognition
    Shen, Haifeng
    Liu, Gang
    Guo, Jun
    [J]. ARTIFICIAL INTELLIGENCE REVIEW, 2009, 32 (1-4) : 1 - 11
  • [2] Mixed environment compensation based on maximum a posteriori estimation for robust speech recognition
    Haifeng Shen
    Gang Liu
    Jun Guo
    [J]. Artificial Intelligence Review, 2009, 32 : 1 - 11
  • [3] Single Channel Speech Separation Using Maximum a Posteriori Estimation
    Radfar, M. H.
    Dansereau, R. M.
    [J]. INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4, 2007, : 841 - 844
  • [4] Generalized maximum a posteriori spectral amplitude estimation for speech enhancement
    Tsao, Yu
    Lai, Ying-Hui
    [J]. SPEECH COMMUNICATION, 2016, 76 : 112 - 126
  • [5] Channel compensation of speaker identification based on maximum a posteriori
    Gao, Rong-Chun
    Han, Ji-Qing
    Zhang, Lei
    [J]. Tongxin Xuebao/Journal on Communication, 2009, 30 (03): : 99 - 103
  • [6] Harmonicity based dereverberation with maximum a posteriori estimation
    Nakatani, T
    Juang, BH
    Kinoshita, K
    Miyoshi, M
    [J]. 2005 WORKSHOP ON APPLICATIONS OF SIGNAL PROCESSING TO AUDIO AND ACOUSTICS (WASPAA), 2005, : 94 - 97
  • [7] Maximum a Posteriori Speech Enhancement Based on Double Spectrum
    Mowlaee, Pejman
    Scheran, Daniel
    Stahl, Johannes
    Wood, Sean U. N.
    Kleijn, W. Bastiaan
    [J]. INTERSPEECH 2019, 2019, : 2738 - 2742
  • [8] Multi-band Spectral Subtraction of Speech Enhancement Based on Maximum Posteriori Phase Estimation
    Li Z.
    Wu W.
    Zhang Q.
    Ren H.
    [J]. Dianzi Yu Xinxi Xuebao/Journal of Electronics and Information Technology, 2017, 39 (09): : 2282 - 2286
  • [9] Feature compensation based on independent noise estimation for robust speech recognition
    Yong Lü
    Han Lin
    Pingping Wu
    Yitao Chen
    [J]. EURASIP Journal on Audio, Speech, and Music Processing, 2021
  • [10] Feature compensation based on independent noise estimation for robust speech recognition
    Lu, Yong
    Lin, Han
    Wu, Pingping
    Chen, Yitao
    [J]. EURASIP JOURNAL ON AUDIO SPEECH AND MUSIC PROCESSING, 2021, 2021 (01)