Multi-environment model adaptation based on vector Taylor series for robust speech recognition

被引:4
|
作者
Lue, Yong [1 ]
Wu, Haiyang [1 ]
Zhou, Lin [1 ]
Wu, Zhenyang [1 ]
机构
[1] Southeast Univ, Sch Informat Sci & Engn, Nanjing 210096, Peoples R China
基金
中国国家自然科学基金;
关键词
Model adaptation; Vector Taylor series; Multi-environment model; Speech recognition; MAXIMUM-LIKELIHOOD; HMM ADAPTATION; NOISE; COMPENSATION;
D O I
10.1016/j.patcog.2010.03.023
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we propose a multi-environment model adaptation method based on vector Taylor series (VTS) for robust speech recognition. In the training phase, the clean speech is contaminated with noise at different signal-to-noise ratio (SNR) levels to produce several types of noisy training speech and each type is used to obtain a noisy hidden Markov model (HMM) set. In the recognition phase, the HMM set which best matches the testing environment is selected, and further adjusted to reduce the environmental mismatch by the VTS-based model adaptation method. In the proposed method, the VTS approximation based on noisy training speech is given and the testing noise parameters are estimated from the noisy testing speech using the expectation-maximization (EM) algorithm. The experimental results indicate that the proposed multi-environment model adaptation method can significantly improve the performance of speech recognizers and outperforms the traditional model adaptation method and the linear regression-based multi-environment method. (C) 2010 Elsevier Ltd. All rights reserved.
引用
收藏
页码:3093 / 3099
页数:7
相关论文
共 50 条
  • [21] Histogram Equalization to Model Adaptation for Robust Speech Recognition
    Suh, Youngjoo
    Kim, Hoirin
    [J]. EURASIP JOURNAL ON ADVANCES IN SIGNAL PROCESSING, 2010,
  • [22] Histogram Equalization to Model Adaptation for Robust Speech Recognition
    Youngjoo Suh
    Hoirin Kim
    [J]. EURASIP Journal on Advances in Signal Processing, 2010
  • [23] Multi-Channel Feature Adaptation for Robust Speech Recognition
    Zhang, Zhaofeng
    Xiao, Xiong
    Wang, Longbiao
    Dang, Jianwu
    Iwahashi, Masahiro
    Chng, Eng Siong
    Li, Haizhou
    [J]. 2016 10TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING (ISCSLP), 2016,
  • [24] Model Adaptation Algorithm Based on Central Subband Regression for Robust Speech Recognition
    Lu, Yong
    Zhou, Lin
    [J]. 2014 SEVENTH INTERNATIONAL SYMPOSIUM ON COMPUTATIONAL INTELLIGENCE AND DESIGN (ISCID 2014), VOL 2, 2014,
  • [25] Taylor Series Expansion of Psychoacoustic Corruption Function for Noise Robust Speech Recognition
    Das, Biswajit
    Panda, Ashish
    [J]. PROCEEDINGS OF 2016 IEEE 13TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING (ICSP 2016), 2016, : 568 - 572
  • [26] A NOISE ROBUST I-VECTOR EXTRACTOR USING VECTOR TAYLOR SERIES FOR SPEAKER RECOGNITION
    Lei, Yun
    Burget, Lukas
    Scheffer, Nicolas
    [J]. 2013 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2013, : 6788 - 6791
  • [27] Stochastic Vector Mapping-based Feature Enhancement Using Prior Model and Environment Adaptation for Noisy Speech Recognition
    Hsieh, Chia-Hsin
    Wu, Chung-Hsien
    Lin, Jun-Yu
    [J]. INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5, 2006, : 29 - 32
  • [28] LASSO ENVIRONMENT MODEL COMBINATION FOR ROBUST SPEECH RECOGNITION
    Xiao, Xiong
    Li, Jinyu
    Chng, Eng Siong
    Li, Haizhou
    [J]. 2012 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2012, : 4305 - 4308
  • [29] Unsupervised Data-Driven Feature Vector Normalization With Acoustic Model Adaptation for Robust Speech Recognition
    Buera, Luis
    Miguel, Antonio
    Saz, Oscar
    Ortega, Alfonso
    Lleida, Eduardo
    [J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2010, 18 (02): : 296 - 309
  • [30] DYNAMIC ADAPTATION OF HIDDEN MARKOV MODEL FOR ROBUST SPEECH RECOGNITION
    GAO, YQ
    CHEN, YB
    WU, BX
    [J]. 1989 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS, VOLS 1-3, 1989, : 1336 - 1339