Multi-environment model adaptation based on vector Taylor series for robust speech recognition

被引:4
|
作者
Lue, Yong [1 ]
Wu, Haiyang [1 ]
Zhou, Lin [1 ]
Wu, Zhenyang [1 ]
机构
[1] Southeast Univ, Sch Informat Sci & Engn, Nanjing 210096, Peoples R China
基金
中国国家自然科学基金;
关键词
Model adaptation; Vector Taylor series; Multi-environment model; Speech recognition; MAXIMUM-LIKELIHOOD; HMM ADAPTATION; NOISE; COMPENSATION;
D O I
10.1016/j.patcog.2010.03.023
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we propose a multi-environment model adaptation method based on vector Taylor series (VTS) for robust speech recognition. In the training phase, the clean speech is contaminated with noise at different signal-to-noise ratio (SNR) levels to produce several types of noisy training speech and each type is used to obtain a noisy hidden Markov model (HMM) set. In the recognition phase, the HMM set which best matches the testing environment is selected, and further adjusted to reduce the environmental mismatch by the VTS-based model adaptation method. In the proposed method, the VTS approximation based on noisy training speech is given and the testing noise parameters are estimated from the noisy testing speech using the expectation-maximization (EM) algorithm. The experimental results indicate that the proposed multi-environment model adaptation method can significantly improve the performance of speech recognizers and outperforms the traditional model adaptation method and the linear regression-based multi-environment method. (C) 2010 Elsevier Ltd. All rights reserved.
引用
收藏
页码:3093 / 3099
页数:7
相关论文
共 50 条
  • [31] Speech recognition in noisy environments using first-order vector Taylor series
    Kim, DY
    Un, CK
    Kim, NS
    [J]. SPEECH COMMUNICATION, 1998, 24 (01) : 39 - 49
  • [32] Noisy speech recognition based on robust end-point detection and model adaptation
    Zhang, ZP
    Furui, S
    [J]. 2005 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1-5: SPEECH PROCESSING, 2005, : 441 - 444
  • [33] A CSI-Based Multi-Environment Human Activity Recognition Framework
    Alsaify, Baha A.
    Almazari, Mahmoud M.
    Alazrai, Rami
    Alouneh, Sahel
    Daoud, Mohammad I.
    [J]. APPLIED SCIENCES-BASEL, 2022, 12 (02):
  • [34] Joint speaker and environment adaptation using Tensor Voice for robust speech recognition
    Jeong, Yongwon
    [J]. SPEECH COMMUNICATION, 2014, 58 : 1 - 10
  • [35] OHRS-MEWA: On-line Handwriting Recognition System with Multi-Environment Writer Adaptation
    Haddad, Lobna
    Hamdani, Tarek M.
    Alimi, Adel M.
    [J]. 2014 14TH INTERNATIONAL CONFERENCE ON FRONTIERS IN HANDWRITING RECOGNITION (ICFHR), 2014, : 335 - 340
  • [36] Feature compensation algorithm based on vector Taylor series for speaker recognition
    Wu, Haiyang
    Yang, Feiran
    Zhou, Lin
    Wu, Zhenyang
    [J]. Shengxue Xuebao/Acta Acustica, 2013, 38 (01): : 105 - 112
  • [37] Selection of spectral compressive operator for vector Taylor series-based model adaptation in noisy environments
    Baek, Soonho
    Kang, Hong-Goo
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2014, 135 (06): : EL284 - EL290
  • [38] Selection of spectral compressive operator for vector Taylor series-based model adaptation in noisy environments
    [J]. Kang, H.-G. (hgkang@yonsei.ac.kr), 1600, Acoustical Society of America (135):
  • [39] Searching for robust associations with a multi-environment knockoff filter
    Li, S.
    Sesia, M.
    Romano, Y.
    Candes, E.
    Sabatti, C.
    [J]. BIOMETRIKA, 2022, 109 (03) : 611 - 629
  • [40] A novel HMM model adaptation and compensation method for robust speech recognition
    Ning, GX
    Wei, G
    [J]. INTERNATIONAL SYMPOSIUM ON COMMUNICATIONS AND INFORMATION TECHNOLOGIES 2005, VOLS 1 AND 2, PROCEEDINGS, 2005, : 274 - 277