Multi-environment model adaptation based on vector Taylor series for robust speech recognition

被引:4
|
作者
Lue, Yong [1 ]
Wu, Haiyang [1 ]
Zhou, Lin [1 ]
Wu, Zhenyang [1 ]
机构
[1] Southeast Univ, Sch Informat Sci & Engn, Nanjing 210096, Peoples R China
基金
中国国家自然科学基金;
关键词
Model adaptation; Vector Taylor series; Multi-environment model; Speech recognition; MAXIMUM-LIKELIHOOD; HMM ADAPTATION; NOISE; COMPENSATION;
D O I
10.1016/j.patcog.2010.03.023
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we propose a multi-environment model adaptation method based on vector Taylor series (VTS) for robust speech recognition. In the training phase, the clean speech is contaminated with noise at different signal-to-noise ratio (SNR) levels to produce several types of noisy training speech and each type is used to obtain a noisy hidden Markov model (HMM) set. In the recognition phase, the HMM set which best matches the testing environment is selected, and further adjusted to reduce the environmental mismatch by the VTS-based model adaptation method. In the proposed method, the VTS approximation based on noisy training speech is given and the testing noise parameters are estimated from the noisy testing speech using the expectation-maximization (EM) algorithm. The experimental results indicate that the proposed multi-environment model adaptation method can significantly improve the performance of speech recognizers and outperforms the traditional model adaptation method and the linear regression-based multi-environment method. (C) 2010 Elsevier Ltd. All rights reserved.
引用
收藏
页码:3093 / 3099
页数:7
相关论文
共 50 条
  • [1] SECOND ORDER VECTOR TAYLOR SERIES BASED ROBUST SPEECH RECOGNITION
    Bu, Suliang
    Qian, Yanmin
    Sim, Khe Chai
    You, Yongbin
    Yu, Kai
    [J]. 2014 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2014,
  • [2] ON NOISE ESTIMATION FOR ROBUST SPEECH RECOGNITION USING VECTOR TAYLOR SERIES
    Zhao, Yong
    Juang, Biing-Hwang
    [J]. 2010 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2010, : 4290 - 4293
  • [3] A vector Taylor series approach for environment-independent speech recognition
    Moreno, PJ
    Raj, B
    Stern, RM
    [J]. 1996 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, CONFERENCE PROCEEDINGS, VOLS 1-6, 1996, : 733 - 736
  • [4] A new algorithm using improved Vector Taylor Series for robust speech recognition
    Li, YY
    Li, B
    Wang, CY
    Tang, CJ
    [J]. 2003 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS, INTELLIGENT SYSTEMS AND SIGNAL PROCESSING, VOLS 1 AND 2, PROCEEDINGS, 2003, : 1146 - 1150
  • [5] Vector Taylor Series Expansion with Auditory Masking for Noise Robust Speech Recognition
    Das, Biswajit
    Panda, Ashish
    [J]. 2016 10TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING (ISCSLP), 2016,
  • [6] A NOISE-ROBUST SPEECH RECOGNITION METHOD COMPOSED OF WEAK NOISE SUPPRESSION AND WEAK VECTOR TAYLOR SERIES ADAPTATION
    Komeiji, Shuji
    Arakawa, Takayuki
    Koshinaka, Takafumi
    [J]. 2012 IEEE WORKSHOP ON SPOKEN LANGUAGE TECHNOLOGY (SLT 2012), 2012, : 103 - 106
  • [7] Robust Speech Recognition Using Improved Vector Taylor Series Algorithm for Embedded Systems
    Lue, Yong
    Wu, Haiyang
    Wu, Zhenyang
    [J]. IEEE TRANSACTIONS ON CONSUMER ELECTRONICS, 2010, 56 (02) : 764 - 769
  • [8] Robust i-vector based Adaptation of DNN Acoustic Model for Speech Recognition
    Garimella
    Mandal, Arindam
    Strom, Nikko
    Hoffmeister, Bjorn
    Matsoukas, Spyros
    Parthasarathi, Hari Krishnan
    [J]. 16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 2877 - 2881
  • [9] Use of Generalised Nonlinearity in Vector Taylor Series Noise Compensation for Robust Speech Recognition
    Loweimi, Erfan
    Barker, Jon
    Hain, Thomas
    [J]. 17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES, 2016, : 3798 - 3802
  • [10] Feature adaptation using deviation vector for robust speech recognition in noisy environment
    Hwang, TH
    Lee, LM
    Wang, HC
    [J]. 1997 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I - V: VOL I: PLENARY, EXPERT SUMMARIES, SPECIAL, AUDIO, UNDERWATER ACOUSTICS, VLSI; VOL II: SPEECH PROCESSING; VOL III: SPEECH PROCESSING, DIGITAL SIGNAL PROCESSING; VOL IV: MULTIDIMENSIONAL SIGNAL PROCESSING, NEURAL NETWORKS - VOL V: STATISTICAL SIGNAL AND ARRAY PROCESSING, APPLICATIONS, 1997, : 1227 - 1230