Multi-environment model adaptation based on vector Taylor series for robust speech recognition

被引:4
|
作者
Lue, Yong [1 ]
Wu, Haiyang [1 ]
Zhou, Lin [1 ]
Wu, Zhenyang [1 ]
机构
[1] Southeast Univ, Sch Informat Sci & Engn, Nanjing 210096, Peoples R China
基金
中国国家自然科学基金;
关键词
Model adaptation; Vector Taylor series; Multi-environment model; Speech recognition; MAXIMUM-LIKELIHOOD; HMM ADAPTATION; NOISE; COMPENSATION;
D O I
10.1016/j.patcog.2010.03.023
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we propose a multi-environment model adaptation method based on vector Taylor series (VTS) for robust speech recognition. In the training phase, the clean speech is contaminated with noise at different signal-to-noise ratio (SNR) levels to produce several types of noisy training speech and each type is used to obtain a noisy hidden Markov model (HMM) set. In the recognition phase, the HMM set which best matches the testing environment is selected, and further adjusted to reduce the environmental mismatch by the VTS-based model adaptation method. In the proposed method, the VTS approximation based on noisy training speech is given and the testing noise parameters are estimated from the noisy testing speech using the expectation-maximization (EM) algorithm. The experimental results indicate that the proposed multi-environment model adaptation method can significantly improve the performance of speech recognizers and outperforms the traditional model adaptation method and the linear regression-based multi-environment method. (C) 2010 Elsevier Ltd. All rights reserved.
引用
收藏
页码:3093 / 3099
页数:7
相关论文
共 50 条
  • [41] INTEGRATING DENOISING AUTOENCODER AND VECTOR TAYLOR SERIES WITH AUDITORY MASKING FOR SPEECH RECOGNITION IN NOISY CONDITIONS
    Das, A. Biswajit
    Panda, Ashish
    [J]. 2018 26TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2018, : 2305 - 2309
  • [42] INTEGRATED DNN-BASED MODEL ADAPTATION TECHNIQUE FOR NOISE-ROBUST SPEECH RECOGNITION
    Lee, Kang Hyun
    Kang, Woo Hyun
    Kang, Tae Gyoon
    Kim, Nam Soo
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2017, : 5245 - 5249
  • [43] HMM adaptation using a phase-sensitive acoustic distortion model for environment-robust speech recognition
    Li, Jinyu
    Deng, Li
    Yu, Dong
    Gong, Yifan
    Acero, Alex
    [J]. 2008 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-12, 2008, : 4069 - 4072
  • [44] Handling outliers in multi-environment trial data analysis: in the direction of robust SREG model
    Angelini, Julia
    Faviere, Gabriela
    Bortolotto, Eugenia
    Lucio Cervigni, Gerardo Domingo
    Beatriz Quaglino, Marta
    [J]. JOURNAL OF CROP IMPROVEMENT, 2023, 37 (01) : 74 - 98
  • [45] NOISE ADAPTATION ALGORITHMS FOR ROBUST SPEECH RECOGNITION
    CUNG, HM
    NORMANDIN, Y
    [J]. SPEECH COMMUNICATION, 1993, 12 (03) : 267 - 276
  • [46] SPEECH ENHANCEMENT FOR ROBUST SPEECH RECOGNITION IN MOTORCYCLE ENVIRONMENT
    Mporas, Iosif
    Ganchev, Todor
    Kocsis, Otilia
    Fakotakis, Nikos
    [J]. INTERNATIONAL JOURNAL ON ARTIFICIAL INTELLIGENCE TOOLS, 2010, 19 (02) : 159 - 173
  • [47] Feature Adaptation for Robust Mobile Speech Recognition
    Lee, Hyeopwoo
    Yook, Dongsuk
    [J]. IEEE TRANSACTIONS ON CONSUMER ELECTRONICS, 2012, 58 (04) : 1393 - 1398
  • [48] Mixed model formulations for multi-environment trials
    Basford, KE
    Federer, WT
    DeLacy, IH
    [J]. AGRONOMY JOURNAL, 2004, 96 (01) : 143 - 147
  • [49] Cepstral vector normalization based on stereo data for robust speech recognition
    Buera, Luis
    Lleida, Eduardo
    Miguel, Antonio
    Ortega, Alfonso
    Saz, Oscar
    [J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2007, 15 (03): : 1098 - 1113
  • [50] Robust Almost-Sure Reachability in Multi-Environment MDPs
    van der Vegt, Marck
    Jansen, Nils
    Junges, Sebastian
    [J]. TOOLS AND ALGORITHMS FOR THE CONSTRUCTION AND ANALYSIS OF SYSTEMS, PT I, TACAS 2023, 2023, 13993 : 508 - 526