Noise Robust Speech Recognition Based on Parallel Model Combination Adaptation Using Frequency-Variant

被引:0
|
作者
Choi, Sook-Nam
Chung, Hyun-Yeol
机构
来源
关键词
Parallel model combination; Gaussian mixture model; Frequency-variant; Environment-awareness; Noise model; FV-PMC;
D O I
10.7776/ASK.2013.32.3.252
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
The common speech recognition system displays higher recognition performance in a quiet environment, while its performance declines sharply in a real environment where there are noises. To implement a speech recognizer that is robust in different speech settings, this study suggests the method of Parallel Model Combination adaptation using frequency-variant based on environment-awareness (FV-PMC), which uses variants in frequency; acquires the environmental data for speech recognition; applies it to upgrading the speech recognition model; and promotes its performance enhancement. This FV-PMC performs the speech recognition with the recognition model which is generated as followings: i) calculating the average frequency variant in advance among the readily-classified noise groups and setting it as a threshold value; ii) recalculating the frequency variant among noise groups when speech with unknown noises are input; iii) regarding the speech higher than the threshold value of the relevant group as the speech including the noise of its group; and iv) using the speech that includes this noise group. When noises were classified with the proposed FV-PMC, the average accuracy of classification was 56%, and the results from the speech recognition experiments showed the average recognition rate of Set A was 79.05%, the rate of Set B 79.43% m, and the rate of Set C 83.37% respectively. The grand mean of recognition rate was 80.62%, which demonstrates 5.69% more improved effects than the recognition rate of 74.93% of the existing Parallel Model Combination with a clear model, meaning that the proposed method is effective.
引用
收藏
页码:252 / 261
页数:10
相关论文
共 50 条
  • [1] Noise-robust speech recognition by discriminative adaptation in parallel model combination
    Chung, YJ
    [J]. ELECTRONICS LETTERS, 2000, 36 (04) : 370 - 371
  • [2] ROBUST SPEECH RECOGNITION IN ADDITIVE AND CONVOLUTIONAL NOISE USING PARALLEL MODEL COMBINATION
    GALES, MJF
    YOUNG, SJ
    [J]. COMPUTER SPEECH AND LANGUAGE, 1995, 9 (04): : 289 - 307
  • [3] Comparing Jacorian adaptation with cepstral mean normalization and parallel model combination for noise robust speech recognition
    Pärssinen, K
    Salmela, P
    Harju, M
    Kiss, I
    [J]. 2002 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-IV, PROCEEDINGS, 2002, : 193 - 196
  • [4] Robust continuous speech recognition using parallel model combination
    Gales, MJF
    Young, SJ
    [J]. IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 1996, 4 (05): : 352 - 359
  • [5] APPROXIMATED PARALLEL MODEL COMBINATION FOR EFFICIENT NOISE-ROBUST SPEECH RECOGNITION
    Sim, Khe Chai
    [J]. 2013 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2013, : 7383 - 7387
  • [6] ROBUST SPEECH RECOGNITION USING DYNAMIC NOISE ADAPTATION
    Rennie, Steven
    Dognin, Pierre
    Fousek, Petr
    [J]. 2011 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2011, : 4592 - 4595
  • [7] Robust speech recognition by model adaptation and normalization using pre-observed noise
    Kobashikawa, Satoshi
    Takahashi, Satoshi
    [J]. IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2008, E91D (03): : 422 - 429
  • [8] NOISE ADAPTATION ALGORITHMS FOR ROBUST SPEECH RECOGNITION
    CUNG, HM
    NORMANDIN, Y
    [J]. SPEECH COMMUNICATION, 1993, 12 (03) : 267 - 276
  • [9] A Noise Robust Speech Recognition Method Using Model Compensation Based on Speech Enhancement
    Shen, Guanghu
    Jung, Ho-Youl
    Chung, Hyun-Yeol
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF KOREA, 2008, 27 (04): : 191 - 199
  • [10] INTEGRATED DNN-BASED MODEL ADAPTATION TECHNIQUE FOR NOISE-ROBUST SPEECH RECOGNITION
    Lee, Kang Hyun
    Kang, Woo Hyun
    Kang, Tae Gyoon
    Kim, Nam Soo
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2017, : 5245 - 5249