Transforming Features to Compensate Speech Recogniser Models for Noise

被引:0
|
作者
van Dalen, R. C. [1 ]
Flego, F. [1 ]
Gales, M. J. F. [1 ]
机构
[1] Univ Cambridge, Dept Engn, Cambridge CB2 1TN, England
关键词
speech recognition; noise robustness;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
To make speech recognisers robust to noise, either the features or the models can be compensated. Feature enhancement is often fast; model compensation is often more accurate, because it predicts the corrupted speech distribution. It is therefore able, for example, to take uncertainty about the clean speech into account. This paper re-analyses the recently-proposed predictive linear transformations for noise compensation as minimising the divergence between the predicted corrupted speech and the adapted models. New schemes are then introduced which apply observation-dependent transformations in the front-end to adapt the back-end distributions. One applies transforms in the exact same manner as the popular minimum mean square error (MMSE) feature enhancement scheme, and is as fast. The new method performs better on AURORA 2.
引用
收藏
页码:2459 / 2462
页数:4
相关论文
共 50 条
  • [1] Combined optimisation of baseforms and subword models for an HMM based speech recogniser
    Holter, T
    Svendsen, T
    [J]. ISSPA 96 - FOURTH INTERNATIONAL SYMPOSIUM ON SIGNAL PROCESSING AND ITS APPLICATIONS, PROCEEDINGS, VOLS 1 AND 2, 1996, : 321 - 324
  • [2] A speech enhancement algorithm to reduce noise and compensate for partial masking effect
    JEON Yu-yong
    LEE Sang-min
    [J]. Journal of Central South University, 2011, 18 (04) : 1121 - 1127
  • [3] A speech enhancement algorithm to reduce noise and compensate for partial masking effect
    Jeon, Yu-yong
    Lee, Sang-min
    [J]. JOURNAL OF CENTRAL SOUTH UNIVERSITY OF TECHNOLOGY, 2011, 18 (04): : 1121 - 1127
  • [4] A speech enhancement algorithm to reduce noise and compensate for partial masking effect
    Yu-yong Jeon
    Sang-min Lee
    [J]. Journal of Central South University of Technology, 2011, 18 : 1121 - 1127
  • [5] SPEECH ENHANCEMENT IN VEHICULAR ENVIRONMENTS AS A FRONT END FOR ROBUST SPEECH RECOGNISER
    Lena, D. S. K.
    Vijayalakshmi, P.
    [J]. 2017 INTERNATIONAL CONFERENCE ON INTELLIGENT COMPUTING AND CONTROL SYSTEMS (ICICCS), 2017, : 430 - 435
  • [6] Memristor pattern recogniser: isolated speech word recognition
    Maan, A. K.
    James, A. P.
    Dimitijev, S.
    [J]. ELECTRONICS LETTERS, 2015, 51 (17) : 1370 - 1371
  • [7] Performance evaluation of an automatic speech recogniser incorporating a fast adaptive speech separation algorithm
    Dogançay, K
    Littlefield, J
    Hashemi-Sakhtsari, A
    [J]. 2003 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL VI, PROCEEDINGS: SIGNAL PROCESSING THEORY AND METHODS, 2003, : 205 - 208
  • [8] Extracting speech features from human speech like noise
    Kobayashi, D
    Kajita, S
    Takeda, K
    Itakura, F
    [J]. ICSLP 96 - FOURTH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, VOLS 1-4, 1996, : 418 - 421
  • [9] Stochastic features for noise robust speech recognition
    Iwahashi, N
    Pao, H
    Honda, H
    Minamino, K
    Omote, M
    [J]. PROCEEDINGS OF THE 1998 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-6, 1998, : 633 - 636
  • [10] Evaluation of Lombard Speech Models in the Context of Speech in Noise Enhancement
    Korvel, Grazina
    Kakol, Krzysztof
    Kurasova, Olga
    Kostek, Bozena
    [J]. IEEE ACCESS, 2020, 8 : 155156 - 155170