Model-based feature enhancement for noisy speech recognition

被引:0
|
作者
Couvreur, C [1 ]
Van hamme, H [1 ]
机构
[1] Lernout & Hauspie Speech Prod, B-1780 Wemmel, Belgium
关键词
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
In this paper, a new feature enhancement algorithm called model-based feature enhancement (MBFE) is introduced for noise robust speech recognition. In MBFE, statistical models (i.e., Gaussian HMM's) of the clean speech feature vectors and of the perturbing noise feature vectors are used to construct the optimal MMSE estimator of the clean speech feature vectors. The estimated clean speech features are then fed to a recognizer. The performance of MBFE is studied experimentally on a connected-digits recognition task in several additive noise conditions (synthetic white and impulsive noise, car noise, and machine tool noise are considered). The performance of MBFE is also compared to that of a state-of-the-art implementation of non-linear spectral subtraction.
引用
收藏
页码:1719 / 1722
页数:4
相关论文
共 50 条
  • [31] COMPARISON OF DIFFERENT SPEECH ENHANCEMENT METHODS ON RECOGNITION OF NOISY SPEECH
    AHMED, MS
    ALMARZOUG, AM
    [J]. ARABIAN JOURNAL FOR SCIENCE AND ENGINEERING, 1994, 19 (01): : 45 - 56
  • [32] Feature enhancement for noisy speech recognition with a time-variant linear predictive HMM structure
    Deng, Jianping
    Bouchard, Martin
    Yeap, Tet Hin
    [J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2008, 16 (05): : 891 - 899
  • [33] Multi-objective Noisy based Deep Feature Loss for Speech Enhancement
    Pilarczyk, Rafa L.
    Skarbek, Wladyslaw
    [J]. PHOTONICS APPLICATIONS IN ASTRONOMY, COMMUNICATIONS, INDUSTRY, AND HIGH-ENERGY PHYSICS EXPERIMENTS 2019, 2019, 11176
  • [34] Speech Enhancement Based on Masking Approach Considering Speech Quality and Acoustic Confidence for Noisy Speech Recognition
    Chu, Shih-Chuan
    Wu, Chung-Hsien
    Lin, Yun-Wen
    [J]. 2021 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2021, : 536 - 540
  • [35] Combining Feature and Model-Based Adaptation of RNNLMs for Multi-Genre Broadcast Speech Recognition
    Deena, Saud
    Hasan, Madina
    Doulaty, Mortaza
    Saz, Oscar
    Hain, Thomas
    [J]. 17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES, 2016, : 2343 - 2347
  • [36] HCRF-based Model Compensation for Noisy Speech Recognition
    Hong, Wei-Tyng
    [J]. 2013 IEEE 17TH INTERNATIONAL SYMPOSIUM ON CONSUMER ELECTRONICS (ISCE), 2013, : 277 - 278
  • [37] A Model-Based Soft Decision Approach for Speech Enhancement
    Xianyun Wang
    Changchun Bao
    Feng Bao
    [J]. China Communications, 2017, 14 (09) : 11 - 22
  • [38] A Model-Based Soft Decision Approach for Speech Enhancement
    Wang, Xianyun
    Bao, Changchun
    Bao, Feng
    [J]. CHINA COMMUNICATIONS, 2017, 14 (09) : 11 - 22
  • [39] Speech enhancement strategy for speech recognition microcontroller under noisy environments
    Chan, Kit Yan
    Nordholm, Sven
    Yiu, Ka Fai Cedric
    Togneri, Roberto
    [J]. NEUROCOMPUTING, 2013, 118 : 279 - 288
  • [40] Speech Enhancement and Recognition of Compressed Speech Signal in Noisy Reverberant Conditions
    Suman, Maloji
    Khan, Habibulla
    Latha, M. Madhavi
    Kumari, Devarakonda Aruna
    [J]. PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON INFORMATION SYSTEMS DESIGN AND INTELLIGENT APPLICATIONS 2012 (INDIA 2012), 2012, 132 : 379 - +