Adaptive model-based speech enhancement

Cited by: 6
Authors
Logan, B
Robinson, T
Affiliations
[1] Compaq Comp Corp, Cambridge Res Lab, Cambridge, MA 02142 USA
[2] Univ Cambridge, Dept Engn, Cambridge CB2 1PZ, England
Keywords
speech enhancement; autoregressive hidden Markov models; robust speech recognition
DOI
10.1016/S0167-6393(00)00038-8
Chinese Library Classification (CLC) number
O42 [Acoustics]
Discipline classification codes
070206; 082403
Abstract
We investigate the enhancement of speech corrupted by unknown independent additive noise when only a single microphone is available. We present adaptive enhancement systems based on an existing non-adaptive technique [Ephraim, Y., 1992a. IEEE Transactions on Signal Processing 40 (4), 725-735]. This approach models the speech and noise statistics using autoregressive hidden Markov models (AR-HMMs). We develop two main extensions. The first estimates the noise statistics from detected pauses. The second forms maximum likelihood (ML) estimates of the unknown noise parameters using the whole utterance. Both techniques operate within the AR-HMM framework. We have previously shown that the ability of AR-HMMs to model speech can be improved by the incorporation of perceptual frequency using the bilinear transform. We incorporate this improvement into our enhancement systems. We evaluate our techniques on the NOISEX-92 and Resource Management (RM) databases, giving indications of performance on simple and more complex tasks, respectively. Both proposed enhancement schemes improve substantially on baseline results. The technique of forming ML estimates of the noise parameters is found to be the most effective. Its performance is evaluated over a wide range of noise conditions, from -6 to 18 dB, and on various types of stationary real-world noise. © 2001 Elsevier Science B.V. All rights reserved.
Pages: 351-368
Number of pages: 18
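
The first extension named in the abstract, estimating noise statistics from detected pauses, can be illustrated with a minimal sketch. This is not the authors' system: the energy-quantile pause detector, the single (non-HMM) AR noise model, and the function names `frame_signal`, `detect_pauses`, and `ar_from_frames` are all assumptions made for illustration. Choosing only the lowest-energy frames also biases the estimated gain somewhat downward.

```python
import numpy as np

def frame_signal(x, frame_len):
    """Split x into non-overlapping frames, dropping the ragged tail."""
    n_frames = len(x) // frame_len
    return x[: n_frames * frame_len].reshape(n_frames, frame_len)

def detect_pauses(frames, quantile=0.2):
    """Flag the lowest-energy frames as pauses (a crude, assumed detector)."""
    energy = np.mean(frames ** 2, axis=1)
    return energy <= np.quantile(energy, quantile)

def ar_from_frames(frames, order):
    """Fit AR coefficients and gain with the autocorrelation method."""
    n = frames.shape[1]
    # Biased autocorrelation at lags 0..order, averaged over the frames.
    r = np.array([
        np.mean(np.sum(frames[:, : n - k] * frames[:, k:], axis=1)) / n
        for k in range(order + 1)
    ])
    # Solve the Yule-Walker equations R a = r[1:] for the predictor a.
    R = np.array([[r[abs(i - j)] for j in range(order)] for i in range(order)])
    a = np.linalg.solve(R, r[1:])
    gain = r[0] - a @ r[1:]  # prediction-error (innovation) variance
    return a, gain
```

Given a noisy signal `x`, a call such as `ar_from_frames(frame_signal(x, 256)[detect_pauses(frame_signal(x, 256))], order=4)` would return the AR noise coefficients and gain estimated from the frames flagged as pauses.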