Adaptive model-based speech enhancement

Cited by: 6

Authors:

Logan, B

Robinson, T

Affiliations:

[1] Compaq Comp Corp, Cambridge Res Lab, Cambridge, MA 02142 USA

[2] Univ Cambridge, Dept Engn, Cambridge CB2 1PZ, England

Source:

SPEECH COMMUNICATION | 2001 / Vol. 34 / Issue 04

Keywords:

speech enhancement; autoregressive hidden Markov models; robust speech recognition

DOI:

10.1016/S0167-6393(00)00038-8

Chinese Library Classification (CLC) number:

O42 [Acoustics]

Discipline classification codes:

070206; 082403

Abstract:

We investigate the enhancement of speech corrupted by unknown independent additive noise when only a single microphone is available. We present adaptive enhancement systems based on an existing non-adaptive technique [Ephraim, Y., 1992a. IEEE Transactions on Signal Processing 40 (4), 725-735]. This approach models the speech and noise statistics using autoregressive hidden Markov models (AR-HMMs). We develop two main extensions. The first estimates the noise statistics from detected pauses. The second forms maximum likelihood (ML) estimates of the unknown noise parameters using the whole utterance. Both techniques operate within the AR-HMM framework. We have previously shown that the ability of AR-HMMs to model speech can be improved by the incorporation of perceptual frequency using the bilinear transform. We incorporate this improvement into our enhancement systems. We evaluate our techniques on the NOISEX-92 and Resource Management (RM) databases, giving indications of performance on simple and more complex tasks, respectively. Both proposed enhancement schemes improve substantially on baseline results. The technique of forming ML estimates of the noise parameters is found to be the most effective. Its performance is evaluated over a wide range of noise conditions, from -6 to 18 dB, and on various types of stationary real-world noise. © 2001 Elsevier Science B.V. All rights reserved.
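
The abstract does not say how pauses are detected or how the noise statistics are computed from them. As a rough illustration of the first extension only, the sketch below assumes a simple energy-threshold pause detector and fits a single autoregressive (AR) noise model by the Levinson-Durbin recursion, rather than the AR-HMM used in the paper. All function names, the threshold `ratio`, the frame length, and the synthetic test signal are hypothetical choices made for this example.

```python
import math
import random


def frame_signal(x, frame_len):
    """Split x into non-overlapping frames, dropping any incomplete tail."""
    return [x[i:i + frame_len] for i in range(0, len(x) - frame_len + 1, frame_len)]


def detect_pauses(frames, ratio=3.0):
    """Assumed detector (not from the paper): a frame is a pause when its
    mean energy is at most `ratio` times that of the quietest frame."""
    energy = [sum(v * v for v in f) / len(f) for f in frames]
    floor = min(energy)
    return [i for i, e in enumerate(energy) if e <= ratio * floor]


def autocorrelation(frame, max_lag):
    """Biased autocorrelation estimates for lags 0..max_lag."""
    n = len(frame)
    return [sum(frame[t] * frame[t - k] for t in range(k, n)) / n
            for k in range(max_lag + 1)]


def levinson_durbin(r, order):
    """Solve the Yule-Walker equations. Returns (a, err), where
    x[t] is predicted as sum(a[j] * x[t - 1 - j]) and err is the
    prediction-error (innovation) variance."""
    if r[0] <= 0.0:
        raise ValueError("zero-energy input: cannot fit an AR model")
    a, err = [], r[0]
    for i in range(1, order + 1):
        acc = r[i] - sum(a[j] * r[i - 1 - j] for j in range(i - 1))
        k = acc / err  # reflection coefficient
        a = [a[j] - k * a[i - 2 - j] for j in range(i - 1)] + [k]
        err *= 1.0 - k * k
    return a, err


def estimate_noise_ar(x, frame_len=200, order=2):
    """Pool autocorrelations over detected pause frames, then fit an AR model."""
    frames = frame_signal(x, frame_len)
    pauses = detect_pauses(frames)
    pooled = [0.0] * (order + 1)
    for i in pauses:
        for k, v in enumerate(autocorrelation(frames[i], order)):
            pooled[k] += v / len(pauses)
    coeffs, innovation_var = levinson_durbin(pooled, order)
    return pauses, coeffs, innovation_var


def synthetic_noisy_signal(n_frames=40, frame_len=200, seed=0):
    """AR(1) noise (coefficient 0.5, unit innovation variance) with a loud
    tone standing in for speech during two bursts of frames."""
    rng = random.Random(seed)
    speech_frames = set(range(10, 20)) | set(range(25, 35))
    x, prev = [], 0.0
    for t in range(n_frames * frame_len):
        prev = 0.5 * prev + rng.gauss(0.0, 1.0)
        tone = 10.0 * math.sin(0.3 * t) if t // frame_len in speech_frames else 0.0
        x.append(prev + tone)
    return x, speech_frames


signal, speech_frames = synthetic_noisy_signal()
pauses, coeffs, innovation_var = estimate_noise_ar(signal)
print("pause frames:", pauses)
print("AR coefficients:", coeffs, "innovation variance:", innovation_var)
```

On this synthetic signal the first estimated coefficient should land near the true value of 0.5. A pause-based estimate of this kind only sees the noise between speech segments, which is one plausible reason a whole-utterance ML estimate, the abstract's second extension, could do better.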

Pages: 351-368

Page count: 18
