SUPERVISED SPEECH DEREVERBERATION IN NOISY ENVIRONMENTS USING EXEMPLAR-BASED SPARSE REPRESENTATIONS

被引：0

作者：

Baby, Deepak ^{[1
]}

Van Hamme, Hugo ^{[1
]}

机构：

[1] Katholieke Univ Leuven, Dept ESAT, Leuven, Belgium

来源：

2016 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING PROCEEDINGS | 2016年

关键词：

speech dereverberation; non-negative matrix deconvolution; non-negative matrix factorisation; SEPARATION; MIXTURES;

D O I：

暂无

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

Exemplar-based techniques, where the noisy speech is decomposed as a linear combination of the speech and noise exemplars stored in a dictionary, have been successfully used for speech enhancement in noisy environments. This paper extends this technique to achieve speech dereverberation in noisy environments by means of a non-negative approximation of the noisy reverberant speech in the frequency domain. A novel approach for estimating the room impulse response (RIR) together with the speech and noise estimates using a non-negative matrix deconvolution (NMD) -based technique is proposed. In addition, we extend an existing technique based on non-negative matrix factorisation (NMF) that performs speech dereverberation in noise-free environments to noisy scenarios. New estimators for jointly obtaining the RIR and exemplar weights for the NMD and NMF -based formulations are presented. The proposed techniques are evaluated on the noise-free and noisy reverberant speech in the CHiME-2 WSJ0 database and are shown to yield better speech enhancement in terms of signal-to-distortion ratio (SDR), perceptual evaluation of speech quality (PESQ) and cepstral distance (CD) measures.

引用

页码：156 / 160

页数：5

共 50 条

[1] Exemplar-Based Voice Conversion Using Sparse Representation in Noisy Environments
Takashima, Ryoichi
Takiguchi, Tetsuya
Ariki, Yasuo
[J]. IEICE TRANSACTIONS ON FUNDAMENTALS OF ELECTRONICS COMMUNICATIONS AND COMPUTER SCIENCES, 2013, E96A (10) : 1946 - 1953
[2] Joint Denoising and Dereverberation Using Exemplar-Based Sparse Representations and Decaying Norm Constraint
Baby, Deepak
Van Hamme, Hugo
[J]. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2017, 25 (10) : 2024 - 2035
[3] EMBEDDING TIME WARPING IN EXEMPLAR-BASED SPARSE REPRESENTATIONS OF SPEECH
Yilmaz, Emre
Gemmeke, Jort F.
Van Hamme, Hugo
[J]. 2013 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2013, : 8076 - 8080
[4] Exemplar-Based Sparse Representations for Noise Robust Automatic Speech Recognition
Gemmeke, Jort F.
Virtanen, Tuomas
Hurmalainen, Antti
[J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2011, 19 (07): : 2067 - 2080
[5] Exemplar-Based Sparse Representations for Detection of Parkinson's Disease From Speech
Reddy, Mittapalle Kiran
Alku, Paavo
[J]. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2023, 31 : 1386 - 1396
[6] Multimodal Exemplar-based Voice Conversion using Lip Features in Noisy Environments
Masaka, Kenta
Aihara, Ryo
Takiguchi, Tetsuya
Ariki, Yasuo
[J]. 15TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2014), VOLS 1-4, 2014, : 1159 - 1163
[7] ON EXEMPLAR-BASED EXEMPLAR REPRESENTATIONS - REPLY
NOSOFSKY, RM
[J]. JOURNAL OF EXPERIMENTAL PSYCHOLOGY-GENERAL, 1988, 117 (04) : 412 - 414
[8] NOISE-ROBUST SPEECH RECOGNITION WITH EXEMPLAR-BASED SPARSE REPRESENTATIONS USING ALPHA-BETA DIVERGENCE
Yilmaz, Emre
Gemmeke, Jort F.
Van Hamme, Hugo
[J]. 2014 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2014,
[9] Reducing Computational Complexities of Exemplar-Based Sparse Representations With Applications to Large Vocabulary Speech Recognition
Sainath, Tara N.
Ramabhadran, Bhuvana
Nahamoo, David
Kanevsky, Dimitri
[J]. 12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 792 - 795
[10] Noise Robust Exemplar Matching Using Sparse Representations of Speech
Yilmaz, Emre
Gemmeke, Jort Florent
Van Hamme, Hugo
[J]. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2014, 22 (08) : 1306 - 1319

← 1 2 3 4 5 →