COUPLED DICTIONARY TRAINING FOR EXEMPLAR-BASED SPEECH ENHANCEMENT

被引：0

作者：

Baby, Deepak ^{[1
]}

Virtanen, Tuomas ^{[2
]}

Barker, Tom ^{[2
]}

Van Hamme, Hugo ^{[1
]}

机构：

[1] Katholieke Univ Leuven, Dept ESAT, Louvain, Belgium

[2] Tampere Univ Technol, Dept Signal Proc, Tampere, Finland

来源：

2014 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP) | 2014年

关键词：

Non-negative matrix factorisation; coupled dictionary training; speech enhancement; modulation envelope; SOURCE SEPARATION;

D O I：

暂无

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

In exemplar-based speech enhancement systems, lower dimensional features are preferred over the full-scale DFT features for their reduced computational complexity and the ability to better generalize for the unseen cases. But in order to obtain the Wiener-like filter for noisy DFT enhancement, the speech and noise estimates obtained in the feature space need to be mapped to the DFT space, which yield a low-rank approximation of the estimates resulting in a sub-optimal filter. This paper proposes a novel method using coupled dictionaries where the exemplars for the required feature space and the DFT space are jointly extracted and the estimates are directly obtained in the DFT space following the decomposition in the chosen feature space. Simulation experiments revealed that the proposed approach, where the activations of exemplars calculated using the Mel resolution are directly used to obtain the Wiener filter in the DFT space, results in improved signal-to-distortion ratio (SDR) when compared to the system without coupled dictionaries. To further motivate the use of coupled dictionaries, the paper also investigates the use of modulation envelope features for the exemplar-based speech enhancement.

引用

页数：5

共 50 条

[1] Coupled Dictionaries for Exemplar-Based Speech Enhancement and Automatic Speech Recognition
Baby, Deepak
Virtanen, Tuomas
Gemmeke, Jort F.
van Hamme, Hugo
[J]. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2015, 23 (11) : 1788 - 1799
[2] SPEECH SEGMENT CLUSTERING FOR REAL-TIME EXEMPLAR-BASED SPEECH ENHANCEMENT
Nesbitt, David
Crookes, Danny
Ming, Ji
[J]. 2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2018, : 5419 - 5423
[3] EXEMPLAR-BASED SPEECH ENHANCEMENT FOR DEEP NEURAL NETWORK BASED AUTOMATIC SPEECH RECOGNITION
Baby, Deepak
Gemmeke, Jort F.
Virtanen, Tuomas
Van hamme, Hugo
[J]. 2015 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP), 2015, : 4485 - 4489
[4] Exemplar-based speech waveform generation
Watts, Oliver
Valentini-Botinhao, Cassia
Espic, Felipe
King, Simon
[J]. 19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES, 2018, : 2022 - 2026
[5] Exemplar-Based Processing for Speech Recognition
Sainath, Tara N.
Ramabhadran, Bhuvana
Nahamoo, David
Kanevsky, Dimitri
Van Compernolle, Dirk
Demuynck, Kris
Gemmeke, Jort Florent
Bellegarda, Jerome R.
Sundaram, Shiva
[J]. IEEE SIGNAL PROCESSING MAGAZINE, 2012, 29 (06) : 98 - 113
[6] Exemplar-Based Emotive Speech Synthesis
Wu, Xixin
Cao, Yuewen
Lu, Hui
Liu, Songxiang
Kang, Shiyin
Wu, Zhiyong
Liu, Xunying
Meng, Helen
[J]. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2021, 29 : 874 - 886
[7] Dictionary optimization and clustering for exemplar-based voice conversion
Sun, Wei
Yu, Yibiao
[J]. FIFTH INTERNATIONAL WORKSHOP ON PATTERN RECOGNITION, 2020, 11526
[8] SEMI-SUPERVISED NOISE DICTIONARY ADAPTATION FOR EXEMPLAR-BASED NOISE ROBUST SPEECH RECOGNITION
Luan, Yi
Saito, Daisuke
Kashiwagi, Yosuke
Minematsu, Nobuaki
Hirose, Keikichi
[J]. 2014 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2014,
[9] ROBUST INTERNAL EXEMPLAR-BASED IMAGE ENHANCEMENT
Xian, Yang
Tian, Yingli
[J]. 2015 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2015, : 2379 - 2383
[10] Estimating Uncertainty to Improve Exemplar-Based Feature Enhancement for Noise Robust Speech Recognition
Kallasjoki, Heikki
Gemmeke, Jort F.
Palomaki, Kalle J.
[J]. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2014, 22 (02) : 368 - 380

← 1 2 3 4 5 →