SPEECH SEGMENT CLUSTERING FOR REAL-TIME EXEMPLAR-BASED SPEECH ENHANCEMENT

被引：0

作者：

Nesbitt, David ^{[1
]}

Crookes, Danny ^{[1
]}

Ming, Ji ^{[1
]}

机构：

[1] Queens Univ Belfast, Sch Elect Elect Engn & Comp Sci, Belfast BT7 1NN, Antrim, North Ireland

来源：

2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP) | 2018年

关键词：

speech enhancement; exemplar-based; real-time; embedded; clustering; ALGORITHM; NOISE;

D O I：

暂无

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

Exemplar-based (or Corpus-based) speech enhancement algorithms have great potential but are typically slow due to needing to search through the entire corpus. The properties of speech can be exploited to improve these algorithms. Firstly, a corpus can be clustered by a phonetic ordering into a search tree which can be used to find a best matching segment. This dramatically reduces the search space, reducing the time complexity of searching a corpus of n segments from O(n) to O(log(n)) . Secondly, clustering can be used to give a lossy compression of a speech corpus by replacing original segments with codewords. These techniques are shown in comparison with sequential search and non-compressed corpora using a simple speech enhancement algorithm. A combination of these techniques for a corpus of a quarter of WSJO results in a speedup of approximately 3000x.

引用

页码：5419 / 5423

页数：5

共 50 条

[1] Coupled Dictionaries for Exemplar-Based Speech Enhancement and Automatic Speech Recognition
Baby, Deepak
Virtanen, Tuomas
Gemmeke, Jort F.
van Hamme, Hugo
[J]. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2015, 23 (11) : 1788 - 1799
[2] COUPLED DICTIONARY TRAINING FOR EXEMPLAR-BASED SPEECH ENHANCEMENT
Baby, Deepak
Virtanen, Tuomas
Barker, Tom
Van Hamme, Hugo
[J]. 2014 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2014,
[3] EXEMPLAR-BASED SPEECH ENHANCEMENT FOR DEEP NEURAL NETWORK BASED AUTOMATIC SPEECH RECOGNITION
Baby, Deepak
Gemmeke, Jort F.
Virtanen, Tuomas
Van hamme, Hugo
[J]. 2015 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP), 2015, : 4485 - 4489
[4] Exemplar-based speech waveform generation
Watts, Oliver
Valentini-Botinhao, Cassia
Espic, Felipe
King, Simon
[J]. 19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES, 2018, : 2022 - 2026
[5] Exemplar-Based Processing for Speech Recognition
Sainath, Tara N.
Ramabhadran, Bhuvana
Nahamoo, David
Kanevsky, Dimitri
Van Compernolle, Dirk
Demuynck, Kris
Gemmeke, Jort Florent
Bellegarda, Jerome R.
Sundaram, Shiva
[J]. IEEE SIGNAL PROCESSING MAGAZINE, 2012, 29 (06) : 98 - 113
[6] Exemplar-Based Emotive Speech Synthesis
Wu, Xixin
Cao, Yuewen
Lu, Hui
Liu, Songxiang
Kang, Shiyin
Wu, Zhiyong
Liu, Xunying
Meng, Helen
[J]. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2021, 29 : 874 - 886
[7] EMBEDDING TIME WARPING IN EXEMPLAR-BASED SPARSE REPRESENTATIONS OF SPEECH
Yilmaz, Emre
Gemmeke, Jort F.
Van Hamme, Hugo
[J]. 2013 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2013, : 8076 - 8080
[8] Real-Time Exemplar-Based Face Sketch Synthesis
Song, Yibing
Bao, Linchao
Yang, Qingxiong
Yang, Ming-Hsuan
[J]. COMPUTER VISION - ECCV 2014, PT VI, 2014, 8694 : 800 - 813
[9] Enhancing Exemplar-Based Posteriors for Speech Recognition Tasks
Sainath, Tara N.
Nahamoo, David
Kanevsky, Dimitri
Ramabhadran, Bhuvana
[J]. 13TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2012 (INTERSPEECH 2012), VOLS 1-3, 2012, : 2127 - 2130
[10] Estimating Uncertainty to Improve Exemplar-Based Feature Enhancement for Noise Robust Speech Recognition
Kallasjoki, Heikki
Gemmeke, Jort F.
Palomaki, Kalle J.
[J]. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2014, 22 (02) : 368 - 380

← 1 2 3 4 5 →