A Speech Enhancement Method Based on Multi-Task Bayesian Compressive Sensing

Cited by: 5

Authors:

You, Hanxu [1]

Ma, Zhixian [1]

Li, Wei [1]

Zhu, Jie [1]

Affiliations:

[1] Shanghai Jiao Tong Univ, Dept Elect Engn, Shanghai 200240, Peoples R China

Source:

IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS | 2017 / Vol. E100D / No. 03

Funding:

National Natural Science Foundation of China;

Keywords:

speech enhancement; compressive sensing; overcomplete dictionary; sparse representation; SPARSE;

DOI:

10.1587/transinf.2016EDP7350

Chinese Library Classification (CLC) number:

TP [Automation Technology, Computer Technology];

Discipline classification code:

0812;

Abstract:

Traditional speech enhancement (SE) algorithms usually perform inconsistently across different types of noisy speech signals. In this paper, we propose a multi-task Bayesian compressive sensing based speech enhancement (MT-BCS-SE) algorithm that achieves performance comparable to, and more stable than, that of traditional SE algorithms. The MT-BCS-SE algorithm exploits the dependence information among compressive sensing (CS) measurements and the sparsity of speech signals to perform SE. To obtain sufficient sparsity, we adopt an overcomplete dictionary to transform speech signals into sparse representations. The K-SVD algorithm is employed to learn various overcomplete dictionaries. The influence of the overcomplete dictionary on the MT-BCS-SE algorithm is evaluated through a large number of experiments, so that the most suitable dictionary can be adopted to obtain the best performance. Experiments were conducted on the well-known NOIZEUS corpus to evaluate the performance of the proposed algorithm. On the NOIZEUS corpus, MT-BCS-SE is shown to be competitive with or even superior to traditional SE algorithms, such as optimally-modified log-spectral amplitude (OMLSA), multi-band spectral subtraction (SSMul), and minimum mean square error (MMSE), in terms of signal-to-noise ratio (SNR), speech enhancement gain (SEG), and perceptual evaluation of speech quality (PESQ), and to have better stability than traditional SE algorithms.
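
The idea of representing a signal sparsely over an overcomplete dictionary and scoring the result by SNR can be illustrated with a minimal sketch. It is not the MT-BCS-SE algorithm from the paper: orthogonal matching pursuit (OMP) stands in for the multi-task Bayesian solver, a fixed identity-plus-DCT dictionary stands in for a K-SVD-learned one, and the test signal is synthetic. The function names (`dct_dictionary`, `omp`, `snr_db`) and all parameter values are hypothetical choices made for illustration.

```python
import numpy as np


def dct_dictionary(n):
    """Return an n x 2n overcomplete dictionary [identity | orthonormal DCT-II atoms].

    Assumption: a fixed analytic dictionary, used here in place of one learned by K-SVD.
    """
    k = np.arange(n)[:, None]           # frequency index (rows)
    t = np.arange(n)[None, :]           # time index (columns)
    C = np.sqrt(2.0 / n) * np.cos(np.pi * (2 * t + 1) * k / (2 * n))
    C[0, :] = np.sqrt(1.0 / n)          # the DC row uses a different scale
    return np.hstack([np.eye(n), C.T])  # every column (atom) has unit norm


def omp(D, y, sparsity):
    """Orthogonal matching pursuit: greedy sparse coding of y over dictionary D.

    Assumption: OMP is a simple stand-in for the Bayesian CS solver in the paper.
    """
    residual = y.copy()
    support = []
    coef = np.zeros(0)
    for _ in range(sparsity):
        # Select the atom most correlated with the current residual.
        support.append(int(np.argmax(np.abs(D.T @ residual))))
        # Refit all selected atoms jointly by least squares.
        coef, *_ = np.linalg.lstsq(D[:, support], y, rcond=None)
        residual = y - D[:, support] @ coef
    alpha = np.zeros(D.shape[1])
    alpha[support] = coef
    return alpha


def snr_db(clean, estimate):
    """Signal-to-noise ratio in dB of an estimate relative to the clean signal."""
    return 10.0 * np.log10(np.sum(clean ** 2) / np.sum((clean - estimate) ** 2))


n = 64
D = dct_dictionary(n)

# Synthetic 3-sparse signal: one spike atom and two cosine atoms.
alpha_true = np.zeros(2 * n)
alpha_true[[5, n + 10, n + 40]] = [1.0, -0.7, 0.5]
clean = D @ alpha_true

# Noiseless case: OMP recovers the sparse code.
alpha_hat = omp(D, clean, 3)

# Noisy case: reconstruct from the sparse code and compare SNR before and after.
rng = np.random.default_rng(0)
noisy = clean + 0.05 * rng.standard_normal(n)
denoised = D @ omp(D, noisy, 3)
print("input SNR (dB):", snr_db(clean, noisy))
print("output SNR (dB):", snr_db(clean, denoised))
```

Keeping only a few dictionary atoms discards most of the noise energy, which is the intuition behind sparsity-based enhancement. Real speech frames are only approximately sparse, so the sparsity level and the dictionary would need to be chosen from data.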

Pages: 556-563

Page count: 8
