A Speech Enhancement Method Based on Multi-Task Bayesian Compressive Sensing

被引:5
|
作者
You, Hanxu [1 ]
Ma, Zhixian [1 ]
Li, Wei [1 ]
Zhu, Jie [1 ]
机构
[1] Shanghai Jiao Tong Univ, Dept Elect Engn, Shanghai 200240, Peoples R China
来源
基金
中国国家自然科学基金;
关键词
speech enhancement; compressive sensing; overcomplete dictionary; sparse representation; SPARSE;
D O I
10.1587/transinf.2016EDP7350
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Traditional speech enhancement (SE) algorithms usually have fluctuant performance when they deal with different types of noisy speech signals. In this paper, we propose multi-task Bayesian compressive sensing based speech enhancement (MT-BCS-SE) algorithm to achieve not only comparable performance to but also more stable performance than traditional SE algorithms. MT-BCS-SE algorithm utilizes the dependence information among compressive sensing (CS) measurements and the sparsity of speech signals to perform SE. To obtain sufficient sparsity of speech signals, we adopt overcomplete dictionary to transform speech signals into sparse representations. K-SVD algorithm is employed to learn various overcomplete dictionaries. The influence of the overcomplete dictionary on MT-BCS-SE algorithm is evaluated through large numbers of experiments, so that the most suitable dictionary could be adopted by MT-BCS-SE algorithm for obtaining the best performance. Experiments were conducted on well-known NOIZEUS corpus to evaluate the performance of the proposed algorithm. In these cases of NOIZEUS corpus, MT-BCS-SE is shown that to be competitive or even superior to traditional SE algorithms, such as optimally-modified log-spectral amplitude (OMLSA), multi-band spectral subtraction (SSMul), and minimum mean square error (MMSE), in terms of signal-noise ratio (SNR), speech enhancement gain (SEG) and perceptual evaluation of speech quality (PESQ) and to have better stability than traditional SE algorithms.
引用
收藏
页码:556 / 563
页数:8
相关论文
共 50 条
  • [31] Full-Vectorial 3D Microwave Imaging of Sparse Scatterers through a Multi-Task Bayesian Compressive Sensing Approach
    Salucci, Marco
    Poli, Lorenzo
    Oliveri, Giacomo
    JOURNAL OF IMAGING, 2019, 5 (01)
  • [32] A Unified Accent Estimation Method Based on Multi-Task Learning for Japanese Text-to-Speech
    Park, Byeongseon
    Yamamoto, Ryuichi
    Tachibana, Kentaro
    Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, 2022, 2022-September : 1931 - 1935
  • [33] A Unified Accent Estimation Method Based on Multi-Task Learning for Japanese Text-to-Speech
    Park, Byeongseon
    Yamamoto, Ryuichi
    Tachibana, Kentaro
    INTERSPEECH 2022, 2022, : 1931 - 1935
  • [34] Continuous multi-task Bayesian Optimisation with correlation
    Pearce, Michael
    Branke, Juergen
    EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 2018, 270 (03) : 1074 - 1085
  • [35] Wideband Spectrum Sensing based on Collaborative Multi-Task Learning
    Zhang, Weishan
    Wang, Yue
    Yu, Fuxun
    Qin, Zhuwei
    Chen, Xiang
    Tian, Zhi
    2022 IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS WORKSHOPS (ICC WORKSHOPS), 2022, : 693 - 698
  • [36] Multi-Task Joint Learning for Embedding Aware Audio-Visual Speech Enhancement
    Wang, Chenxi
    Chen, Hang
    Du, Jun
    Yin, Baocai
    Pan, Jia
    2022 13TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING (ISCSLP), 2022, : 255 - 259
  • [37] Metadata-based Multi-Task Bandits with Bayesian Hierarchical Models
    Wan, Runzhe
    Ge, Lin
    Song, Rui
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
  • [38] Improvement of Wiener Filter based Speech Enhancement using Compressive Sensing
    Sulong, Amart
    Kadir, Kushsairy
    Gunawan, Teddy S.
    Khalifa, Othman O.
    2014 IEEE INTERNATIONAL CONFERENCE ON SMART INSTRUMENTATION, MEASUREMENT AND APPLICATIONS (ICSIMA), 2014,
  • [39] Speech Emotion Recognition with Multi-task Learning
    Cai, Xingyu
    Yuan, Jiahong
    Zheng, Renjie
    Huang, Liang
    Church, Kenneth
    INTERSPEECH 2021, 2021, : 4508 - 4512
  • [40] TASK AWARE MULTI-TASK LEARNING FOR SPEECH TO TEXT TASKS
    Indurthi, Sathish
    Zaidi, Mohd Abbas
    Lakumarapu, Nikhil Kumar
    Lee, Beomseok
    Han, Hyojung
    Ahn, Seokchan
    Kim, Sangha
    Kim, Chanwoo
    Hwang, Inchul
    2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 7723 - 7727