DNN-Based Speech Enhancement via Integrating NMF and CASA

被引:0
|
作者
Yan, Bofang [1 ]
Bao, Changchun [1 ]
Bai, Zhigang [1 ]
机构
[1] Beijing Univ Technol, Fac Informat Technol, Speech & Audio Signal Proc Lab, Beijing, Peoples R China
基金
中国国家自然科学基金;
关键词
Deep neural network; nonnegative matrix factorization; computational auditory scene analysis; Wiener filter; speech enhancement; MONAURAL SPEECH;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we come up with a novel speech enhancement method, which integrates nonnegative matrix factorization (NMF) and computational auditory scene analysis (CASA) using deep neural network (DNN). Firstly, we can obtain the basis matrices of speech and noise respectively via NMF and get the ideal ratio mask (IRM) that is based on CASA by using deep neural network. Then, a linear minimum mean square error (LMMSE) filter in fast Fourier transform (FFT) domain is constructed and transformed to the Gammatone domain. Finally, an integrated Wiener-like filter is obtained by combining the filter of NMF with the mask of CASA. By comparing with NMF and CASA methods, the experiments present the superiority of the proposed method.
引用
收藏
页码:435 / 439
页数:5
相关论文
共 50 条
  • [1] A NEW COST FUNCTION FOR DNN-BASED SPEECH ENHANCEMENT COMBINING NMF AND CASA
    Yan, Bofang
    Bao, Changchun
    Bai, Zhigang
    [J]. PROCEEDINGS OF 2018 14TH IEEE INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING (ICSP), 2018, : 255 - 259
  • [2] Boosting DNN-Based Speech Enhancement via Explicit Transformations
    Wang, Qing
    Du, Jun
    Dai, Li-Rong
    [J]. 2016 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA), 2016,
  • [3] DNN-BASED ENHANCEMENT OF NOISY AND REVERBERANT SPEECH
    Zhao, Yan
    Wang, DeLiang
    Merks, Ivo
    Zhang, Tao
    [J]. 2016 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING PROCEEDINGS, 2016, : 6525 - 6529
  • [4] DNN-BASED SPEECH ENHANCEMENT USING MBE MODEL
    Huang, Qizheng
    Bao, Changchun
    Wang, Xianyun
    Xiang, Yang
    [J]. 2018 16TH INTERNATIONAL WORKSHOP ON ACOUSTIC SIGNAL ENHANCEMENT (IWAENC), 2018, : 196 - 200
  • [5] DNN-Based Cepstral Excitation Manipulation for Speech Enhancement
    Elshamy, Samy
    Fingscheidt, Tim
    [J]. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2019, 27 (11) : 1803 - 1814
  • [6] DNN-Based Linear Prediction Residual Enhancement for Speech Dereverberation
    Feng, Xinyang
    Li, Nuo
    He, Zunwen
    Zhang, Yan
    Zhang, Wancheng
    [J]. 2021 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2021, : 541 - 545
  • [7] DNN-Based Calibrated-Filter Models for Speech Enhancement
    Yazid Attabi
    Benoit Champagne
    Wei-Ping Zhu
    [J]. Circuits, Systems, and Signal Processing, 2021, 40 : 2926 - 2949
  • [8] DNN-BASED AR-WIENER FILTERING FOR SPEECH ENHANCEMENT
    Yang, Yan
    Bao, Changchun
    [J]. 2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2018, : 2901 - 2905
  • [9] DNN-Based Calibrated-Filter Models for Speech Enhancement
    Attabi, Yazid
    Champagne, Benoit
    Zhu, Wei-Ping
    [J]. CIRCUITS SYSTEMS AND SIGNAL PROCESSING, 2021, 40 (06) : 2926 - 2949
  • [10] A Survey on Low-Latency DNN-Based Speech Enhancement
    Drgas, Szymon
    [J]. SENSORS, 2023, 23 (03)