DNN-Based Low-Musical-Noise Single-Channel Speech Enhancement Based on Higher-Order-Moments Matching

被引:0
|
作者
Mizoguchi, Satoshi [1 ]
Saito, Yuki [1 ]
Takamichi, Shinnosuke [1 ]
Saruwatari, Hiroshi [1 ]
机构
[1] Univ Tokyo, Tokyo 1138656, Japan
关键词
speech enhancement; musical noise; kurtosis; moment matching; deep learning;
D O I
10.1587/transinf.2021EDP7041
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
We propose deep neural network (DNN)-based speech enhancement that reduces musical noise and achieves better auditory impressions. The musical noise is an artifact generated by nonlinear signal processing and negatively affects the auditory impressions. We aim to develop musical-noise-free speech enhancement methods that suppress the musical noise generation and produce perceptually-comfortable enhanced speech. DNN-based speech enhancement using a soft mask achieves high noise reduction but generates musical noise in non-speech regions. Therefore, first, we define kurtosis matching for DNN-based low-musical-noise speech enhancement. Kurtosis is the fourth-order moment and is known to correlate with the amount of musical noise. The kurtosis matching is a penalty term of the DNN training and works to reduce the amount of musical noise. We further extend this scheme to standardized-moment matching. The extended scheme involves using moments whose orders are higher than kurtosis and generalizes the conventional musical-noise-free method based on kurtosis matching. We formulate standardized-moment matching and explore how effectively the higher-order moments reduce the amount of musical noise. Experimental evaluation results 1) demonstrate that kurtosis matching can reduce musical noise without negatively affecting noise suppression and 2) newly reveal that the sixth-moment matching also achieves low-musical-noise speech enhancement as well as kurtosis matching.
引用
收藏
页码:1971 / 1980
页数:10
相关论文
共 50 条
  • [1] Dual-channel DNN-based Speech Enhancement for Smartphones
    Martin-Donas, Juan M.
    Gomez, Angel M.
    Lopez-Espejo, Ivan
    Peinado, Antonio M.
    [J]. 2017 IEEE 19TH INTERNATIONAL WORKSHOP ON MULTIMEDIA SIGNAL PROCESSING (MMSP), 2017,
  • [2] A Survey on Low-Latency DNN-Based Speech Enhancement
    Drgas, Szymon
    [J]. SENSORS, 2023, 23 (03)
  • [3] An Adaptation Method in Noise Mismatch Conditions for DNN-based Speech Enhancement
    Xu Si-Ying
    Niu Tong
    Qu Dan
    Long Xing-Yan
    [J]. KSII TRANSACTIONS ON INTERNET AND INFORMATION SYSTEMS, 2018, 12 (10): : 4930 - 4951
  • [4] A Multi-Task Scheme for Supervised DNN-Based Single-Channel Speech Enhancement by Using Speech Presence Probability as the Secondary Training Target
    Wang, Lei
    Zhu, Jie
    Sun, Kangbo
    [J]. IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2021, E104D (11) : 1963 - 1970
  • [5] DNN TRAINING BASED ON CLASSIC GAIN FUNCTION FOR SINGLE-CHANNEL SPEECH ENHANCEMENT AND RECOGNITION
    Tu, Yan-Hui
    Du, Jun
    Lee, Chin-Hui
    [J]. 2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 910 - 914
  • [6] DNN BASED MULTIFRAME SINGLE-CHANNEL NOISE REDUCTION FILTERS
    Pan, Ningning
    Chen, Jingdong
    Benesty, Jacob
    [J]. 2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 8782 - 8786
  • [7] NOISE ROBUST EXEMPLAR MATCHING WITH COUPLED DICTIONARIES FOR SINGLE-CHANNEL SPEECH ENHANCEMENT
    Yilmaz, Emre
    Baby, Deepak
    Van Hamme, Hugo
    [J]. 2015 23RD EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2015, : 874 - 878
  • [8] DNN-Based Speech Enhancement Using Soft Audible Noise Masking for Wind Noise Reduction
    Haichuan Bai
    Fengpei Ge
    Yonghong Yan
    [J]. China Communications, 2018, 15 (09) : 235 - 243
  • [9] DNN-Based Speech Enhancement Using Soft Audible Noise Masking for Wind Noise Reduction
    Bai, Haichuan
    Ge, Fengpei
    Yan, Yonghong
    [J]. CHINA COMMUNICATIONS, 2018, 15 (09) : 235 - 243
  • [10] Concatenated Identical DNN (CI-DNN) to Reduce Noise-Type Dependence in DNN-Based Speech Enhancement
    Xu, Ziyi
    Strake, Maximilian
    Fingscheidt, Tim
    [J]. 2019 27TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2019,