Utilizing Neural Network and Critical Band Processing for Speech Enhancement

被引:0
|
作者
Yong, Pei Chee [1 ]
Chan, Kit Yan [1 ]
Nordholm, Sven [1 ]
机构
[1] Curtin Univ, Dept Elect & Comp Engn, Kent St, Bentley, WA 6102, Australia
关键词
NOISE;
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
In speech enhancement, the optimal minimum mean square error (MMSE) short-time spectral amplitude estimator requires knowledge about the probability density functions of speech and noise in the short-time Fourier transform domain, for every signal-to-noise-ratio (SNR). However, both of these quantities are unknown and are usually non-stationary in real-world scenario. To tackle this problem, this paper proposes a speech enhancement approach based on a set of Neural Networks of which each Neural Network is developed particularly for a critical band and a predefined SNR. In this speech enhancement approach, the Neural Network simulates a set of gain functions which attempts to match human hearing and optimises a particular SNR. The Neural Networks are trained using one speech signal contaminated with pink noise. The trained Neural Networks are evaluated using a test set consisting of 28 noisy speech signals. The speech enhancement results are compared to a state of the art MMSE based speech enhancement technique in terms of four speech quality metrics namely noise reduction ratio (NRR), intelligibility frequency weighted segmental SNR (IFWSNRseg), perceptual evaluation of speech quality (PESO) and short-time objective intelligibility (STOI). Through the evaluation, the effectiveness of the Neural Networks can be observed.
引用
收藏
页码:1300 / 1303
页数:4
相关论文
共 50 条
  • [1] The Application of Deep Neural Network in Speech Enhancement Processing
    Chen Jian-ming
    Liang Zhi-cheng
    [J]. 2018 5TH INTERNATIONAL CONFERENCE ON INFORMATION SCIENCE AND CONTROL ENGINEERING (ICISCE 2018), 2018, : 1263 - 1266
  • [2] A FULLY CONVOLUTIONAL NEURAL NETWORK FOR COMPLEX SPECTROGRAM PROCESSING IN SPEECH ENHANCEMENT
    Ouyangi, Zhiheng
    Yu, Hongjiang
    Zhu, Wei-Ping
    Champagne, Benoit
    [J]. 2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 5756 - 5760
  • [3] Neural network filters for speech enhancement
    Swiss Federal Inst of Technology, Zurich, Switzerland
    [J]. IEEE Trans Speech Audio Process, 6 (433-438):
  • [4] Single Channel Speech Enhancement Utilizing Iterative Processing of Multi-Band Spectral Subtraction Algorithm
    Upadhyay, Navneet
    Karmakar, Abhijit
    [J]. 2012 2ND INTERNATIONAL CONFERENCE ON POWER, CONTROL AND EMBEDDED SYSTEMS (ICPCES 2012), 2012,
  • [5] A Fully Convolutional Neural Network for Speech Enhancement
    Park, Se Rim
    Lee, Jin Won
    [J]. 18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION, 2017, : 1993 - 1997
  • [6] An optimized convolutional neural network for speech enhancement
    Karthik A.
    Mazher Iqbal J.L.
    [J]. International Journal of Speech Technology, 2023, 26 (04) : 1117 - 1129
  • [7] RESIDUAL RECURRENT NEURAL NETWORK FOR SPEECH ENHANCEMENT
    Abdulbaqi, Jalal
    Gu, Yue
    Chen, Shuhong
    Marsic, Ivan
    [J]. 2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 6659 - 6663
  • [8] NEURAL-NETWORK FILTERS FOR SPEECH ENHANCEMENT
    KNECHT, WG
    SCHENKEL, ME
    MOSCHYTZ, GS
    [J]. IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 1995, 3 (06): : 433 - 438
  • [9] Speech Enhancement Using Wavelet Neural Network with Sub-Band Adaptive Matched Filter
    Yang, Dan
    Xu, Bin
    Ye, Linlin
    Wang, Xu
    [J]. MECHATRONICS AND INFORMATION TECHNOLOGY, PTS 1 AND 2, 2012, 2-3 : 127 - 130
  • [10] RECURRENT CONVOLUTIONAL NEURAL NETWORK FOR SPEECH PROCESSING
    Zhao, Yue
    Jin, Xingyu
    Hu, Xiaolin
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2017, : 5300 - 5304