Plastic multi-resolution auditory model based neural network for speech enhancement

被引:0
|
作者
Lai, Chen-Yen [1 ]
Lo, Yu-Wen [1 ]
Shen, Yih-Liang [1 ]
Chi, Tai-Shih [1 ]
机构
[1] Natl Chiao Tung Univ, Dept Elect & Comp Engn, Hsinchu 300, Taiwan
关键词
MODULATION;
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
In this paper, we propose a plastic auditory model based neural network for speech enhancement. The proposed system integrates a spectro-temporal analytical auditory model with a multi-layer fully-connected network to form a quasi-CNN structure. The initial kernels of the convolutional layer are derived from the neuro-physiological auditory model. To simulate the plasticity of cortical neurons for attentional hearing, the kernels are allowed to adjust themselves pertaining to the task at hand. For the application of speech enhancement, the Fourier spectrogram instead of the auditory spectrogram is used as input to the proposed neural network such that the cleaned speech signal can be well reconstructed. The proposed system performs comparably with standard DNN and CNN systems when plenty resources are available. Meanwhile, under the limited-resource condition, the proposed system outperforms standard systems in all test settings.
引用
收藏
页码:605 / 609
页数:5
相关论文
共 50 条
  • [1] Multi-resolution auditory cepstral coefficient and adaptive mask for speech enhancement with deep neural network
    Ruwei Li
    Xiaoyue Sun
    Yanan Liu
    Dengcai Yang
    Liang Dong
    [J]. EURASIP Journal on Advances in Signal Processing, 2019
  • [2] Multi-resolution auditory cepstral coefficient and adaptive mask for speech enhancement with deep neural network
    Li, Ruwei
    Sun, Xiaoyue
    Liu, Yanan
    Yang, Dengcai
    Dong, Liang
    [J]. EURASIP JOURNAL ON ADVANCES IN SIGNAL PROCESSING, 2019, 2019 (1)
  • [3] A Multi-Resolution Approach to GAN-Based Speech Enhancement
    Kim, Hyung Yong
    Yoon, Ji Won
    Cheon, Sung Jun
    Kang, Woo Hyun
    Kim, Nam Soo
    [J]. APPLIED SCIENCES-BASEL, 2021, 11 (02): : 1 - 15
  • [4] Multi-Resolution Spectral Input for Convolutional Neural Network-Based Speech Recognition
    Toth, Laszlo
    [J]. 2017 INTERNATIONAL CONFERENCE ON SPEECH TECHNOLOGY AND HUMAN-COMPUTER DIALOGUE (SPED), 2017,
  • [5] ACOUSTIC MODELING OF SPEECH WAVEFORM BASED ON MULTI-RESOLUTION, NEURAL NETWORK SIGNAL PROCESSING
    Tueske, Zoltan
    Schlueter, Ralf
    Ney, Hermann
    [J]. 2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2018, : 4859 - 4863
  • [6] Speech signal enhancement based on adaptive multi-resolution form of SVD
    Lu Yanhong
    Qin Xiaohong
    [J]. CISP 2008: FIRST INTERNATIONAL CONGRESS ON IMAGE AND SIGNAL PROCESSING, VOL 2, PROCEEDINGS, 2008, : 137 - 140
  • [7] Research for multi-resolution wavelet neural network
    Han, FQ
    Gao, YH
    Ma, L
    Li, YH
    Li, JP
    [J]. Wavelet Analysis and Active Media Technology Vols 1-3, 2005, : 1095 - 1100
  • [8] On the training of a multi-resolution CMAC neural network
    Menozzi, A
    Chow, MY
    [J]. IECON '97 - PROCEEDINGS OF THE 23RD INTERNATIONAL CONFERENCE ON INDUSTRIAL ELECTRONICS, CONTROL, AND INSTRUMENTATION, VOLS. 1-4, 1997, : 1130 - 1135
  • [9] On the training of a multi-resolution CMAC neural network
    Menozzi, A
    Chow, MY
    [J]. ISIE '97 - PROCEEDINGS OF THE IEEE INTERNATIONAL SYMPOSIUM ON INDUSTRIAL ELECTRONICS, VOLS 1-3, 1997, : 1201 - 1205
  • [10] A multi-resolution envelope-power based model for speech intelligibility
    Jorgensen, Soren
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2013, 134 (01): : 436 - 446