DENSELY CONNECTED NETWORK WITH TIME-FREQUENCY DILATED CONVOLUTION FOR SPEECH ENHANCEMENT

被引:0
|
作者
Li, Yaxing [1 ]
Li, Xiaoqi [1 ]
Dong, Yuanjie [1 ]
Li, Meng [1 ]
Xu, Shan [1 ]
Xiong, Shengwu [1 ]
机构
[1] Wuhan Univ Technol, Sch Comp Sci & Technol, Wuhan, Hubei, Peoples R China
关键词
Dense connectivity; dilated convolution; speech enhancement;
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
The data driven speech enhancement approaches using regression-based deep neural network usually result in enormous number of model parameters, which increase the computational load and the difficulty of model training. In order to improve the model efficiency, we propose a densely connected network with time-frequency (T-F) dilated convolution for speech enhancement. The T-F dilated convolution block is designed to enlarge the receptive field and capture the contextual information in both temporal and frequency domains. Considering the computational efficiency, the 1-D convolution with the bottleneck structure is exploited in the T-F convolution block. Each T-F convolution block is then densely connected to ensure maximum information flow between layers and alleviate the vanishing gradient problem of the network. The experimental results reveal that the proposed scheme not only improves the computational efficiency significantly but also produces satisfactory enhancement performance comparing the competing methods.
引用
收藏
页码:6860 / 6864
页数:5
相关论文
共 50 条
  • [21] SAR image despeckling using a dilated densely connected network
    Gui, Yunchuan
    Xue, Lei
    Li, Xiuhe
    [J]. REMOTE SENSING LETTERS, 2018, 9 (09) : 857 - 866
  • [22] Joint Time-Frequency Segmentation Algorithm for Transient Speech Decomposition and Speech Enhancement
    Tantibundhit, Charturong
    Pernkopf, Franz
    Kubin, Gernot
    [J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2010, 18 (06): : 1417 - 1428
  • [23] Time-Frequency Mask-based Speech Enhancement using Convolutional Generative Adversarial Network
    Shah, Neil
    Patil, Hemant A.
    Soni, Meet H.
    [J]. 2018 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2018, : 1246 - 1251
  • [24] Speech endpoint detection based on speech time-frequency enhancement and spectral entropy
    Fan Yingle
    Li Yi
    Wu Chuanyan
    [J]. 2005 27TH ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY, VOLS 1-7, 2005, : 4682 - 4684
  • [25] Fusion-Net: Time-Frequency Information Fusion Y-Network for Speech Enhancement
    Nareddula, Santhan Kumar Reddy
    Gorthi, Subrahmanyam
    Gorthi, Rama Krishna Sai S.
    [J]. INTERSPEECH 2021, 2021, : 3360 - 3364
  • [26] Densely-connected Convolutional Recurrent Network for Fundamental Frequency Estimation in Noisy Speech
    Zhang, Yixuan
    Wang, Heming
    Wang, DeLiang
    [J]. INTERSPEECH 2022, 2022, : 401 - 405
  • [27] Wavelet-Based Speech Enhancement Using Time-Frequency Adaptation
    Wang, Kun-Ching
    [J]. EURASIP JOURNAL ON ADVANCES IN SIGNAL PROCESSING, 2009,
  • [28] A time-frequency fusion model for multi-channel speech enhancement
    Zeng, Xiao
    Xu, Shiyun
    Wang, Mingjiang
    [J]. EURASIP JOURNAL ON AUDIO SPEECH AND MUSIC PROCESSING, 2024, 2024 (01):
  • [29] Time-frequency masking based supervised speech enhancement framework using fuzzy deep belief network
    Samui, Suman
    Chakrabarti, Indrajit
    Ghosh, Soumya K.
    [J]. APPLIED SOFT COMPUTING, 2019, 74 : 583 - 602
  • [30] Wavelet-Based Speech Enhancement Using Time-Frequency Adaptation
    Kun-Ching Wang
    [J]. EURASIP Journal on Advances in Signal Processing, 2009