Research on Speech Enhancement Algorithm of Multiresolution Cochleagram Based on Skip Connection Deep Neural Network

被引:4
|
作者
Lan, Chaofeng [1 ]
Wang, YuQiao [1 ]
Zhang, Lei [2 ]
Liu, Chundong [1 ]
Lin, Xiaojia [1 ]
机构
[1] Harbin Univ Sci & Technol, Coll Measurement & Commun Engn, Harbin 150080, Peoples R China
[2] Beidahuang Ind Grp Gen Hosp, Harbin 150088, Peoples R China
关键词
MASK;
D O I
10.1155/2022/5208372
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
The speech enhancement effect of traditional deep learning algorithms is not ideal under low signal-to-noise ratios (SNR). Skip connections-deep neural network (Skip-DNN) improves the traditional deep neural network (DNN) by adding skip connections between each layer of the neural network to solve the degradation problem of DNN. In this paper, the Multiresolution Cochleagram (MRCG) features in the gammachirp transform domain are denoised to obtain the improved MRCG (I-MRCG). The noise reduction method adopts the Minimum Mean-Square Error Short-Time Spectral Amplitude Estimator (MMSE-STSA) and takes I-MRCG as the input feature and Skip-DNN as the training network to improve the speech enhancement effect of the model. This paper also proposes an improved source-to-distortion ratio (SDR) loss function. When the loss function uses the improved SDR, it will improve the performance of Skip-DNN speech enhancement model. The experiments in this paper are performed on the Edinburgh dataset. When using I-MRCG as the input feature of Skip-DNN, the average perceptual evaluation of speech quality (PESQ) is 2.9137, and the average short-time objective intelligibility (STOI) is 0.8515. Compared with MRCG as Skip-DNN input features, the improvements are 0.91% and 0.71%, respectively. When the improved SDR is used as the loss function of the speech model, the average PESQ is 2.9699 and the average STOI is 0.8547. Compared with other loss functions, the improved SDR has a better enhancement effect when used as the loss function of the speech enhancement model.
引用
收藏
页数:15
相关论文
共 50 条
  • [41] A novel skip connection mechanism based on channel-wise cross transformer for speech enhancement
    Jiang, Weiqi
    Sun, Chengli
    Chen, Feilong
    Leng, Yan
    Guo, Qiaosheng
    MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 83 (12) : 34849 - 34866
  • [42] A novel skip connection mechanism based on channel-wise cross transformer for speech enhancement
    Weiqi Jiang
    Chengli Sun
    Feilong Chen
    Yan Leng
    Qiaosheng Guo
    Multimedia Tools and Applications, 2024, 83 : 34849 - 34866
  • [43] Speech Enhancement for Optical Laser Microphone With Deep Neural Network
    Cai, Chengkai
    Iwai, Kenta
    Nishiura, Takanobu
    Yamashita, Yoichi
    2020 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2020, : 449 - 454
  • [44] Speech Enhancement Algorithm Based on a Convolutional Neural Network Reconstruction of the Temporal Envelope of Speech in Noisy Environments
    Soleymanpour, Rahim
    Soleymanpour, Mohammad
    Brammer, Anthony J.
    Johnson, Michael T.
    Kim, Insoo
    IEEE ACCESS, 2023, 11 : 5328 - 5336
  • [45] Deep neural network-based linear predictive parameter estimations for speech enhancement
    Li, Yaxing
    Kang, Sangwon
    IET SIGNAL PROCESSING, 2017, 11 (04) : 469 - 476
  • [46] A Reduced Complexity MFCC-based Deep Neural Network Approach for Speech Enhancement
    Razani, Ryan
    Chung, Hanwook
    Attabi, Yazid
    Champagne, Benoit
    2017 IEEE INTERNATIONAL SYMPOSIUM ON SIGNAL PROCESSING AND INFORMATION TECHNOLOGY (ISSPIT), 2017, : 331 - 336
  • [47] Cross-language Transfer Learning for Deep Neural Network Based Speech Enhancement
    Xu, Yong
    Du, Jun
    Dai, Li-Rong
    Lee, Chin-Hui
    2014 9TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING (ISCSLP), 2014, : 336 - +
  • [48] DESCINet: A hierarchical deep convolutional neural network with skip connection for long time series forecasting
    Silva, Andre Quintiliano Bezerra
    Goncalves, Wesley Nunes
    Matsubara, Edson Takashi
    EXPERT SYSTEMS WITH APPLICATIONS, 2023, 228
  • [49] Skip connection information enhancement network for retinal vessel segmentation
    Liang, Jing
    Jiang, Yun
    Yan, Hao
    MEDICAL & BIOLOGICAL ENGINEERING & COMPUTING, 2024, 62 (10) : 3163 - 3178
  • [50] Dual channel neural network speech enhancement algorithm based on time frequency masking
    Jia, Hairong
    Mei, Shulin
    Zhang, Min
    Huazhong Keji Daxue Xuebao (Ziran Kexue Ban)/Journal of Huazhong University of Science and Technology (Natural Science Edition), 2021, 49 (06): : 43 - 49