Denoising Speech Based on Deep Learning and Wavelet Decomposition

被引:9
|
作者
Wang, Li [1 ]
Zheng, Weiguang [2 ]
Ma, Xiaojun [3 ]
Lin, Shiming [4 ,5 ]
机构
[1] Hubei Univ Arts & Sci, Coll Chinese Literature & Media, Xiangyang 441000, Peoples R China
[2] Guilin Univ Elect Technol, Sch Mech & Elect Engn, Guilin 541004, Peoples R China
[3] Qinghai GLI Technol Ltd, Xining 810001, Peoples R China
[4] Xiamen Univ, Sch Informat, Natl Demonstrat Software Sch, Xiamen 361005, Peoples R China
[5] Changji Univ, Dept Comp Engn, Changji 831100, Peoples R China
关键词
ENHANCEMENT; NOISE;
D O I
10.1155/2021/8677043
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
The work proposed a denoising speech method using deep learning. The predictor and target network signals were the amplitude spectra of the wavelet-decomposition vectors of the noisy audio signal and clean audio signal, respectively. The output of the network was the amplitude spectrum of the denoised signal. Besides, the regression network used the input of the predictor to minimize the mean square error between its output and input targets. The denoised wavelet-decomposition vector was transformed back to the time domain by the output amplitude spectrum and the phase of the wavelet-decomposition vector. Then, the denoised speech was obtained by the inverse wavelet transform. This method overcame the problem that the frequency and time resolution of the short-time Fourier transform could not be adjusted. The noise reduction effect in each frequency band was improved due to the gradual reduction of the noise energy in the wavelet-decomposition process. The experimental results showed that the method has a good denoising effect in the whole frequency band.
引用
收藏
页数:10
相关论文
共 50 条
  • [41] Fringe pattern denoising based on deep learning
    Yan, Ketao
    Yu, Yingjie
    Huang, Chongtian
    Sui, Liansheng
    Qian, Kemao
    Asundi, Anand
    [J]. OPTICS COMMUNICATIONS, 2019, 437 : 148 - 152
  • [42] Overview of Image Denoising Based on Deep Learning
    Liu, Baozhong
    Liu, Jianbin
    [J]. 2018 INTERNATIONAL SEMINAR ON COMPUTER SCIENCE AND ENGINEERING TECHNOLOGY (SCSET 2018), 2019, 1176
  • [43] EEG decomposition and denoising using wavelet transform
    Zhou, WD
    Hao, XW
    [J]. IEEE-EMBS ASIA PACIFIC CONFERENCE ON BIOMEDICAL ENGINEERING - PROCEEDINGS, PTS 1 & 2, 2000, : 638 - 639
  • [44] Deep Denoising Autoencoder Based Post Filtering for Speech Enhancement
    Zezario, Ryandhimas E.
    Huang, Jen-Wei
    Lu, Xugang
    Tsao, Yu
    Hwang, Hsin-Te
    Wang, Hsin-Min
    [J]. 2018 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2018, : 373 - 377
  • [45] Multiscale collaborative speech denoising based on deep stacking network
    Jiang, Wei
    Zheng, Hao
    Nie, Shuai
    Liu, Wenju
    [J]. 2015 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2015,
  • [46] Unsupervised Speech Denoising Method based on Deep Neural Network
    Chen, Xiaohan
    [J]. 2018 11TH INTERNATIONAL SYMPOSIUM ON COMPUTATIONAL INTELLIGENCE AND DESIGN (ISCID), VOL 2, 2018, : 254 - 258
  • [47] Deep Learning-Based Empirical and Sub-Space Decomposition for Speech Enhancement
    Mraihi, Khaoula
    Ben Messaoud, Mohamed Anouar
    [J]. CIRCUITS SYSTEMS AND SIGNAL PROCESSING, 2024, 43 (06) : 3596 - 3626
  • [48] Can Denoising Enhance Prediction Accuracy of Learning Models? A Case of Wavelet Decomposition Approach
    Tamilselvi, C.
    Yeasin, Md
    Paul, Ranjit Kumar
    Paul, Amrit Kumar
    [J]. FORECASTING, 2024, 6 (01): : 81 - 99
  • [49] Deep learning for denoising
    Yu, Siwei
    Ma, Jianwei
    Wang, Wenlong
    [J]. GEOPHYSICS, 2019, 84 (06) : V333 - V350
  • [50] A modified speech denoising algorithm based on the continuous wavelet transformer and wiener filter
    Saoud, S.
    Bennasr, M.
    Cherif, A.
    [J]. INNOVATIVE AND INTELLIGENT TECHNOLOGY-BASED SERVICES FOR SMART ENVIRONMENTS-SMART SENSING AND ARTIFICIAL INTELLIGENCE, 2021, : 119 - 126