Denoising Speech Based on Deep Learning and Wavelet Decomposition

被引：9

作者：

Wang, Li ^{[1
]}

Zheng, Weiguang ^{[2
]}

Ma, Xiaojun ^{[3
]}

Lin, Shiming ^{[4
,5
]}

机构：

[1] Hubei Univ Arts & Sci, Coll Chinese Literature & Media, Xiangyang 441000, Peoples R China

[2] Guilin Univ Elect Technol, Sch Mech & Elect Engn, Guilin 541004, Peoples R China

[3] Qinghai GLI Technol Ltd, Xining 810001, Peoples R China

[4] Xiamen Univ, Sch Informat, Natl Demonstrat Software Sch, Xiamen 361005, Peoples R China

[5] Changji Univ, Dept Comp Engn, Changji 831100, Peoples R China

来源：

SCIENTIFIC PROGRAMMING | 2021年 / 2021卷

关键词：

ENHANCEMENT; NOISE;

D O I：

10.1155/2021/8677043

中图分类号：

TP31 [计算机软件];

学科分类号：

081202 ; 0835 ;

摘要：

The work proposed a denoising speech method using deep learning. The predictor and target network signals were the amplitude spectra of the wavelet-decomposition vectors of the noisy audio signal and clean audio signal, respectively. The output of the network was the amplitude spectrum of the denoised signal. Besides, the regression network used the input of the predictor to minimize the mean square error between its output and input targets. The denoised wavelet-decomposition vector was transformed back to the time domain by the output amplitude spectrum and the phase of the wavelet-decomposition vector. Then, the denoised speech was obtained by the inverse wavelet transform. This method overcame the problem that the frequency and time resolution of the short-time Fourier transform could not be adjusted. The noise reduction effect in each frequency band was improved due to the gradual reduction of the noise energy in the wavelet-decomposition process. The experimental results showed that the method has a good denoising effect in the whole frequency band.

引用

页数：10

共 50 条

[41] Fringe pattern denoising based on deep learning
Yan, Ketao
Yu, Yingjie
Huang, Chongtian
Sui, Liansheng
Qian, Kemao
Asundi, Anand
[J]. OPTICS COMMUNICATIONS, 2019, 437 : 148 - 152
[42] Overview of Image Denoising Based on Deep Learning
Liu, Baozhong
Liu, Jianbin
[J]. 2018 INTERNATIONAL SEMINAR ON COMPUTER SCIENCE AND ENGINEERING TECHNOLOGY (SCSET 2018), 2019, 1176
[43] EEG decomposition and denoising using wavelet transform
Zhou, WD
Hao, XW
[J]. IEEE-EMBS ASIA PACIFIC CONFERENCE ON BIOMEDICAL ENGINEERING - PROCEEDINGS, PTS 1 & 2, 2000, : 638 - 639
[44] Deep Denoising Autoencoder Based Post Filtering for Speech Enhancement
Zezario, Ryandhimas E.
Huang, Jen-Wei
Lu, Xugang
Tsao, Yu
Hwang, Hsin-Te
Wang, Hsin-Min
[J]. 2018 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2018, : 373 - 377
[45] Multiscale collaborative speech denoising based on deep stacking network
Jiang, Wei
Zheng, Hao
Nie, Shuai
Liu, Wenju
[J]. 2015 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2015,
[46] Unsupervised Speech Denoising Method based on Deep Neural Network
Chen, Xiaohan
[J]. 2018 11TH INTERNATIONAL SYMPOSIUM ON COMPUTATIONAL INTELLIGENCE AND DESIGN (ISCID), VOL 2, 2018, : 254 - 258
[47] Deep Learning-Based Empirical and Sub-Space Decomposition for Speech Enhancement
Mraihi, Khaoula
Ben Messaoud, Mohamed Anouar
[J]. CIRCUITS SYSTEMS AND SIGNAL PROCESSING, 2024, 43 (06) : 3596 - 3626
[48] Can Denoising Enhance Prediction Accuracy of Learning Models? A Case of Wavelet Decomposition Approach
Tamilselvi, C.
Yeasin, Md
Paul, Ranjit Kumar
Paul, Amrit Kumar
[J]. FORECASTING, 2024, 6 (01): : 81 - 99
[49] Deep learning for denoising
Yu, Siwei
Ma, Jianwei
Wang, Wenlong
[J]. GEOPHYSICS, 2019, 84 (06) : V333 - V350
[50] A modified speech denoising algorithm based on the continuous wavelet transformer and wiener filter
Saoud, S.
Bennasr, M.
Cherif, A.
[J]. INNOVATIVE AND INTELLIGENT TECHNOLOGY-BASED SERVICES FOR SMART ENVIRONMENTS-SMART SENSING AND ARTIFICIAL INTELLIGENCE, 2021, : 119 - 126

← 1 2 3 4 5 →