A wavelet network-based speech enhancement system using noisy-as-clean strategy

被引:0
|
作者
Hajiaghababa, Fatemeh [1 ]
Abutalebi, Hamid Reza [1 ]
机构
[1] Yazd Univ, Elect Engn Dept, Yazd, Iran
关键词
Speech enhancement; wavelet network; noisy-as-clean; noisy target training; NEURAL-NETWORKS;
D O I
10.1142/S0219691323500339
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
In recent years, the field of speech enhancement has greatly benefited from the rapid development of neural networks. However, the requirement for large amounts of noisy and clean speech pairs for training limits the widespread use of these models. Wavelet network-based speech enhancement typically relies on clean speech signals as a training target. This paper presents a new method that combines a neural network with the wavelet theory for speech enhancement without the need for clean speech signals as targets in training mode. Five wide evaluation criteria, namely short-time objective intelligibility (STOI), signal-to-noise ratio (SNR), segmental signal-to-noise ratio (SNRseg), weighted spectral slope (WSS) and logarithmic spectral distance (LSD), have been used to confirm the effectiveness of the proposed method. The results show that the proposed method performs similar to a wavelet neural network (WNN) trained with clean signals, or even superior to those obtained from the clean target-based strategies.
引用
收藏
页数:13
相关论文
共 50 条
  • [1] Training and Inference Strategy Using Noisy and Enhanced Speech as Target for Speech Enhancement without Clean Speech
    Chen, Li-Wei
    Cheng, Yao-Fei
    Lee, Hung-Shin
    Tsao, Yu
    Wang, Hsin-Min
    INTERSPEECH 2023, 2023, : 2473 - 2477
  • [2] Noisy-target Training: A Training Strategy for DNN-based Speech Enhancement without Clean Speech
    Fujimura, Takuya
    Koizumi, Yuma
    Yatabe, Kohei
    Miyazaki, Ryoichi
    29TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO 2021), 2021, : 436 - 440
  • [3] Speech Enhancement using Convolution Neural Network-based Spectrogram Denoising
    Hu Xuhong
    Yan Lin-Huang
    Lu Xun
    Guan Yuan-Sheng
    Hu Wenlin
    Wang Jie
    PROCEEDINGS OF 2021 7TH INTERNATIONAL CONFERENCE ON CONDITION MONITORING OF MACHINERY IN NON-STATIONARY OPERATIONS (CMMNO), 2021, : 310 - 318
  • [4] Enhancement of Coded Speech Using Neural Network-Based Side Information
    Hwang, Soojoong
    Cheon, Youngju
    Han, Sangwook
    Jang, Inseon
    Shin, Jong Won
    IEEE ACCESS, 2021, 9 : 121532 - 121540
  • [5] Single Channel Speech Enhancement System using Convolutional Neural Network based Autoencoder for Noisy Environments
    Buragohain, Rantu
    Ashishkumar, Gudmalwar
    Rao, Ch V. Rama
    2022 IEEE 19TH INDIA COUNCIL INTERNATIONAL CONFERENCE, INDICON, 2022,
  • [6] Enhancement of speech signals in a noisy environment based on wavelet based adaptive filtering
    Dept. of Electronics and Communication Engineering, Govt. College of Engineering, Tirunelveli, India
    Int. J. Signal Process. Image Process. Pattern Recogn., 9 (69-76): : 69 - 76
  • [7] Silence and speech segmentation for noisy speech using a wavelet based algorithm
    Mei, X., 2001, Chinese Institute of Electronics (10):
  • [8] Silence and speech segmentation for noisy speech using a wavelet based algorithm
    Mei, XD
    Sun, SH
    CHINESE JOURNAL OF ELECTRONICS, 2001, 10 (04): : 439 - 443
  • [9] Deep Convolutional Neural Network-based Speech Signal Enhancement Using Extensive Speech Features
    Garg, Anil
    Sahu, O. P.
    INTERNATIONAL JOURNAL OF COMPUTATIONAL METHODS, 2022, 19 (08)
  • [10] Integrating Uncertainty Into Neural Network-Based Speech Enhancement
    Fang, Huajian
    Becker, Dennis
    Wermter, Stefan
    Gerkmann, Timo
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2023, 31 : 1587 - 1600