A wavelet network-based speech enhancement system using noisy-as-clean strategy

被引:0
|
作者
Hajiaghababa, Fatemeh [1 ]
Abutalebi, Hamid Reza [1 ]
机构
[1] Yazd Univ, Elect Engn Dept, Yazd, Iran
关键词
Speech enhancement; wavelet network; noisy-as-clean; noisy target training; NEURAL-NETWORKS;
D O I
10.1142/S0219691323500339
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
In recent years, the field of speech enhancement has greatly benefited from the rapid development of neural networks. However, the requirement for large amounts of noisy and clean speech pairs for training limits the widespread use of these models. Wavelet network-based speech enhancement typically relies on clean speech signals as a training target. This paper presents a new method that combines a neural network with the wavelet theory for speech enhancement without the need for clean speech signals as targets in training mode. Five wide evaluation criteria, namely short-time objective intelligibility (STOI), signal-to-noise ratio (SNR), segmental signal-to-noise ratio (SNRseg), weighted spectral slope (WSS) and logarithmic spectral distance (LSD), have been used to confirm the effectiveness of the proposed method. The results show that the proposed method performs similar to a wavelet neural network (WNN) trained with clean signals, or even superior to those obtained from the clean target-based strategies.
引用
收藏
页数:13
相关论文
共 50 条
  • [21] A STUDY OF TRAINING TARGETS FOR DEEP NEURAL NETWORK-BASED SPEECH ENHANCEMENT USING NOISE PREDICTION
    Odelowo, Babafemi O.
    Anderson, David V.
    2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2018, : 5409 - 5413
  • [22] Coded Speech Enhancement Using Neural Network-Based Vector-Quantized Residual Features
    Cheon, Youngju
    Hwang, Soojoong
    Han, Sangwook
    Jang, Inseon
    Shin, Jong Won
    INTERSPEECH 2021, 2021, : 1664 - 1668
  • [23] Convolutional Neural Network-based Speech Enhancement for Cochlear Implant Recipients
    Mamun, Nursadul
    Khorram, Soheil
    Hansen, John H. L.
    INTERSPEECH 2019, 2019, : 4265 - 4269
  • [24] Deep Residual Network-Based Augmented Kalman Filter for Speech Enhancement
    Roy, Sujan Kumar
    Paliwal, Kuldip K.
    2020 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2020, : 667 - 673
  • [25] Wavelet fuzzy neural network-based predictive control system
    El-Rabaie, NM
    Awad, HA
    Mahmoud, TA
    MELECON 2004: PROCEEDINGS OF THE 12TH IEEE MEDITERRANEAN ELECTROTECHNICAL CONFERENCE, VOLS 1-3, 2004, : 307 - 310
  • [26] An improved wavelet-based speech enhancement by using speech signal features
    Ayat, Saeed
    Manzuri-Shalmani, M. T.
    Dianat, Roohollah
    COMPUTERS & ELECTRICAL ENGINEERING, 2006, 32 (06) : 411 - 425
  • [27] Wavelet based speech enhancement using a new thresholding algorithm
    Ayat, S
    Manzuri, MT
    Dianat, R
    PROCEEDINGS OF THE 2004 INTERNATIONAL SYMPOSIUM ON INTELLIGENT MULTIMEDIA, VIDEO AND SPEECH PROCESSING, 2004, : 238 - 241
  • [28] Speech detection in the noisy environment using wavelet transform
    Kacur, J
    Frank, J
    Rozinaj, G
    PROCEEDINGS EC-VIP-MC 2003, VOLS 1 AND 2, 2003, : 661 - 666
  • [29] INCORPORATING REAL-WORLD NOISY SPEECH IN NEURAL-NETWORK-BASED SPEECH ENHANCEMENT SYSTEMS
    Xia, Yangyang
    Xu, Buye
    Kumar, Anurag
    2021 IEEE AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING WORKSHOP (ASRU), 2021, : 564 - 570
  • [30] iMetricGAN: Intelligibility Enhancement for Speech-in-Noise using Generative Adversarial Network-based Metric Learning
    Li, Haoyu
    Fu, Szu-Wei
    Tsao, Yu
    Yamagishi, Junichi
    INTERSPEECH 2020, 2020, : 1336 - 1340