Speech enhancement using sparse dictionary learning in wavelet packet transform domain

被引:11
|
作者
Mavaddaty, Samira [1 ]
Ahadi, Seyed Mohammad [1 ]
Seyedin, Sanaz [1 ]
机构
[1] Amirkabir Univ Technol, Elect Engn Dept, 424 Hafez Ave, Tehran, Iran
来源
关键词
Speech enhancement; Dictionary learning; Sparse representation; Domain adaptation; Voice activity detector; Wavelet packet transform; VOICE ACTIVITY DETECTION; LOW-RANK;
D O I
10.1016/j.csl.2017.01.009
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Sparse coding, as a successful representation method for many signals, has been recently employed in speech enhancement. This paper presents a new learning-based speech enhancement algorithm via sparse representation in the wavelet packet transform domain. We propose sparse dictionary learning procedures for training data of speech and noise signals based on a coherence criterion, for each subband of decomposition level. Using these learning algorithms, self-coherence between atoms of each dictionary and mutual coherence between speech and noise dictionary atoms are minimized along with the approximation error. The speech enhancement algorithm is introduced in two scenarios, supervised and semi-supervised. In each scenario, a voice activity detector scheme is employed based on the energy of sparse coefficient matrices when the observation data is coded over corresponding dictionaries. In the proposed supervised scenario, we take advantage of domain adaptation techniques to transform a learned noise dictionary to a dictionary adapted to noise conditions captured based on the test environment circumstances. Using this step, observation data is sparsely coded, based on the current situation of the noisy space, with low sparse approximation error. This technique has a prominent role in obtaining better enhancement results particularly when the noise is non-stationary. In the proposed semi-supervised scenario, adaptive thresholding of wavelet coefficients is carried out based on the variance of the estimated noise in each frame of different subbands. The proposed approaches lead to significantly better speech enhancement results in comparison with the earlier methods in this context and the traditional procedures, based on different objective and subjective measures as well as a statistical test. (C) 2017 Elsevier Ltd. All rights reserved.
引用
收藏
页码:22 / 47
页数:26
相关论文
共 50 条
  • [1] A new speech enhancement algorithm using wavelet packet transform
    Guo, Jichang
    Wang, Wenliang
    [J]. PROCEEDINGS OF THE INTERNATIONAL CONFERENCE INFORMATION COMPUTING AND AUTOMATION, VOLS 1-3, 2008, : 504 - 506
  • [2] Combined discrete wavelet transform and wavelet packet decomposition for speech enhancement
    Wang, Zhen-li
    Yang, Jie
    Zhang, Xiong-wei
    [J]. 2006 8TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING, VOLS 1-4, 2006, : 1107 - +
  • [3] New algorithm for speech enhancement using node threshold wavelet packet transform
    Department of Instrument Science, Yanshan University, Qinhuangdao 066004, China
    [J]. Yi Qi Yi Biao Xue Bao, 2007, 5 (952-955):
  • [4] A new speech enhancement method based on wavelet packet transform
    Wang, Jizeng
    Wang, Chanfei
    [J]. CISP 2008: FIRST INTERNATIONAL CONGRESS ON IMAGE AND SIGNAL PROCESSING, VOL 4, PROCEEDINGS, 2008, : 261 - 265
  • [5] An unsupervised approach for improving speech enhancement using wavelet packet transform and adaptive thresholding
    Shafieian, Mohammadali
    Rahmanian, Mojdeh
    [J]. INGENIERIA UC, 2019, 26 (03): : 319 - 336
  • [6] Classification of ECG Arrhythmia Using Wavelet Packet Transform Analysis and Sparse Learning Method
    Mavaddati, Samira
    [J]. IRANIAN JOURNAL OF SCIENCE AND TECHNOLOGY-TRANSACTIONS OF ELECTRICAL ENGINEERING, 2023, 47 (04) : 1583 - 1593
  • [7] Classification of ECG Arrhythmia Using Wavelet Packet Transform Analysis and Sparse Learning Method
    Samira Mavaddati
    [J]. Iranian Journal of Science and Technology, Transactions of Electrical Engineering, 2023, 47 : 1583 - 1593
  • [8] Speech enhancement by overweighting gain with nonlinear structure in wavelet packet transform
    Jung, Sung-Ill
    Kwon, Younghun
    Yang, Sung-Il
    [J]. IEICE TRANSACTIONS ON COMMUNICATIONS, 2007, E90B (08) : 2147 - 2150
  • [9] Multi-domain speech compression based on wavelet packet transform
    Wu, XD
    Li, YM
    Chen, HY
    [J]. ELECTRONICS LETTERS, 1998, 34 (02) : 154 - 155
  • [10] Speech enhancement using robust weighting factors for critical-band-wavelet-packet transform
    Lu, CT
    Wang, HC
    [J]. 2004 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL I, PROCEEDINGS: SPEECH PROCESSING, 2004, : 721 - 724