Speech Enhancement Based on Discrete Wavelet Packet Transform and Itakura-Saito Nonnegative Matrix Factorisation

被引:3
|
作者
Liu, Houguang [1 ]
Wang, Wenbo [1 ]
Xue, Lin [1 ]
Yang, Jianhua [1 ]
Wang, Zhihua [1 ]
Hua, Chunli [1 ]
机构
[1] China Univ Min & Technol, Sch Mechatron Engn, Xuzhou 221116, Jiangsu, Peoples R China
基金
中国国家自然科学基金;
关键词
speech enhancement; discrete wavelet packet transform; nonnegative matrix factorisation; Itakura-Saito divergence; NOISE; ALGORITHMS; SEPARATION; QUALITY; NMF;
D O I
10.24425/aoa.2020.134072
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Nonnegative matrix factorization (NMF) is one of the most popular machine learning tools for speech enhancement (SE). However, there are two problems reducing the performance of the traditional NMF-based SE algorithms. One is related to the overlap-and-add operation used in the short time Fourier transform (STFT) based signal reconstruction, and the other is the Euclidean distance used commonly as an objective function; these methods can cause distortion in the SE process. In order to get over these shortcomings, we propose a novel SE joint framework which combines the discrete wavelet packet transform (DWPT) and the Itakura-Saito nonnegative matrix factorisation (ISNMF). In this approach, the speech signal was first split into a series of subband signals using the DWPT. Then, the ISNMF was used to enhance the speech for each subband signal. Finally, the inverse DWPT (IDWT) was utilised to reconstruct these enhanced speech subband signals. The experimental results show that the proposed joint framework effectively enhances the performance of speech enhancement and performs better in the unseen noise case compared to the traditional NMF methods.
引用
收藏
页码:565 / 572
页数:8
相关论文
共 50 条
  • [21] Speech enhancement by overweighting gain with nonlinear structure in wavelet packet transform
    Jung, Sung-Ill
    Kwon, Younghun
    Yang, Sung-Il
    [J]. IEICE TRANSACTIONS ON COMMUNICATIONS, 2007, E90B (08) : 2147 - 2150
  • [22] IMAGE ENHANCEMENT BASED ON DISCRETE WAVELET TRANSFORM
    Sumathi, M.
    Murthi, V. Krishna
    [J]. IIOAB JOURNAL, 2016, 7 (10) : 12 - 15
  • [23] Speech enhancement using sparse dictionary learning in wavelet packet transform domain
    Mavaddaty, Samira
    Ahadi, Seyed Mohammad
    Seyedin, Sanaz
    [J]. COMPUTER SPEECH AND LANGUAGE, 2017, 44 : 22 - 47
  • [24] Speech enhancement for nonstationary noises by wavelet packet transform and adaptive noise estimation
    Lei, SF
    Tung, YK
    [J]. ISPACS 2005: PROCEEDINGS OF THE 2005 INTERNATIONAL SYMPOSIUM ON INTELLIGENT SIGNAL PROCESSING AND COMMUNICATION SYSTEMS, 2005, : 41 - 44
  • [25] An enhanced psychoacoustic model based on the discrete wavelet packet transform
    He, Xing
    Scordilis, Michael S.
    [J]. JOURNAL OF THE FRANKLIN INSTITUTE-ENGINEERING AND APPLIED MATHEMATICS, 2006, 343 (07): : 738 - 755
  • [26] Psychoacoustic Music Analysis Based on the Discrete Wavelet Packet Transform
    He, Xing
    Scordilis, Michael S.
    [J]. JOURNAL OF ELECTRICAL AND COMPUTER ENGINEERING, 2008, 2008
  • [27] Speech/Music Discrimination Based on Discrete Wavelet Transform
    Ntalampiras, Stavros
    Fakotakis, Nikos
    [J]. ARTIFICIAL INTELLIGENCE: THEORIES, MODELS AND APPLICATIONS, SETN 2008, 2008, 5138 : 205 - 211
  • [28] Speech Enhancement Based on Reducing the Detail Portion of Speech Spectrograms in Modulation Domain via Discrete Wavelet Transform
    Lee, Shih-kuang
    Wang, Syu-Siang
    Tsao, Yu
    Hung, Jeih-weih
    [J]. 2018 11TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING (ISCSLP), 2018, : 16 - 20
  • [29] Speech Enhancement Based on Codebook Constrained Nonnegative Matrix Factorization
    Bai, Zhigang
    Bao, Changchun
    Yan, Bofang
    [J]. 2018 INTERNATIONAL CONFERENCE ON AUDIO, LANGUAGE AND IMAGE PROCESSING (ICALIP), 2018, : 361 - 365
  • [30] Packet-Loss Robust Scalable Speech Coding Using the Discrete Wavelet Transform
    Seto, Koji
    Ogunfunmi, Tokunbo
    [J]. 2014 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS (ISCAS), 2014, : 129 - 132