Single-Channel Speech Enhancement Based on Improved Frame-Iterative Spectral Subtraction in the Modulation Domain

被引:0
|
作者
Li, Chao [1 ]
Jiang, Ting [1 ]
Wu, Sheng [1 ]
机构
[1] Beijing Univ Posts & Telecommun, Sch Informat & Commun Engn, Beijing 100876, Peoples R China
基金
中国国家自然科学基金; 国家自然科学基金重大项目;
关键词
short-time modulation domain; single-channel speech enhancement; modulation improved frame iterative spectral subtraction; low SNRs; MEAN-SQUARE ERROR; NOISE; MAGNITUDE;
D O I
暂无
中图分类号
TN [电子技术、通信技术];
学科分类号
0809 ;
摘要
Aiming at the problem of music noise introduced by classical spectral subtraction, a short-time modulation domain (STM) spectral subtraction method has been successfully applied for single-channel speech enhancement. However, due to the inaccurate voice activity detection (VAD), the residual music noise and enhanced performance still need to be further improved, especially in the low signal to noise ratio (SNR) scenarios. To address this issue, an improved frame iterative spectral subtraction in the STM domain (IMModSSub) is proposed. More specifically, with the inter-frame correlation, the noise subtraction is directly applied to handle the noisy signal for each frame in the STM domain. Then, the noisy signal is classified into speech or silence frames based on a predefined threshold of segmented SNR. With these classification results, a corresponding mask function is developed for noisy speech after noise subtraction. Finally, exploiting the increased sparsity of speech signal in the modulation domain, the orthogonal matching pursuit (OMP) technique is employed to the speech frames for improving the speech quality and intelligibility. The effectiveness of the proposed method is evaluated with three types of noise, including white noise, pink noise, and hfchannel noise. The obtained results show that the proposed method outperforms some established baselines at lower SNRs (5 to +5 dB).
引用
收藏
页码:100 / 115
页数:16
相关论文
共 50 条
  • [31] Iterative Closed-Loop Phase-Aware Single-Channel Speech Enhancement
    Mowlaee, Pejman
    Saeidi, Rahim
    IEEE SIGNAL PROCESSING LETTERS, 2013, 20 (12) : 1235 - 1239
  • [32] Two-Stage Single-Channel Speech Enhancement with Multi-Frame Filtering
    Lin, Shaoxiong
    Zhang, Wangyou
    Qian, Yanmin
    APPLIED SCIENCES-BASEL, 2023, 13 (08):
  • [33] Phase Processing for Single-Channel Speech Enhancement
    Gerkmann, Timo
    Krawczyk-Becker, Martin
    Le Roux, Jonathan
    IEEE SIGNAL PROCESSING MAGAZINE, 2015, 32 (02) : 55 - 66
  • [34] SINGLE CHANNEL SPEECH ENHANCEMENT IN THE MODULATION DOMAIN: NEW INSIGHTS IN THE MODULATION CHANNEL SELECTION FRAMEWORK
    Boldt, Jesper B.
    Bertelsen, Andreas T.
    Gran, Fredrik
    Jorgensen, Soren
    Dau, Torsten
    2015 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP), 2015, : 5748 - 5752
  • [35] SINGLE-CHANNEL SPECTRAL ANALYSIS AND SPECTRUM ENHANCEMENT
    STARSHAK, AJ
    LARSEN, RD
    ABSTRACTS OF PAPERS OF THE AMERICAN CHEMICAL SOCIETY, 1969, (SEP): : PH47 - &
  • [36] Non-Air Conducted Speech Enhancement Based on Iterative Spectral Subtraction Method
    Li, Sheng
    Wang, JianQi
    Jing, XiJing
    Li, Sheng
    2010 4TH INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICAL ENGINEERING (ICBBE 2010), 2010,
  • [37] On DCT-based MMSE estimation of short time spectral amplitude for single-channel speech enhancement
    Shi, Sisi
    Paliwal, Kuldip
    Busch, Andrew
    APPLIED ACOUSTICS, 2023, 202
  • [38] Research of speech enhancement based on wavelet-domain spectral subtraction algorithm
    Xu, Yan
    Cha, Cheng
    Wang, Wei-Han
    Tiedao Xuebao/Journal of the China Railway Society, 2006, 28 (06): : 64 - 68
  • [39] Musical-Noise-Free Speech Enhancement Based on Optimized Iterative Spectral Subtraction
    Miyazaki, Ryoichi
    Saruwatari, Hiroshi
    Inoue, Takayuki
    Takahashi, Yu
    Shikano, Kiyohiro
    Kondo, Kazunobu
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2012, 20 (07): : 2080 - 2094
  • [40] Single-Channel Speech Enhancement Techniques for Distant Speech Recognition
    Ashwini, Jaya
    Kumaraswamy, Ramaswamy
    JOURNAL OF INTELLIGENT SYSTEMS, 2013, 22 (02) : 81 - 93