Psychoacoustic model-driven spectral subtraction for monaural speech enhancement

被引:0
|
作者
Upadhyay N. [1 ]
机构
[1] Department of Electronics and Communication Engineering, The LNM Institute of Information Technology, Jaipur
关键词
Adaptive noise estimation; Monaural speech enhancement; Psychoacoustic model; Spectral subtraction;
D O I
10.1007/s10772-023-10062-9
中图分类号
学科分类号
摘要
In this paper, we investigate a psychoacoustic model-driven spectral subtraction framework for enhancement of noisy speech. In the proposed framework, the noisy speech spectrum is separated into six distinct and unevenly frequency-spaced subbands as per the psychoacoustic model of the human hearing system, and spectral over-subtraction is applied independently in each subband. The noise in each subband is estimated using an adaptive noise estimator that does not require a speech pause tracker. To compute and update the noise, the noisy speech power is adaptively smoothed using a smoothing factor controlled by a posterior SNR. The performance of the proposed framework is evaluated using SNR, segmental SNR (SegSNR), and PESQ scores for a variety of non-stationary and stationary noise environments at varying SNR levels. The experimental results show that the proposed framework outperforms various up-to-date speech enhancement technologies on three extensively used objective metrics assessments and speech spectrograms. © 2023, The Author(s), under exclusive licence to Springer Science+Business Media, LLC, part of Springer Nature.
引用
收藏
页码:963 / 979
页数:16
相关论文
共 50 条
  • [21] Perceptually Motivated Generalized Spectral Subtraction for Speech Enhancement
    Zoghlami, Novlene
    Lachiri, Zied
    Ellouze, Noureddine
    ADVANCES IN NONLINEAR SPEECH PROCESSING, 2010, 5933 : 136 - 143
  • [22] Real and imaginary modulation spectral subtraction for speech enhancement
    Zhang, Yi
    Zhao, Yunxin
    SPEECH COMMUNICATION, 2013, 55 (04) : 509 - 522
  • [23] A recursive parametric spectral subtraction algorithm for speech enhancement
    You, Ming-Chan
    Mao, Cheng-Yi
    Wang, Jeen-Shing
    Chuang, Fang-Chen
    ADVANCED INTELLIGENT COMPUTING THEORIES AND APPLICATIONS: WITH ASPECTS OF CONTEMPORARY INTELLIGENT COMPUTING TECHNIQUES, 2007, 2 : 826 - +
  • [24] Enhancement of noisy speech by spectral subtraction and residual modification
    Krishnamoorthy, P.
    Prasanna, S. R. Mahadeva
    2006 ANNUAL IEEE INDIA CONFERENCE, 2006, : 124 - +
  • [25] COMPLEX SPECTRAL MAPPING WITH A CONVOLUTIONAL RECURRENT NETWORK FOR MONAURAL SPEECH ENHANCEMENT
    Tan, Ke
    Wang, DeLiang
    2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 6865 - 6869
  • [26] Speech dereverberation method based on spectral subtraction and spectral line enhancement
    Chen, Zhe
    Wang, Rui
    Yin, Fuliang
    Wang, Bingqian
    Peng, Wenwen
    APPLIED ACOUSTICS, 2016, 112 : 201 - 210
  • [27] Improved perceptually inspired speech enhancement using a psychoacoustic model
    Hu, RQ
    Anderson, DV
    CONFERENCE RECORD OF THE THIRTY-EIGHTH ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS & COMPUTERS, VOLS 1 AND 2, 2004, : 440 - 444
  • [28] Speech enhancement based on the decomposition of speech into deterministic and stochastic components and psychoacoustic model
    Jo, Seokhwan
    Yoo, Chang D.
    2007 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL IV, PTS 1-3, 2007, : 897 - +
  • [29] SPEECH ENHANCEMENT EMPLOYING SPECTRAL SUBTRACTION AND LINEAR PREDICTIVE ANALYSIS
    CROZIER, PM
    CHEETHAM, BMG
    HOLT, C
    MUNDAY, E
    ELECTRONICS LETTERS, 1993, 29 (12) : 1094 - 1095
  • [30] A hybrid speech enhancement system based on HMM and spectral subtraction
    Ghoreishi, MH
    Sheikhzadeh, H
    2000 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, PROCEEDINGS, VOLS I-VI, 2000, : 1855 - 1858