Psychoacoustic model-driven spectral subtraction for monaural speech enhancement

Author
Upadhyay N. [1]
Affiliation
[1] Department of Electronics and Communication Engineering, The LNM Institute of Information Technology, Jaipur
Keywords
Adaptive noise estimation; Monaural speech enhancement; Psychoacoustic model; Spectral subtraction
DOI
10.1007/s10772-023-10062-9
Abstract
In this paper, we investigate a psychoacoustic model-driven spectral subtraction framework for enhancing noisy speech. In the proposed framework, the noisy speech spectrum is divided into six non-uniformly spaced frequency subbands according to the psychoacoustic model of the human auditory system, and spectral over-subtraction is applied independently in each subband. The noise in each subband is estimated with an adaptive noise estimator that does not require a speech pause tracker. To compute and update the noise estimate, the noisy speech power is adaptively smoothed using a smoothing factor controlled by the a posteriori SNR. Performance is evaluated using SNR, segmental SNR (SegSNR), and PESQ scores in a variety of stationary and non-stationary noise environments at varying SNR levels. The experimental results show that the proposed framework outperforms several recent speech enhancement techniques, as judged by these three widely used objective metrics and by speech spectrograms. © 2023, The Author(s), under exclusive licence to Springer Science+Business Media, LLC, part of Springer Nature.
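The per-subband processing described in the abstract can be illustrated with a minimal Python sketch. It is not the paper's implementation: the abstract does not give the smoothing rule, the band edges, or the subtraction parameters, so the sigmoid mapping in `smoothing_factor`, the constants `slope`, `center`, `over` and `floor`, and all function names are assumptions chosen for illustration. Each call handles one frame, with one power value per subband.

```python
import math

def smoothing_factor(post_snr, slope=1.5, center=1.5):
    """Map the a posteriori SNR to a smoothing factor in (0, 1).

    Assumed sigmoid rule: a high SNR (speech likely present) gives a factor
    near 1, so the old noise estimate is kept; a low SNR gives a factor
    near 0, so the estimate follows the noisy input.
    """
    return 1.0 / (1.0 + math.exp(-slope * (post_snr - center)))

def update_noise(noise_pow, noisy_pow):
    """One recursive update of a subband noise power estimate.

    No speech pause tracker is used; the a posteriori SNR alone
    controls how fast the estimate adapts.
    """
    post_snr = noisy_pow / max(noise_pow, 1e-12)
    alpha = smoothing_factor(post_snr)
    return alpha * noise_pow + (1.0 - alpha) * noisy_pow

def over_subtract(noisy_pow, noise_pow, over=2.0, floor=0.01):
    """Power-domain over-subtraction with a spectral floor (assumed constants)."""
    clean = noisy_pow - over * noise_pow
    return max(clean, floor * noise_pow)

def enhance_frame(band_powers, noise_powers):
    """Process one frame independently in each subband.

    band_powers and noise_powers are equal-length lists, one value per
    subband (six in the framework described above).
    Returns (enhanced_powers, updated_noise_powers).
    """
    new_noise = [update_noise(n, p) for n, p in zip(noise_powers, band_powers)]
    enhanced = [over_subtract(p, n) for p, n in zip(band_powers, new_noise)]
    return enhanced, new_noise
```

In a complete system these functions would sit between a short-time Fourier analysis stage that groups bins into the six subbands and a synthesis stage that reuses the noisy phase; both stages are omitted here.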
Pages: 963-979
Number of pages: 16