Psychoacoustic model-driven spectral subtraction for monaural speech enhancement

被引:0
|
作者
Upadhyay N. [1 ]
机构
[1] Department of Electronics and Communication Engineering, The LNM Institute of Information Technology, Jaipur
关键词
Adaptive noise estimation; Monaural speech enhancement; Psychoacoustic model; Spectral subtraction;
D O I
10.1007/s10772-023-10062-9
中图分类号
学科分类号
摘要
In this paper, we investigate a psychoacoustic model-driven spectral subtraction framework for enhancement of noisy speech. In the proposed framework, the noisy speech spectrum is separated into six distinct and unevenly frequency-spaced subbands as per the psychoacoustic model of the human hearing system, and spectral over-subtraction is applied independently in each subband. The noise in each subband is estimated using an adaptive noise estimator that does not require a speech pause tracker. To compute and update the noise, the noisy speech power is adaptively smoothed using a smoothing factor controlled by a posterior SNR. The performance of the proposed framework is evaluated using SNR, segmental SNR (SegSNR), and PESQ scores for a variety of non-stationary and stationary noise environments at varying SNR levels. The experimental results show that the proposed framework outperforms various up-to-date speech enhancement technologies on three extensively used objective metrics assessments and speech spectrograms. © 2023, The Author(s), under exclusive licence to Springer Science+Business Media, LLC, part of Springer Nature.
引用
收藏
页码:963 / 979
页数:16
相关论文
共 50 条
  • [1] Monaural speech segregation based on fusion of source-driven with model-driven techniques
    Radfar, Mohammad H.
    Dansereau, Richard M.
    Sayadiyan, Abolghasem
    SPEECH COMMUNICATION, 2007, 49 (06) : 464 - 476
  • [2] CMAC spectral subtraction for speech enhancement
    Wahab, A
    Tan, EC
    Abut, H
    ISSPA 2001: SIXTH INTERNATIONAL SYMPOSIUM ON SIGNAL PROCESSING AND ITS APPLICATIONS, VOLS 1 AND 2, PROCEEDINGS, 2001, : 707 - 710
  • [3] A PSYCHOACOUSTIC SPECTRAL SUBTRACTION METHOD FOR NOISE SUPPRESSION IN AUTOMATIC SPEECH RECOGNITION
    Haque, Serajul
    Togneri, Roberto
    2010 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2010, : 1618 - 1621
  • [4] Modulation Domain Spectral Subtraction for Speech Enhancement
    Paliwal, Kuldip
    Schwerin, Belinda
    Wojcicki, Kamil
    INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5, 2009, : 1343 - 1346
  • [5] Enhancement of alaryngeal speech using spectral subtraction
    Pandey, PC
    Bhandarkar, SM
    Bachher, GK
    Lehana, PK
    DSP 2002: 14TH INTERNATIONAL CONFERENCE ON DIGITAL SIGNAL PROCESSING PROCEEDINGS, VOLS 1 AND 2, 2002, : 591 - 594
  • [6] A spatial procedure to spectral subtraction for speech enhancement
    Thimmaraja Yadava G
    Nagaraja B G
    Jayanna H S
    Multimedia Tools and Applications, 2022, 81 : 23633 - 23647
  • [7] Supplementary schemes to spectral subtraction for speech enhancement
    Hu, HT
    Kuo, FJ
    Wang, HJ
    SPEECH COMMUNICATION, 2002, 36 (3-4) : 205 - 218
  • [8] A spatial procedure to spectral subtraction for speech enhancement
    Yadava, Thimmaraja G.
    Nagaraja, B. G.
    Jayanna, H. S.
    MULTIMEDIA TOOLS AND APPLICATIONS, 2022, 81 (17) : 23633 - 23647
  • [9] Speech Enhancement Based on Spectral Subtraction for Speech Recognition System
    Han, Jung-woo
    Kim, Se-young
    Kim, Ki-man
    Jung, Ji-won
    Yun, Young
    IEEE INTERNATIONAL CONFERENCE ON CONSUMER ELECTRONICS (ICCE 2011), 2011, : 417 - 418
  • [10] Adaptive noise spectral estimation for spectral subtraction speech enhancement
    Hu, H. T.
    Yu, C.
    IET SIGNAL PROCESSING, 2007, 1 (03) : 156 - 163