Psychoacoustic model-driven spectral subtraction for monaural speech enhancement

Cited by: 0

Author:

Upadhyay N. [1]

Affiliation:

[1] Department of Electronics and Communication Engineering, The LNM Institute of Information Technology, Jaipur

Source:

International Journal of Speech Technology | 2023 / Vol. 26 / Issue 04

Keywords:

Adaptive noise estimation; Monaural speech enhancement; Psychoacoustic model; Spectral subtraction;

DOI:

10.1007/s10772-023-10062-9


Abstract:

In this paper, we investigate a psychoacoustic model-driven spectral subtraction framework for enhancing noisy speech. In the proposed framework, the noisy speech spectrum is divided into six distinct, unevenly spaced frequency subbands following the psychoacoustic model of the human hearing system, and spectral over-subtraction is applied independently in each subband. The noise in each subband is estimated using an adaptive noise estimator that does not require a speech pause tracker. To compute and update the noise estimate, the noisy speech power is adaptively smoothed using a smoothing factor controlled by the a posteriori SNR. The performance of the proposed framework is evaluated using SNR, segmental SNR (SegSNR), and PESQ scores for a variety of stationary and non-stationary noise environments at varying SNR levels. The experimental results show that the proposed framework outperforms several recent speech enhancement methods on these three widely used objective metrics and in spectrogram comparisons. © 2023, The Author(s), under exclusive licence to Springer Science+Business Media, LLC, part of Springer Nature.
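The two processing steps named in the abstract (adaptive noise tracking without a pause detector, and independent over-subtraction in six uneven subbands) can be illustrated with a minimal sketch. This is not the paper's implementation: the abstract gives no formulas, so the band edges, the sigmoid mapping from a posteriori SNR to the smoothing factor, the Berouti-style over-subtraction rule, and the spectral floor are all assumptions chosen only to make the idea concrete.

```python
import numpy as np

# ASSUMPTION: six uneven band edges in Hz for 8 kHz audio; the abstract does not list them.
BAND_EDGES_HZ = [0, 200, 500, 1000, 2000, 3000, 4000]


def smoothing_factor(post_snr, slope=4.0, threshold=1.5):
    """ASSUMPTION: sigmoid map from a posteriori SNR to a factor in (0, 1)."""
    return 1.0 / (1.0 + np.exp(-slope * (post_snr - threshold)))


def update_noise(noise_psd, noisy_psd):
    """Recursive noise update whose smoothing depends on the a posteriori SNR.

    High SNR (speech likely present) pushes the factor toward 1, so the old
    estimate is held; low SNR lets the estimate follow the noisy power.
    """
    post_snr = noisy_psd / np.maximum(noise_psd, 1e-12)
    alpha = smoothing_factor(post_snr)
    return alpha * noise_psd + (1.0 - alpha) * noisy_psd


def over_subtraction_factor(band_snr_db):
    """ASSUMPTION: Berouti-style linear rule, 4.75 at -5 dB down to 1 at 20 dB."""
    return 4.0 - 0.15 * np.clip(band_snr_db, -5.0, 20.0)


def enhance_frame(noisy_psd, noise_psd, freqs, floor=0.01):
    """Apply power spectral over-subtraction independently in each subband."""
    # Assign every frequency bin to one of the six bands (indices 0..5).
    band_idx = np.digitize(freqs, BAND_EDGES_HZ[1:-1])
    clean_psd = np.empty_like(noisy_psd)
    for b in range(len(BAND_EDGES_HZ) - 1):
        m = band_idx == b
        signal_power = max(noisy_psd[m].sum(), 1e-12)
        noise_power = max(noise_psd[m].sum(), 1e-12)
        snr_db = 10.0 * np.log10(signal_power / noise_power)
        alpha = over_subtraction_factor(snr_db)
        # A spectral floor keeps the result positive and limits musical noise.
        clean_psd[m] = np.maximum(noisy_psd[m] - alpha * noise_psd[m],
                                  floor * noisy_psd[m])
    return clean_psd


# Usage on one synthetic frame (256-point FFT at 8 kHz, flat unit noise).
freqs = np.fft.rfftfreq(256, d=1.0 / 8000.0)
noise = np.ones_like(freqs)
noisy = 100.0 * np.ones_like(freqs)
enhanced = enhance_frame(noisy, noise, freqs)
noise = update_noise(noise, noisy)
```

In a full system these two functions would run once per short-time frame, with the enhanced power spectrum recombined with the noisy phase before the inverse transform; that surrounding analysis and synthesis stage is omitted here.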


Pages: 963-979

Number of pages: 16

