Psychoacoustic model-driven spectral subtraction for monaural speech enhancement

被引：0

作者：

Upadhyay N. ^{[1
]}

机构：

[1] Department of Electronics and Communication Engineering, The LNM Institute of Information Technology, Jaipur

来源：

International Journal of Speech Technology | 2023年 / 26卷 / 04期

关键词：

Adaptive noise estimation; Monaural speech enhancement; Psychoacoustic model; Spectral subtraction;

D O I：

10.1007/s10772-023-10062-9

中图分类号：

学科分类号：

摘要：

In this paper, we investigate a psychoacoustic model-driven spectral subtraction framework for enhancement of noisy speech. In the proposed framework, the noisy speech spectrum is separated into six distinct and unevenly frequency-spaced subbands as per the psychoacoustic model of the human hearing system, and spectral over-subtraction is applied independently in each subband. The noise in each subband is estimated using an adaptive noise estimator that does not require a speech pause tracker. To compute and update the noise, the noisy speech power is adaptively smoothed using a smoothing factor controlled by a posterior SNR. The performance of the proposed framework is evaluated using SNR, segmental SNR (SegSNR), and PESQ scores for a variety of non-stationary and stationary noise environments at varying SNR levels. The experimental results show that the proposed framework outperforms various up-to-date speech enhancement technologies on three extensively used objective metrics assessments and speech spectrograms. © 2023, The Author(s), under exclusive licence to Springer Science+Business Media, LLC, part of Springer Nature.

引用

页码：963 / 979

页数：16

共 50 条

[31] Enhancement of alaryngeal speech utilizing spectral subtraction and minimum statistics
Kabir, Raonaak
Greenblatt, Aaron
Panetta, Karen
Agaian, Sos
PROCEEDINGS OF 2008 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-7, 2008, : 3704 - +
[32] A real time spectral subtraction based speech enhancement scheme
Flogeras, D
Doraiswami, R
Kaye, ME
CCECE 2003: CANADIAN CONFERENCE ON ELECTRICAL AND COMPUTER ENGINEERING, VOLS 1-3, PROCEEDINGS: TOWARD A CARING AND HUMANE TECHNOLOGY, 2003, : 1071 - 1074
[33] DNN-based monaural speech enhancement with temporal and spectral variations equalization
Kang, Tae Gyoon
Shin, Jong Won
Kim, Nam Soo
DIGITAL SIGNAL PROCESSING, 2018, 74 : 102 - 110
[34] Harmonic Attention for Monaural Speech Enhancement
Wang, Tianrui
Zhu, Weibin
Gao, Yingying
Zhang, Shilei
Feng, Junlan
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2023, 31 : 2424 - 2436
[35] Monaural speech enhancement with dilated convolutions
Pirhosseinloo, Shadi
Brumberg, Jonathan S.
INTERSPEECH 2019, 2019, : 3143 - 3147
[36] Speech enhancement based on a combined spectral subtraction with spectral estimation in various noise environment
Wang, Guangyan
Wang, Xia
Zha, Xiaoqun
2008 INTERNATIONAL CONFERENCE ON AUDIO, LANGUAGE AND IMAGE PROCESSING, VOLS 1 AND 2, PROCEEDINGS, 2008, : 1424 - 1429
[37] Power Spectral Density Error Analysis of Spectral Subtraction Type of Speech Enhancement Methods
Peter Händel
EURASIP Journal on Advances in Signal Processing, 2007
[38] Multi-Band Spectral Subtraction Method for Electrolarynx Speech Enhancement
Li, Sheng
Wan, MingXi
Wang, SuPin
ALGORITHMS, 2009, 2 (01) : 550 - 564
[39] Tamil Speech Enhancement Using Non-Linear Spectral Subtraction
Prabhakaran, G.
Indra, J.
Kasthuri, N.
2014 INTERNATIONAL CONFERENCE ON COMMUNICATIONS AND SIGNAL PROCESSING (ICCSP), 2014,
[40] Power spectral density error analysis of spectral subtraction type of speech enhancement methods
Haendel, Peter
EURASIP JOURNAL ON ADVANCES IN SIGNAL PROCESSING, 2007, 2007 (1)

← 1 2 3 4 5 →