Psychoacoustic model-driven spectral subtraction for monaural speech enhancement

被引：0

作者：

Upadhyay N. ^{[1
]}

机构：

[1] Department of Electronics and Communication Engineering, The LNM Institute of Information Technology, Jaipur

来源：

International Journal of Speech Technology | 2023年 / 26卷 / 04期

关键词：

Adaptive noise estimation; Monaural speech enhancement; Psychoacoustic model; Spectral subtraction;

D O I：

10.1007/s10772-023-10062-9

中图分类号：

学科分类号：

摘要：

In this paper, we investigate a psychoacoustic model-driven spectral subtraction framework for enhancement of noisy speech. In the proposed framework, the noisy speech spectrum is separated into six distinct and unevenly frequency-spaced subbands as per the psychoacoustic model of the human hearing system, and spectral over-subtraction is applied independently in each subband. The noise in each subband is estimated using an adaptive noise estimator that does not require a speech pause tracker. To compute and update the noise, the noisy speech power is adaptively smoothed using a smoothing factor controlled by a posterior SNR. The performance of the proposed framework is evaluated using SNR, segmental SNR (SegSNR), and PESQ scores for a variety of non-stationary and stationary noise environments at varying SNR levels. The experimental results show that the proposed framework outperforms various up-to-date speech enhancement technologies on three extensively used objective metrics assessments and speech spectrograms. © 2023, The Author(s), under exclusive licence to Springer Science+Business Media, LLC, part of Springer Nature.

引用

页码：963 / 979

页数：16

共 50 条

[11] Modified Magnitude Spectral Subtraction Methods for Speech Enhancement
Naik, D. C.
Murthy, A. Sreenivasa
Nuthakki, Ramesh
2017 INTERNATIONAL CONFERENCE ON ELECTRICAL, ELECTRONICS, COMMUNICATION, COMPUTER, AND OPTIMIZATION TECHNIQUES (ICEECCOT), 2017, : 274 - 279
[12] Speech Enhancement Based on a Modified Spectral Subtraction Method
Islam, Md. T.
Shahnaz, C.
Fattah, S. A.
2014 IEEE 57TH INTERNATIONAL MIDWEST SYMPOSIUM ON CIRCUITS AND SYSTEMS (MWSCAS), 2014, : 1085 - 1088
[13] Speech enhancement using spectral subtraction with wavelet transform
Nishimura, R
Asano, F
Suzuki, Y
Sone, T
ELECTRONICS AND COMMUNICATIONS IN JAPAN PART III-FUNDAMENTAL ELECTRONIC SCIENCE, 1998, 81 (01): : 24 - 31
[14] An Iterative Graph Spectral Subtraction Method for Speech Enhancement
Yan, Xue
Yang, Zhen
Wang, Tingting
Guo, Haiyan
SPEECH COMMUNICATION, 2020, 123 : 35 - 42
[15] Adaptive β-order generalized spectral subtraction for speech enhancement
Li, Junfeng
Sakamoto, Shuichi
Hongo, Satoshi
Akagi, Masato
Suzuki, Yoiti
SIGNAL PROCESSING, 2008, 88 (11) : 2764 - 2776
[16] Application of spectral subtraction method on enhancement of electrolarynx speech
Liu, Hanjun
Zhao, Qin
Wan, Mingxi
Wang, Supin
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2006, 120 (01): : 398 - 406
[17] Speech enhancement by spectral subtraction based on subspace decomposition
Murakami, T
Hoya, T
Ishida, Y
IEICE TRANSACTIONS ON FUNDAMENTALS OF ELECTRONICS COMMUNICATIONS AND COMPUTER SCIENCES, 2005, E88A (03): : 690 - 701
[18] Speech Enhancement Algorithm Based on Improved Spectral Subtraction
Gao, Liuyang
Guo, Yunfei
Li, Shaomei
Chen, Fucai
2009 IEEE INTERNATIONAL CONFERENCE ON INTELLIGENT COMPUTING AND INTELLIGENT SYSTEMS, PROCEEDINGS, VOL 3, 2009, : 140 - 143
[19] An Improved Spectral Subtraction Algorithm for Speech Enhancement System
Na, Shun
Li, Weixing
Liu, Yang
PROCEEDINGS OF THE 6TH INTERNATIONAL CONFERENCE ON INFORMATION ENGINEERING FOR MECHANICS AND MATERIALS, 2016, 97 : 318 - 323
[20] Method based on fractional spectral subtraction for speech enhancement
Institute of Communications Engineering, PLAUST, Nanjing 210007, China
Dianzi Yu Xinxi Xuebao, 2007, 5 (1096-1100): : 1096 - 1100

← 1 2 3 4 5 →