Psychoacoustic model-driven spectral subtraction for monaural speech enhancement

被引:0
|
作者
Upadhyay N. [1 ]
机构
[1] Department of Electronics and Communication Engineering, The LNM Institute of Information Technology, Jaipur
关键词
Adaptive noise estimation; Monaural speech enhancement; Psychoacoustic model; Spectral subtraction;
D O I
10.1007/s10772-023-10062-9
中图分类号
学科分类号
摘要
In this paper, we investigate a psychoacoustic model-driven spectral subtraction framework for enhancement of noisy speech. In the proposed framework, the noisy speech spectrum is separated into six distinct and unevenly frequency-spaced subbands as per the psychoacoustic model of the human hearing system, and spectral over-subtraction is applied independently in each subband. The noise in each subband is estimated using an adaptive noise estimator that does not require a speech pause tracker. To compute and update the noise, the noisy speech power is adaptively smoothed using a smoothing factor controlled by a posterior SNR. The performance of the proposed framework is evaluated using SNR, segmental SNR (SegSNR), and PESQ scores for a variety of non-stationary and stationary noise environments at varying SNR levels. The experimental results show that the proposed framework outperforms various up-to-date speech enhancement technologies on three extensively used objective metrics assessments and speech spectrograms. © 2023, The Author(s), under exclusive licence to Springer Science+Business Media, LLC, part of Springer Nature.
引用
收藏
页码:963 / 979
页数:16
相关论文
共 50 条
  • [11] Modified Magnitude Spectral Subtraction Methods for Speech Enhancement
    Naik, D. C.
    Murthy, A. Sreenivasa
    Nuthakki, Ramesh
    2017 INTERNATIONAL CONFERENCE ON ELECTRICAL, ELECTRONICS, COMMUNICATION, COMPUTER, AND OPTIMIZATION TECHNIQUES (ICEECCOT), 2017, : 274 - 279
  • [12] Speech Enhancement Based on a Modified Spectral Subtraction Method
    Islam, Md. T.
    Shahnaz, C.
    Fattah, S. A.
    2014 IEEE 57TH INTERNATIONAL MIDWEST SYMPOSIUM ON CIRCUITS AND SYSTEMS (MWSCAS), 2014, : 1085 - 1088
  • [13] Speech enhancement using spectral subtraction with wavelet transform
    Nishimura, R
    Asano, F
    Suzuki, Y
    Sone, T
    ELECTRONICS AND COMMUNICATIONS IN JAPAN PART III-FUNDAMENTAL ELECTRONIC SCIENCE, 1998, 81 (01): : 24 - 31
  • [14] An Iterative Graph Spectral Subtraction Method for Speech Enhancement
    Yan, Xue
    Yang, Zhen
    Wang, Tingting
    Guo, Haiyan
    SPEECH COMMUNICATION, 2020, 123 : 35 - 42
  • [15] Adaptive β-order generalized spectral subtraction for speech enhancement
    Li, Junfeng
    Sakamoto, Shuichi
    Hongo, Satoshi
    Akagi, Masato
    Suzuki, Yoiti
    SIGNAL PROCESSING, 2008, 88 (11) : 2764 - 2776
  • [16] Application of spectral subtraction method on enhancement of electrolarynx speech
    Liu, Hanjun
    Zhao, Qin
    Wan, Mingxi
    Wang, Supin
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2006, 120 (01): : 398 - 406
  • [17] Speech enhancement by spectral subtraction based on subspace decomposition
    Murakami, T
    Hoya, T
    Ishida, Y
    IEICE TRANSACTIONS ON FUNDAMENTALS OF ELECTRONICS COMMUNICATIONS AND COMPUTER SCIENCES, 2005, E88A (03): : 690 - 701
  • [18] Speech Enhancement Algorithm Based on Improved Spectral Subtraction
    Gao, Liuyang
    Guo, Yunfei
    Li, Shaomei
    Chen, Fucai
    2009 IEEE INTERNATIONAL CONFERENCE ON INTELLIGENT COMPUTING AND INTELLIGENT SYSTEMS, PROCEEDINGS, VOL 3, 2009, : 140 - 143
  • [19] An Improved Spectral Subtraction Algorithm for Speech Enhancement System
    Na, Shun
    Li, Weixing
    Liu, Yang
    PROCEEDINGS OF THE 6TH INTERNATIONAL CONFERENCE ON INFORMATION ENGINEERING FOR MECHANICS AND MATERIALS, 2016, 97 : 318 - 323
  • [20] Method based on fractional spectral subtraction for speech enhancement
    Institute of Communications Engineering, PLAUST, Nanjing 210007, China
    Dianzi Yu Xinxi Xuebao, 2007, 5 (1096-1100): : 1096 - 1100