Enhancing Speech Quality Using Spectral Subtraction and Time-Frequency Filtering

被引:0
|
作者
Nagaraja, B. G. [1 ]
Yadava, G. Thimmaraja [2 ]
Patil, C. M. [1 ]
机构
[1] Vidyavardhaka Coll Engn, Mysuru, Karnataka, India
[2] Nitte Meenakshi Inst Technol, Bengaluru, Karnataka, India
关键词
Spectral subtraction; time-frequency; PESQ; SNR; NCM; SPEAKER RECOGNITION; MODELING TECHNIQUES; FEATURE-EXTRACTION; ENHANCEMENT;
D O I
10.1007/978-3-031-64070-4_16
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Speech enhancement plays a vital role in almost all speech processing applications, where spectral subtraction based on voice activity detection (SS-VAD) is most commonly used. SS-VAD works well for moderate background noise conditions but deteriorates for low signal-to-noise-ratio (SNR) signal due to resulting residual noise consisting of musical tones. To address this issue, we introduce a novel approach called SS-time-frequency (SS-TF) filtering for enhancing speech quality. In this innovative method, we replace the conventional residual noise reduction technique with time-frequency (TF) filtering to effectively reduce additive noise. According to the perceptual evaluation of speech quality (PESQ), mean SNR and normalized covariance metric (NCM), the proposed approach outperforms SS-VAD for low SNR conditions. Further, estimation of average memory consumption and execution time has been done for both SS-VAD and the proposed methods.
引用
收藏
页码:259 / 272
页数:14
相关论文
共 50 条
  • [1] Spectral subtraction in the time-frequency domain using wavelet packets
    Bodin, P
    Villemoes, LF
    [J]. 1997 IEEE WORKSHOP ON SPEECH CODING FOR TELECOMMUNICATIONS, PROCEEDINGS: BACK TO BASICS: ATTACKING FUNDAMENTAL PROBLEMS IN SPEECH CODING, 1997, : 47 - 48
  • [2] Time-frequency methods for enhancing speech
    Kenny, OP
    Nelson, DJ
    [J]. ADVANCED SIGNAL PROCESSING: ALGORITHMS, ARCHITECTURES, AND IMPLEMENTATIONS VII, 1997, 3162 : 48 - 57
  • [3] Speech activity detection using time-frequency auditory spectral pattern
    Mondal, Sujoy
    Das Barman, Abhirup
    [J]. APPLIED ACOUSTICS, 2020, 167
  • [4] Denoising Method Based on Spectral Subtraction in Time-Frequency Domain
    Hao, Lei
    Cao, Shuai
    Zhou, Pengfei
    Chen, Lei
    Zhang, Yi
    Li, Kai
    Xie, Dongdong
    Geng, Yijun
    [J]. ADVANCES IN CIVIL ENGINEERING, 2021, 2021
  • [5] A time-frequency modulation model of speech quality
    Kates, James M.
    Arehart, Kathryn H.
    [J]. 2007 IEEE WORKSHOP ON APPLICATIONS OF SIGNAL PROCESSING TO AUDIO AND ACOUSTICS, 2007, : 61 - +
  • [6] A. generalized perceptual time-frequency subtraction method for speech enhancement
    Shao, Yu
    Chang, Chip-Hong
    [J]. 2006 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS, VOLS 1-11, PROCEEDINGS, 2006, : 2537 - +
  • [7] Noise reduction in biomedical speech signal processing based on time and frequency Kalman filtering combined with spectral subtraction
    Aureliano da Silva, Leandro
    Basilio Joaquim, Marcelo
    [J]. COMPUTERS & ELECTRICAL ENGINEERING, 2008, 34 (02) : 154 - 164
  • [8] Speech endpoint detection based on speech time-frequency enhancement and spectral entropy
    Fan Yingle
    Li Yi
    Wu Chuanyan
    [J]. 2005 27TH ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY, VOLS 1-7, 2005, : 4682 - 4684
  • [9] Time-frequency plane Wiener filtering for robust processing of speech signals
    Ang, A
    Ang, EL
    Premkumar, AB
    Madhukumar, AS
    [J]. IEEE TENCON'97 - IEEE REGIONAL 10 ANNUAL CONFERENCE, PROCEEDINGS, VOLS 1 AND 2: SPEECH AND IMAGE TECHNOLOGIES FOR COMPUTING AND TELECOMMUNICATIONS, 1997, : 35 - 38
  • [10] Enhancing Time-Frequency Concentration and Accuracy Using an Improved Time-Frequency Synchrosqueezing Transform
    Yang, Yaocheng
    Zhang, Jialiang
    Li, Yifan
    Ni, Qing
    Wang, Biao
    [J]. IEEE Sensors Journal, 2024, 24 (23) : 39334 - 39343