Time-frequency methods for enhancing speech

被引:3
|
作者
Kenny, OP
Nelson, DJ
机构
关键词
speech enhancement; image enhancement; Wiener filtering; singular value decomposition; prolate-spheroidal filter; Hermite polynomials; time-frequency distribution;
D O I
10.1117/12.284192
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Speech signals have the property that they are broad-band white conveying information at a very low rate. The resulting signal has a time-frequency representation which is redundant and slowly varying in both time and frequency. In this paper, a new method for separating speech from noise and interference is presented. This new method uses image enhancement techniques applied to time-frequency representations of the corrupted speech signal. The image enhancement techniques are based on the assumption that speech and/or the noise and interference may be locally represented as a mixture of two-dimensional Gaussian distributions. The signal surface is expanded using a Hermite polynomial expansion and the signal surface is separated from the noise surface by a principal-component process. A Wiener gain surface is calculated from the enhanced image, and the enhanced signal is reconstructed from the Wiener gain surface using a time varying filter constructed from a basis of prolate-spheroidal filters.
引用
收藏
页码:48 / 57
页数:10
相关论文
共 50 条
  • [1] Enhancing Speech Quality Using Spectral Subtraction and Time-Frequency Filtering
    Nagaraja, B. G.
    Yadava, G. Thimmaraja
    Patil, C. M.
    [J]. ADVANCED NETWORK TECHNOLOGIES AND INTELLIGENT COMPUTING, ANTIC 2023, PT III, 2024, 2092 : 259 - 272
  • [2] Tuning Time-Frequency methods for the detection of metered HF speech
    Nelson, DJ
    Smith, LH
    [J]. ADVANCED SIGNAL PROCESSING ALGORITHMS, ARCHITECTURES, AND IMPLEMENTATIONS XII, 2002, 4791 : 71 - 79
  • [3] Isolate Speech Recognition Based on Time-Frequency Analysis Methods
    Mantilla-Caeiros, Alfredo
    Nakano Miyatake, Mariko
    Perez-Meana, Hector
    [J]. PROGRESS IN PATTERN RECOGNITION, IMAGE ANALYSIS, COMPUTER VISION, AND APPLICATIONS, PROCEEDINGS, 2009, 5856 : 297 - +
  • [4] On time-frequency masking in voiced speech
    Skoglund, J
    Kleijn, WB
    [J]. IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2000, 8 (04): : 361 - 369
  • [5] Time-frequency representations in speech perception
    Gomez-Vilda, Pedro
    Ferrandez-Vicente, Jose M.
    Rodellar-Biarge, Victoria
    Fernandez-Baillo, Roberto
    [J]. NEUROCOMPUTING, 2009, 72 (4-6) : 820 - 830
  • [6] TIME-FREQUENCY METHODS IN ACOUSTICS
    FLANDRIN, P
    SESSAREGO, JP
    [J]. JOURNAL DE PHYSIQUE, 1990, 51 (C2): : 707 - 716
  • [7] Watermarking of speech signals in the time-frequency domain
    Al-Khassaweneh, Mahmood
    Al-Zoubi, Hussein
    Aviyente, Selin
    [J]. 2009 IEEE INTERNATIONAL CONFERENCE ON ELECTRO/INFORMATION TECHNOLOGY, 2009, : 317 - +
  • [8] Segmentation on time-frequency domain for speech segregation
    Lim, Sung-Kil
    Lee, Hyon-Soo
    [J]. 2006 INTERNATIONAL SYMPOSIUM ON INTELLIGENT SIGNAL PROCESSING AND COMMUNICATIONS, VOLS 1 AND 2, 2006, : 433 - +
  • [9] Time-frequency distributions for automatic speech recognition
    Potamianos, A
    Maragos, P
    [J]. IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2001, 9 (03): : 196 - 200
  • [10] TIME-FREQUENCY ATTENTION FOR MONAURAL SPEECH ENHANCEMENT
    Zhang, Qiquan
    Song, Qi
    Ni, Zhaoheng
    Nicolson, Aaron
    Li, Haizhou
    [J]. 2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 7852 - 7856