PHASE-SENSITIVE REAL-TIME CAPABLE SPEECH ENHANCEMENT UNDER VOICED-UNVOICED UNCERTAINTY

被引:0
|
作者
Krawczyk, Martin [1 ]
Rehr, Robert [1 ]
Gerkmann, Timo [1 ]
机构
[1] Carl von Ossietzky Univ Oldenburg, Dept Med Phys & Acoust, Speech Signal Proc Grp, D-26111 Oldenburg, Germany
关键词
speech enhancement; noise reduction; phase estimation; amplitude estimation; SPECTRAL MAGNITUDE ESTIMATION;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
In many short-time Fourier transform (STFT)-based single channel speech enhancement algorithms, the clean speech spectral amplitude is estimated from a noisy observation to suppress additive noise. For the estimation, only the noisy amplitudes and functions thereof, like the a priori or a posteriori signal-to-noise ratio (SNR), are utilized. Information about the clean speech spectral phase is mostly not employed. In this work we present a comprehensive speech enhancement setup that combines phase-sensitive and phase-insensitive amplitude estimation, improving the perceptual speech quality of the enhanced signal in terms of PESQ compared to phase-insensitive amplitude estimation alone. The proposed algorithm is real-time capable in the sense that it is implemented in a causal block-wise manner and the computational complexity is feasible.
引用
收藏
页数:5
相关论文
共 50 条
  • [1] Speech enhancement based on a voiced-unvoiced speech model
    Goh, Z
    Tan, KC
    Tan, BTG
    PROCEEDINGS OF THE 1998 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-6, 1998, : 401 - 404
  • [2] Global Soft Decision Based Speech Enhancement Using Voiced-Unvoiced Uncertainty and Harmonic Phase Decomposition Technique
    Samui, Suman
    Chakrabarti, Indrajit
    Ghosh, Soumya Kanti
    2016 INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING AND COMMUNICATIONS (SPCOM), 2016,
  • [3] Kalman-filtering speech enhancement method based on a voiced-unvoiced speech model
    Goh, Z
    Tan, KC
    Tan, BTG
    IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 1999, 7 (05): : 510 - 524
  • [4] Speech Enhancement Using Modified MMSE-LSA and Phase Reconstruction in Voiced and Unvoiced Speech
    Jia, Hairong
    Wang, Weimei
    Wang, Dong
    Zhang, Xueying
    INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2019, 33 (02)
  • [5] A Collelogram based Pitch and Voiced/Unvoiced Classification Method for Real-Time Speech Analysis in Noisy Environment
    Hamid, Md Ekramul
    Molla, Md. Khademul Islam
    2017 4TH ASIA-PACIFIC WORLD CONGRESS ON COMPUTER SCIENCE AND ENGINEERING (APWCONCSE 2017), 2017, : 93 - 98
  • [6] Real-time pitch extraction of voiced speech
    George, DE
    Salari, E
    JOURNAL OF NETWORK AND COMPUTER APPLICATIONS, 1997, 20 (04) : 379 - 387
  • [7] Real-time pitch extraction of voiced speech
    Dept of Physics and Astronomy, University of Toledo, Toledo, OH 43606-3390, United States
    不详
    J Network Comput Appl, 4 (379-387):
  • [8] Phase-sensitive Speech Enhancement for Cochlear Implant Processing
    Jafari, Pourya S.
    Kang, Hou-Yong
    Wang, Xiaosong
    Fu, Qian-Jie
    Jiang, Hui
    2011 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2011, : 5104 - 5107
  • [9] Mask estimation incorporating phase-sensitive information for speech enhancement
    Wang, Xianyun
    Bao, Changchun
    APPLIED ACOUSTICS, 2019, 156 : 101 - 112
  • [10] Improved Semi-Supervised NMF Based Real-Time Capable Speech Enhancement
    Hu, Yonggang
    Zhang, Xiongwei
    Zou, Xia
    Sun, Meng
    Min, Gang
    Li, Yinan
    IEICE TRANSACTIONS ON FUNDAMENTALS OF ELECTRONICS COMMUNICATIONS AND COMPUTER SCIENCES, 2016, E99A (01) : 402 - 406