PHASE-SENSITIVE REAL-TIME CAPABLE SPEECH ENHANCEMENT UNDER VOICED-UNVOICED UNCERTAINTY

被引：0

作者：

Krawczyk, Martin ^{[1
]}

Rehr, Robert ^{[1
]}

Gerkmann, Timo ^{[1
]}

机构：

[1] Carl von Ossietzky Univ Oldenburg, Dept Med Phys & Acoust, Speech Signal Proc Grp, D-26111 Oldenburg, Germany

来源：

2013 PROCEEDINGS OF THE 21ST EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO) | 2013年

关键词：

speech enhancement; noise reduction; phase estimation; amplitude estimation; SPECTRAL MAGNITUDE ESTIMATION;

D O I：

暂无

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

In many short-time Fourier transform (STFT)-based single channel speech enhancement algorithms, the clean speech spectral amplitude is estimated from a noisy observation to suppress additive noise. For the estimation, only the noisy amplitudes and functions thereof, like the a priori or a posteriori signal-to-noise ratio (SNR), are utilized. Information about the clean speech spectral phase is mostly not employed. In this work we present a comprehensive speech enhancement setup that combines phase-sensitive and phase-insensitive amplitude estimation, improving the perceptual speech quality of the enhanced signal in terms of PESQ compared to phase-insensitive amplitude estimation alone. The proposed algorithm is real-time capable in the sense that it is implemented in a causal block-wise manner and the computational complexity is feasible.

引用

页数：5

共 50 条

[1] Speech enhancement based on a voiced-unvoiced speech model
Goh, Z
Tan, KC
Tan, BTG
PROCEEDINGS OF THE 1998 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-6, 1998, : 401 - 404
[2] Global Soft Decision Based Speech Enhancement Using Voiced-Unvoiced Uncertainty and Harmonic Phase Decomposition Technique
Samui, Suman
Chakrabarti, Indrajit
Ghosh, Soumya Kanti
2016 INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING AND COMMUNICATIONS (SPCOM), 2016,
[3] Kalman-filtering speech enhancement method based on a voiced-unvoiced speech model
Goh, Z
Tan, KC
Tan, BTG
IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 1999, 7 (05): : 510 - 524
[4] Speech Enhancement Using Modified MMSE-LSA and Phase Reconstruction in Voiced and Unvoiced Speech
Jia, Hairong
Wang, Weimei
Wang, Dong
Zhang, Xueying
INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2019, 33 (02)
[5] A Collelogram based Pitch and Voiced/Unvoiced Classification Method for Real-Time Speech Analysis in Noisy Environment
Hamid, Md Ekramul
Molla, Md. Khademul Islam
2017 4TH ASIA-PACIFIC WORLD CONGRESS ON COMPUTER SCIENCE AND ENGINEERING (APWCONCSE 2017), 2017, : 93 - 98
[6] Real-time pitch extraction of voiced speech
George, DE
Salari, E
JOURNAL OF NETWORK AND COMPUTER APPLICATIONS, 1997, 20 (04) : 379 - 387
[7] Real-time pitch extraction of voiced speech
Dept of Physics and Astronomy, University of Toledo, Toledo, OH 43606-3390, United States
不详
J Network Comput Appl, 4 (379-387):
[8] Phase-sensitive Speech Enhancement for Cochlear Implant Processing
Jafari, Pourya S.
Kang, Hou-Yong
Wang, Xiaosong
Fu, Qian-Jie
Jiang, Hui
2011 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2011, : 5104 - 5107
[9] Mask estimation incorporating phase-sensitive information for speech enhancement
Wang, Xianyun
Bao, Changchun
APPLIED ACOUSTICS, 2019, 156 : 101 - 112
[10] Improved Semi-Supervised NMF Based Real-Time Capable Speech Enhancement
Hu, Yonggang
Zhang, Xiongwei
Zou, Xia
Sun, Meng
Min, Gang
Li, Yinan
IEICE TRANSACTIONS ON FUNDAMENTALS OF ELECTRONICS COMMUNICATIONS AND COMPUTER SCIENCES, 2016, E99A (01) : 402 - 406

← 1 2 3 4 5 →