Enhancement and Noise Statistics Estimation for Non-Stationary Voiced Speech

Cited by: 8

Authors:

Norholm, Sidsel Marie ^{[1]}

Jensen, Jesper Rindom ^{[1]}

Christensen, Mads Graesboll ^{[1]}

Affiliations:

[1] Aalborg Univ, AD MT, Audio Anal Lab, Dept Architecture Design & Media Technol, DK-9000 Aalborg, Denmark

Source:

IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING | 2016 / Vol. 24 / Issue 04

Keywords:

Chirp model; harmonic signal model; non-stationary speech; speech enhancement; ALGORITHM; SIGNALS;

DOI:

10.1109/TASLP.2016.2514492

Chinese Library Classification number:

O42 [Acoustics];

Subject classification code:

070206 ; 082403 ;

Abstract:

In this paper, single-channel speech enhancement in the time domain is considered. We address the problem of modelling non-stationary speech by describing the voiced speech parts by a harmonic linear chirp model instead of using the traditional harmonic model. This means that the speech signal is not assumed stationary; instead, the fundamental frequency can vary linearly within each frame. The linearly constrained minimum variance (LCMV) filter and the amplitude and phase estimation (APES) filter are derived in this framework and compared to the harmonic versions of the same filters. It is shown through simulations on synthetic and speech signals that the chirp versions of the filters perform better than their harmonic counterparts in terms of output signal-to-noise ratio (SNR) and signal reduction factor. For synthetic signals, the output SNR for the harmonic chirp APES based filter is increased by 3 dB compared to the harmonic APES based filter at an input SNR of 10 dB, and at the same time the signal reduction factor is decreased. For speech signals, the increase is 1.5 dB along with a decrease in the signal reduction factor of 0.7. As an implicit part of the APES filter, a noise covariance matrix estimate is obtained. We suggest using this estimate in combination with other filters such as the Wiener filter. The performance of the Wiener filter and the LCMV filter is compared using the APES noise covariance matrix estimate and a power spectral density (PSD) based noise covariance matrix estimate. It is shown that the APES covariance matrix works well in combination with the Wiener filter, and the PSD based covariance matrix works well in combination with the LCMV filter.
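The harmonic linear chirp model named in the abstract can be illustrated with a minimal sketch. The abstract does not give the paper's exact notation, so the form below is an assumption: a real signal built as a sum of harmonics whose shared phase is `omega0*n + 0.5*chirp_rate*n^2`, so that the fundamental frequency changes linearly across the frame. The function names and parameters (`harmonic_chirp`, `instantaneous_frequency`, `omega0`, `chirp_rate`) are hypothetical and chosen only for illustration.

```python
import math


def harmonic_chirp(n_samples, omega0, chirp_rate, amplitudes, phases):
    """Synthesize an assumed harmonic linear chirp frame.

    x(n) = sum_l A_l * cos(l * (omega0*n + 0.5*chirp_rate*n^2) + phi_l)
    omega0 is in radians per sample, chirp_rate in radians per sample^2.
    Setting chirp_rate to 0 gives the traditional (stationary) harmonic model.
    """
    signal = []
    for n in range(n_samples):
        # Phase of the fundamental at sample n; quadratic in n for a linear chirp.
        theta = omega0 * n + 0.5 * chirp_rate * n * n
        sample = sum(
            a * math.cos(l * theta + p)
            for l, (a, p) in enumerate(zip(amplitudes, phases), start=1)
        )
        signal.append(sample)
    return signal


def instantaneous_frequency(n, omega0, chirp_rate, harmonic=1):
    """Derivative of the harmonic's phase: linear in n under this model."""
    return harmonic * (omega0 + chirp_rate * n)


if __name__ == "__main__":
    frame = harmonic_chirp(200, omega0=0.1, chirp_rate=2e-4,
                           amplitudes=[1.0, 0.5, 0.25], phases=[0.0, 0.0, 0.0])
    print(len(frame), instantaneous_frequency(199, 0.1, 2e-4))
```

With `chirp_rate=0` the sketch reduces to the stationary harmonic model that the abstract uses as its baseline; the filters themselves (LCMV, APES, Wiener) are not reproduced here.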


Pages: 645 - 658

Number of pages: 14

50 items in total

[21] An Algorithm of Single-Microphone Telephone Speech Enhancement in Non-Stationary Noise Environment
Yao, Yuan
Wang, Xia
Xue, Tao
[J]. 2012 INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS, NETWORKING AND MOBILE COMPUTING (WICOM), 2012,
[22] A Novel Expectation-Maximization Framework for Speech Enhancement in Non-Stationary Noise Environments
Lun, Daniel P. K.
Shen, Tak-Wai
Ho, K. C.
[J]. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2014, 22 (02) : 335 - 346
[23] Voicing detection based on adaptive aperiodicity thresholding for speech enhancement in non-stationary noise
[J]. 1600, Institution of Engineering and Technology, United States (08):
[24] Non-stationary noise estimation with adaptive filters
Bennis, RJM
Chu, QP
Mulder, JA
[J]. AIAA GUIDANCE, NAVIGATION, AND CONTROL CONFERENCE, VOLS 1-3: A COLLECTION OF TECHNICAL PAPERS, 1999, : 1769 - 1782
[25] Spectral estimation of non-stationary white noise
Allen, JC
Hobbs, SL
[J]. JOURNAL OF THE FRANKLIN INSTITUTE-ENGINEERING AND APPLIED MATHEMATICS, 1997, 334B (01): : 99 - 116
[26] A Non-iterative Kalman Filter for Single Channel Speech Enhancement in Non-stationary Noise Condition
Roy, Sujan Kumar
Paliwal, Kuldip K.
[J]. 2018 12TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING AND COMMUNICATION SYSTEMS (ICSPCS), 2018,
[27] FEATURE ENHANCEMENT BY BIDIRECTIONAL LSTM NETWORKS FOR CONVERSATIONAL SPEECH RECOGNITION IN HIGHLY NON-STATIONARY NOISE
Woellmer, Martin
Zhang, Zixing
Weninger, Felix
Schuller, Bjoern
Rigoll, Gerhard
[J]. 2013 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2013, : 6822 - 6826
[28] Consistent estimation of signal parameters in non-stationary noise
Friedmann, J.
Fishler, E.
Messer, H.
[J]. 2000, IEEE, Los Alamitos, CA, United States : 225 - 228
[29] Consistent estimation of signal parameters in non-stationary noise
Friedmann, J
Fishler, E
Messer, H
[J]. PROCEEDINGS OF THE TENTH IEEE WORKSHOP ON STATISTICAL SIGNAL AND ARRAY PROCESSING, 2000, : 225 - 228
[30] Enhancement of Non-Stationary Speech using Harmonic Chirp Filters
Norholm, Sidsel Marie
Jensen, Jesper Rindom
Christensen, Mads Graesboll
[J]. 16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 1755 - 1759
