Enhancement and Noise Statistics Estimation for Non-Stationary Voiced Speech

被引:8
|
作者
Norholm, Sidsel Marie [1 ]
Jensen, Jesper Rindom [1 ]
Christensen, Mads Grsboll [1 ]
机构
[1] Aalborg Univ, AD MT, Audio Anal Lab, Dept Architecture Design & Media Technol, DK-9000 Aalborg, Denmark
关键词
Chirp model; harmonic signal model; non-stationary speech; speech enhancement; ALGORITHM; SIGNALS;
D O I
10.1109/TASLP.2016.2514492
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
In this paper, single channel speech enhancement in the time domain is considered. We address the problem of modelling non-stationary speech by describing the voiced speech parts by a harmonic linear chirp model instead of using the traditional harmonic model. This means that the speech signal is not assumed stationary, instead the fundamental frequency can vary linearly within each frame. The linearly constrained minimum variance (LCMV) filter and the amplitude and phase estimation (APES) filter are derived in this framework and compared to the harmonic versions of the same filters. It is shown through simulations on synthetic and speech signals, that the chirp versions of the filters perform better than their harmonic counterparts in terms of output signal-to-noise ratio (SNR) and signal reduction factor. For synthetic signals, the output SNR for the harmonic chirp APES based filter is increased 3 dB compared to the harmonic APES based filter at an input SNR of 10 dB, and at the same time the signal reduction factor is decreased. For speech signals, the increase is 1.5 dB along with a decrease in the signal reduction factor of 0.7. As an implicit part of the APES filter, a noise covariance matrix estimate is obtained. We suggest using this estimate in combination with other filters such as the Wiener filter. The performance of the Wiener filter and LCMV filter are compared using the APES noise covariance matrix estimate and a power spectral density (PSD) based noise covariance matrix estimate. It is shown that the APES covariance matrix works well in combination with the Wiener filter, and the PSD based covariance matrix works well in combination with the LCMV filter.
引用
收藏
页码:645 / 658
页数:14
相关论文
共 50 条
  • [21] An Algorithm of Single-Microphone Telephone Speech Enhancement in Non-Stationary Noise Environment
    Yao, Yuan
    Wang, Xia
    Xue, Tao
    [J]. 2012 INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS, NETWORKING AND MOBILE COMPUTING (WICOM), 2012,
  • [22] A Novel Expectation-Maximization Framework for Speech Enhancement in Non-Stationary Noise Environments
    Lun, Daniel P. K.
    Shen, Tak-Wai
    Ho, K. C.
    [J]. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2014, 22 (02) : 335 - 346
  • [23] Voicing detection based on adaptive aperiodicity thresholding for speech enhancement in non-stationary noise
    [J]. 1600, Institution of Engineering and Technology, United States (08):
  • [24] Non-stationary noise estimation with adaptive filters
    Bennis, RJM
    Chu, QP
    Mulder, JA
    [J]. AIAA GUIDANCE, NAVIGATION, AND CONTROL CONFERENCE, VOLS 1-3: A COLLECTION OF TECHNICAL PAPERS, 1999, : 1769 - 1782
  • [25] Spectral estimation of non-stationary white noise
    Allen, JC
    Hobbs, SL
    [J]. JOURNAL OF THE FRANKLIN INSTITUTE-ENGINEERING AND APPLIED MATHEMATICS, 1997, 334B (01): : 99 - 116
  • [26] A Non-iterative Kalman Filter for Single Channel Speech Enhancement in Non-stationary Noise Condition
    Roy, Sujan Kumar
    Paliwal, Kuldip K.
    [J]. 2018 12TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING AND COMMUNICATION SYSTEMS (ICSPCS), 2018,
  • [27] FEATURE ENHANCEMENT BY BIDIRECTIONAL LSTM NETWORKS FOR CONVERSATIONAL SPEECH RECOGNITION IN HIGHLY NON-STATIONARY NOISE
    Woellmer, Martin
    Zhang, Zixing
    Weninger, Felix
    Schuller, Bjoern
    Rigoll, Gerhard
    [J]. 2013 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2013, : 6822 - 6826
  • [28] Consistent estimation of signal parameters in non-stationary noise
    Friedmann, J.
    Fishler, E.
    Messer, H.
    [J]. 2000, IEEE, Los Alamitos, CA, United States : 225 - 228
  • [29] Consistent estimation of signal parameters in non-stationary noise
    Friedmann, J
    Fishler, E
    Messer, H
    [J]. PROCEEDINGS OF THE TENTH IEEE WORKSHOP ON STATISTICAL SIGNAL AND ARRAY PROCESSING, 2000, : 225 - 228
  • [30] Enhancement of Non-Stationary Speech using Harmonic Chirp Filters
    Norholm, Sidsel Marie
    Jensen, Jesper Rindom
    Christensen, Mads Graesboll
    [J]. 16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 1755 - 1759