Speech enhancement based on short-time spectral amplitude estimation with two-channel beamformer

被引:0
|
作者
Tohoku Univ, Sendai-shi, Japan [1 ]
机构
来源
IEICE Trans Fund Electron Commun Comput Sci | / 12卷 / 2151-2158期
关键词
Acoustic noise - Estimation - Microphones - Spectrum analysis - Transfer functions;
D O I
暂无
中图分类号
学科分类号
摘要
In this paper, a new spectral subtraction technique with two microphone inputs is proposed. In conventional spectral subtraction using a single microphone, the averaged noise spectrum is subtracted from the observed short-time input spectrum. This results in reduction of mean value of noise spectrum only, the component varying around the mean value remaining intact. In the method proposed in this paper, the short-time noise spectrum excluding the speech component is estimated by introducing the blocking matrix used in the Griffiths-Jim-type adaptive beamformer with two microphone inputs, combined with the spectral compensation technique. By subtracting the estimated short-time noise spectrum from the input spectrum, not only the mean value of the noise spectrum but also the component varying around the mean value can be reduced. This method can be interpreted as a 'partial' construction of the adaptive beamformer where only the amplitude of the short-time noise spectrum is estimated, while the adaptive beamformer is equivalent to the estimator of the complex short-time noise spectrum. By limiting the estimation to the amplitude spectrum, the proposed system achieves better performance than the adaptive beamformer in the case when the number of sound sources exceeds the number of microphones.
引用
收藏
相关论文
共 50 条
  • [31] Speech Source Separation and Noise Reduction using a MMSE Short-Time Spectral Amplitude Estimator
    Imsiya, K. A.
    Nandana, B. T.
    2015 INTERNATIONAL CONFERENCE ON INNOVATIONS IN INFORMATION, EMBEDDED AND COMMUNICATION SYSTEMS (ICIIECS), 2015,
  • [32] A Speech Enhancement Method by Coupling Speech Detection and Spectral Amplitude Estimation
    Deng, Feng
    Bao, Chang-Chun
    Bao, Feng
    14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5, 2013, : 3233 - 3237
  • [33] BAYESIAN SPECTRAL AMPLITUDE ESTIMATION FOR SPEECH ENHANCEMENT WITH CORRELATED SPECTRAL COMPONENTS
    Plourde, Eric
    Champagne, Benoit
    2009 IEEE/SP 15TH WORKSHOP ON STATISTICAL SIGNAL PROCESSING, VOLS 1 AND 2, 2009, : 397 - 400
  • [34] THEORETICAL ANALYSIS OF BIASED MMSE SHORT-TIME SPECTRAL AMPLITUDE ESTIMATOR AND ITS EXTENSION TO MUSICAL-NOISE-FREE SPEECH ENHANCEMENT
    Nakai, Shunsuke
    Saruwatari, Hiroshi
    Miyazaki, Ryoichi
    Nakamura, Satoshi
    Kondo, Kazunobu
    2014 4TH JOINT WORKSHOP ON HANDS-FREE SPEECH COMMUNICATION AND MICROPHONE ARRAYS (HSCMA), 2014, : 122 - 126
  • [35] Speech enhancement based on a microphone array and log-spectral amplitude estimation
    Cohen, I
    Berdugo, B
    22ND CONVENTION OF ELECTRICAL AND ELECTRONICS ENGINEERS IN ISRAEL, PROCEEDINGS, 2002, : 4 - 6
  • [36] Speech enhancement using MMSE short time spectral estimation with gamma distributed speech priors
    Martin, R
    2002 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-IV, PROCEEDINGS, 2002, : 253 - 256
  • [37] Generalized maximum a posteriori spectral amplitude estimation for speech enhancement
    Tsao, Yu
    Lai, Ying-Hui
    SPEECH COMMUNICATION, 2016, 76 : 112 - 126
  • [38] Multichannel speech enhancement using Bayesian spectral amplitude estimation
    Lotter, T
    Benien, C
    Vary, P
    2003 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL I, PROCEEDINGS: SPEECH PROCESSING I, 2003, : 880 - 883
  • [39] Adaptive short-time analysis-synthesis for speech enhancement
    Rudoy, Daniel
    Basu, Prabahan
    Quatieri, Thomas E.
    Dunn, Bob
    Wolfe, Patrick J.
    2008 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-12, 2008, : 4905 - +
  • [40] A Two-Channel Noise Estimator for Speech Enhancement in a Highly Nonstationary Environment
    Choi, Min-Seok
    Kang, Hong-Goo
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2011, 19 (04): : 905 - 915