BLIND SPEECH SEPARATION EMPLOYING DIRECTIONAL STATISTICS IN AN EXPECTATION MAXIMIZATION FRAMEWORK

被引:0
|
作者
Dang Hai Tran Vu [1 ]
Haeb-Umbach, Reinhold [1 ]
机构
[1] Univ Gesamthsch Paderborn, Dept Commun Engn, D-33098 Paderborn, Germany
关键词
Noisy Source Separation; Sparse Signal Separation; EM-Algorithm; Directional Statistics; Speech Enhancement;
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
In this paper we propose to employ directional statistics in a complex vector space to approach the problem of blind speech separation in the presence of spatially correlated noise. We interpret the values of the short time Fourier transform of the microphone signals to be draws from a mixture of complex Watson distributions, a probabilistic model which naturally accounts for spatial aliasing. The parameters of the density are related to the a priori source probabilities, the power of the sources and the transfer function ratios from sources to sensors. Estimation formulas are derived for these parameters by employing the Expectation Maximization (EM) algorithm. The E-step corresponds to the estimation of the source presence probabilities for each time-frequency bin, while the M-step leads to a maximum signal-to-noise ratio (MaxSNR) beamformer in the presence of uncertainty about the source activity. Experimental results are reported for an implementation in a generalized sidelobe canceller (GSC) like spatial beamforming configuration for 3 speech sources with significant coherent noise in reverberant environments, demonstrating the usefulness of the novel modeling framework.
引用
收藏
页码:241 / 244
页数:4
相关论文
共 50 条
  • [31] Effect of Central Limit Theorem non-compliance on blind separation of speech by negentropy maximization
    Prasad, Rajkishore
    Saruwatari, Hiroshi
    Shikano, Kiyohiro
    ACOUSTICAL SCIENCE AND TECHNOLOGY, 2005, 26 (06) : 511 - 522
  • [32] Blind Speech Separation in Convolutive Mixtures Using Non-Gaussianity Maximization and Inverse Filters
    Nam Vuong-Hoang
    Trung Nguyen-Quoc
    Linh Tran-Hoai
    2010 THIRD INTERNATIONAL CONFERENCE ON COMMUNICATIONS AND ELECTRONICS (ICCE), 2010, : 190 - 194
  • [33] A recursive expectation-maximization algorithm for speaker tracking and separation
    Ofer Schwartz
    Sharon Gannot
    EURASIP Journal on Audio, Speech, and Music Processing, 2021
  • [34] A recursive expectation-maximization algorithm for speaker tracking and separation
    Schwartz, Ofer
    Gannot, Sharon
    EURASIP JOURNAL ON AUDIO SPEECH AND MUSIC PROCESSING, 2021, 2021 (01)
  • [35] A Comparative Study of Blind Speech Separation Using Subspace Methods and Higher Order Statistics
    Benabderrahmane, Yasmina
    Selouani, Sid Ahmed
    O'Shaughnessy, Douglas
    Hamam, Habib
    SIGNAL PROCESSING, IMAGE PROCESSING, AND PATTERN RECOGNITION, 2009, 61 : 117 - +
  • [36] Blind image separation through kurtosis maximization
    Chen, N
    De Leon, P
    CONFERENCE RECORD OF THE THIRTY-FIFTH ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS AND COMPUTERS, VOLS 1 AND 2, 2001, : 318 - 322
  • [37] BLIND SIGNAL SEPARATION BY ENTROPY MAXIMIZATION (INFOMAX)
    Jin, Qinggui
    Wang, Guirong
    Liu, Yuancheng
    2010 6TH INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS NETWORKING AND MOBILE COMPUTING (WICOM), 2010,
  • [38] An Efficient Hybrid Threshold for Image Deconvolution in Expectation Maximization Framework
    Ravi Pratap Singh
    Manoj Kumar Singh
    Circuits, Systems, and Signal Processing, 2025, 44 (3) : 1938 - 1982
  • [39] Computationally Efficient and Versatile Framework for Joint Optimization of Blind Speech Separation and Dereverberation
    Nakatani, Tomohiro
    Ikeshita, Rintaro
    Kinoshita, Keisuke
    Sawada, Hiroshi
    Araki, Shoko
    INTERSPEECH 2020, 2020, : 91 - 95
  • [40] Self-supervised Blind Motion Deblurring with Deep Expectation Maximization
    Li, Ji
    Wang, Weixi
    Nan, Yuesong
    Ji, Hui
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 13986 - 13996