ALPHA-STABLE MULTICHANNEL AUDIO SOURCE SEPARATION

被引:0
|
作者
Leglaive, Simon [1 ]
Simsekli, Umut [1 ]
Liutkus, Antoine [2 ]
Badeau, Roland [1 ]
Richard, Gael [1 ]
机构
[1] Univ Paris Saclay, LTCI, Telecom ParisTech, F-75013 Paris, France
[2] INRIA, Speech Proc Team, Villers Les Nancy, France
关键词
Alpha-stable distributions; Multichannel source separation; Informed source separation; Monte Carlo Expectation-Maximization; NONNEGATIVE MATRIX FACTORIZATION;
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
In this paper, we focus on modeling multichannel audio signals in the short-time Fourier transform domain for the purpose of source separation. We propose a probabilistic model based on a class of heavy-tailed distributions, in which the observed mixtures and the latent sources are jointly modeled by using a certain class of multivariate alpha-stable distributions. As opposed to the conventional Gaussian models, where the observations are constrained to lie just within a few standard deviations from the mean, the proposed heavy-tailed model allows us to account for spurious data or important uncertainties in the model. We develop a Monte Carlo Expectation-Maximization algorithm for inferring the sources from the proposed model. We show that our approach leads to significant performance improvements in audio source separation under corrupted mixtures and in spatial audio object coding.
引用
收藏
页码:576 / 580
页数:5
相关论文
共 50 条
  • [1] Scalable Source Localization with Multichannel Alpha-Stable Distributions
    Fontaine, Mathieu
    Vanwynsberghe, Charles
    Liutkus, Antoine
    Badeau, Roland
    [J]. 2017 25TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2017, : 11 - 15
  • [2] BLIND SOURCE SEPARATION USING MIXTURES OF ALPHA-STABLE DISTRIBUTIONS
    Keriven, Nicolas
    Deleforge, Antoine
    Liutkus, Antoine
    [J]. 2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2018, : 771 - 775
  • [3] Separation of alpha-stable random vectors
    Fontaine, Mathieu
    Badeau, Roland
    Liutkus, Antoine
    [J]. SIGNAL PROCESSING, 2020, 170
  • [4] Multichannel Audio Source Separation With Probabilistic Reverberation Priors
    Leglaive, Simon
    Badeau, Roland
    Richard, Gael
    [J]. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2016, 24 (12) : 2453 - 2465
  • [5] MULTICHANNEL AUDIO SOURCE SEPARATION WITH PROBABILISTIC REVERBERATION MODELING
    Leglaive, Simon
    Badeau, Roland
    Richard, Gael
    [J]. 2015 IEEE WORKSHOP ON APPLICATIONS OF SIGNAL PROCESSING TO AUDIO AND ACOUSTICS (WASPAA), 2015,
  • [6] Multichannel Audio Source Separation With Deep Neural Networks
    Nugraha, Aditya Arie
    Liutkus, Antoine
    Vincent, Emmanuel
    [J]. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2016, 24 (09) : 1652 - 1664
  • [7] Enhanced Temporal Feature Integration in Audio Semantics via Alpha-Stable Modeling
    Vrysis, Lazaros
    Hadjileontiadis, Leontios
    Thoidis, Iordanis
    Dimoulas, Charalampos
    Papanikolaou, George
    [J]. JOURNAL OF THE AUDIO ENGINEERING SOCIETY, 2021, 69 (04): : 227 - 237
  • [8] SEPNET: A DEEP SEPARATION MATRIX PREDICTION NETWORK FOR MULTICHANNEL AUDIO SOURCE SEPARATION
    Inoue, Shota
    Kameoka, Hirokazu
    Li, Li
    Makino, Shoji
    [J]. 2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 191 - 195
  • [9] Student's t Source and Mixing Models for Multichannel Audio Source Separation
    Leglaive, Simon
    Badeau, Roland
    Richard, Gael
    [J]. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2018, 26 (06) : 1150 - 1164
  • [10] Bayesian Multichannel Audio Source Separation Based on Integrated Source and Spatial Models
    Itakura, Kousuke
    Bando, Yoshiaki
    Nakamura, Eita
    Itoyama, Katsutoshi
    Yoshii, Kazuyoshi
    Kawahara, Tatsuya
    [J]. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2018, 26 (04) : 831 - 846