ALPHA-STABLE MULTICHANNEL AUDIO SOURCE SEPARATION

被引:0
|
作者
Leglaive, Simon [1 ]
Simsekli, Umut [1 ]
Liutkus, Antoine [2 ]
Badeau, Roland [1 ]
Richard, Gael [1 ]
机构
[1] Univ Paris Saclay, LTCI, Telecom ParisTech, F-75013 Paris, France
[2] INRIA, Speech Proc Team, Villers Les Nancy, France
关键词
Alpha-stable distributions; Multichannel source separation; Informed source separation; Monte Carlo Expectation-Maximization; NONNEGATIVE MATRIX FACTORIZATION;
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
In this paper, we focus on modeling multichannel audio signals in the short-time Fourier transform domain for the purpose of source separation. We propose a probabilistic model based on a class of heavy-tailed distributions, in which the observed mixtures and the latent sources are jointly modeled by using a certain class of multivariate alpha-stable distributions. As opposed to the conventional Gaussian models, where the observations are constrained to lie just within a few standard deviations from the mean, the proposed heavy-tailed model allows us to account for spurious data or important uncertainties in the model. We develop a Monte Carlo Expectation-Maximization algorithm for inferring the sources from the proposed model. We show that our approach leads to significant performance improvements in audio source separation under corrupted mixtures and in spatial audio object coding.
引用
收藏
页码:576 / 580
页数:5
相关论文
共 50 条
  • [31] ALPHA-STABLE EXTENSIVE GAME FORMS
    ICHIISHI, T
    [J]. MATHEMATICS OF OPERATIONS RESEARCH, 1987, 12 (04) : 626 - 633
  • [32] Symmetric alpha-stable filter theory
    Bodenschatz, JS
    [J]. 1997 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I - V: VOL I: PLENARY, EXPERT SUMMARIES, SPECIAL, AUDIO, UNDERWATER ACOUSTICS, VLSI; VOL II: SPEECH PROCESSING; VOL III: SPEECH PROCESSING, DIGITAL SIGNAL PROCESSING; VOL IV: MULTIDIMENSIONAL SIGNAL PROCESSING, NEURAL NETWORKS - VOL V: STATISTICAL SIGNAL AND ARRAY PROCESSING, APPLICATIONS, 1997, : 1945 - 1948
  • [33] Symmetric alpha-stable filter theory
    Bodenschatz, JS
    Nikias, CL
    [J]. IEEE TRANSACTIONS ON SIGNAL PROCESSING, 1997, 45 (09) : 2301 - 2306
  • [34] The Symmetric alpha-Stable Privacy Mechanism
    Zawacki, Christopher C.
    Abed, Eyad H.
    [J]. 2024 58TH ANNUAL CONFERENCE ON INFORMATION SCIENCES AND SYSTEMS, CISS, 2024,
  • [35] Gaussian Modeling-Based Multichannel Audio Source Separation Exploiting Generic Source Spectral Model
    Thanh Thi Hien Duong
    Duong, Ngoc Q. K.
    Phuong Cong Nguyen
    Cuong Quoc Nguyen
    [J]. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2019, 27 (01) : 32 - 43
  • [36] Multichannel Audio Source Separation with Independent Deeply Learned Matrix Analysis Using Product of Source Models
    Hasumi, Takuya
    Nakamura, Tomohiko
    Takamune, Norihiro
    Saruwatari, Hiroshi
    Kitamura, Daichi
    Takahashi, Yu
    Kondo, Kazunobu
    [J]. 2021 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2021, : 1226 - 1233
  • [37] Elliptically Contoured Alpha-Stable Representation for MUSIC-Based Sound Source Localization
    Fontaine, Mathieu
    Di Carlo, Diego
    Sekiguchi, Kouhei
    Nugraha, Aditya Arie
    Bando, Yoshiaki
    Yoshii, Kazuyoshi
    [J]. 2022 30TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO 2022), 2022, : 26 - 30
  • [38] Audio source separation
    Davies, M
    [J]. MATHEMATICS IN SIGNAL PROCESSING V, 2002, (71): : 57 - 68
  • [39] JOINT AUDIO SOURCE SEPARATION AND DEREVERBERATION BASED ON MULTICHANNEL FACTORIAL HIDDEN MARKOV MODEL
    Higuchi, Takuya
    Kameoka, Hirokazu
    [J]. 2014 IEEE INTERNATIONAL WORKSHOP ON MACHINE LEARNING FOR SIGNAL PROCESSING (MLSP), 2014,
  • [40] UNIFIED APPROACH FOR AUDIO SOURCE SEPARATION WITH MULTICHANNEL FACTORIAL HMM AND DOA MIXTURE MODEL
    Higuchi, Takuya
    Kameoka, Hirokazu
    [J]. 2015 23RD EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2015, : 2043 - 2047