AUDIO SOURCE SEPARATION WITH TIME-FREQUENCY VELOCITIES

被引:0
|
作者
Wolf, Guy [1 ]
Mallat, Stephane [1 ]
Shamma, Shihab [2 ]
机构
[1] Ecole Normale Super, Dept Comp Sci, 45 Rue Ulm, F-75005 Paris, France
[2] Ecole Normale Super, Dept Cognit Studies, F-75005 Paris, France
基金
欧洲研究理事会;
关键词
Audio source separation; harmonic templates; velocity; wavelets; NONNEGATIVE MATRIX FACTORIZATION; AMPLITUDE-MODULATION; SPEECH SEPARATION; PITCH;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Separating complex audio sources from a single measurement channel, with no training data, is highly challenging, We introduce a new approach, which relies on the time dynamics of rigid audio models, based on harmonic templates. The velocity vectors of such models are defined and computed in a time-frequency scalogram calculated with a wavelet transform. Similarly to rigid object segmentation in videos, multiple audio sources are discriminated by approximating their velocity vectors with low-dimensional models. The different audio sources are segmented by optimizing a harmonic template selection, which provides piecewise constant velocity approximations. Numerical experiments give examples of blind source separation from single channel audio signals.
引用
收藏
页数:6
相关论文
共 50 条
  • [1] Audio source separation with multiple microphones on time-frequency representations
    Sawada, Hiroshi
    [J]. INDEPENDENT COMPONENT ANALYSES, COMPRESSIVE SAMPLING, WAVELETS, NEURAL NET, BIOSYSTEMS, AND NANOENGINEERING XI, 2013, 8750
  • [2] Stereo audio source separation based on time-frequency masking and multilevel thresholding
    Cobos, Maximo
    Lopez, Jose J.
    [J]. DIGITAL SIGNAL PROCESSING, 2008, 18 (06) : 960 - 976
  • [3] BENCHMARKING FLEXIBLE ADAPTIVE TIME-FREQUENCY TRANSFORMS FOR UNDERDETERMINED AUDIO SOURCE SEPARATION
    Nesbit, Andrew
    Vincent, Emmanuel
    Plumbley, Mark D.
    [J]. 2009 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1- 8, PROCEEDINGS, 2009, : 37 - +
  • [4] ROBUST UNDERDETERMINED BLIND AUDIO SOURCE SEPARATION OF SPARSE SIGNALS IN THE TIME-FREQUENCY DOMAIN
    Sbai, Si Mohamed Aziz
    Aissa-El-Bey, Abdeldjalil
    Pastor, Dominique
    [J]. 2011 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2011, : 3716 - 3719
  • [5] Cycle GAN-Based Audio Source Separation Using Time-Frequency Masking
    Joseph, Sujo
    Rajan, Rajeev
    [J]. CIRCUITS SYSTEMS AND SIGNAL PROCESSING, 2023, 42 (02) : 1163 - 1180
  • [6] Constructing Time-Frequency Dictionaries for Source Separation via Time-Frequency Masking and Source Localisation
    de Frein, Ruairi
    Rickard, Scott T.
    Pearlmutter, Barak A.
    [J]. INDEPENDENT COMPONENT ANALYSIS AND SIGNAL SEPARATION, PROCEEDINGS, 2009, 5441 : 573 - +
  • [7] Maximum likelihood approach for blind audio source separation using time-frequency Gaussian source models
    Févotte, C
    Cardoso, JF
    [J]. 2005 WORKSHOP ON APPLICATIONS OF SIGNAL PROCESSING TO AUDIO AND ACOUSTICS (WASPAA), 2005, : 78 - 81
  • [8] Underdetermined source separation in the time-frequency domain
    Shan, Zeyong
    Swary, Jacob
    Aviyente, Selin
    [J]. 2007 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL III, PTS 1-3, PROCEEDINGS, 2007, : 945 - +
  • [9] MULTICHANNEL AUDIO SOURCE SEPARATION: VARIATIONAL INFERENCE OF TIME-FREQUENCY SOURCES FROM TIME-DOMAIN OBSERVATIONS
    Leglaive, Simon
    Badeau, Roland
    Richard, Gael
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2017, : 26 - 30
  • [10] Musical source separation using time-frequency source priors
    Vincent, E
    [J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2006, 14 (01): : 91 - 98