BENCHMARKING FLEXIBLE ADAPTIVE TIME-FREQUENCY TRANSFORMS FOR UNDERDETERMINED AUDIO SOURCE SEPARATION

被引:3
|
作者
Nesbit, Andrew [1 ]
Vincent, Emmanuel [2 ]
Plumbley, Mark D. [1 ]
机构
[1] Queen Mary Univ London, Elect Engn & Comp Sci, Mile End Rd, London E1 4NS, England
[2] INRIA, IRISA, METISS Grp, F-35042 Rennes, France
基金
英国工程与自然科学研究理事会;
关键词
Time-frequency analysis; Discrete cosine transforms; Source separation; Benchmark; Evaluation;
D O I
10.1109/ICASSP.2009.4959514
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
We have implemented several fast and flexible adaptive lapped orthogonal transform (LOT) schemes for underdetermined audio source separation. This is generally addressed by time-frequency masking, requiring die sources to be disjoint in the time-frequency domain. We have already shown that disjointness can be increased via adaptive dyadic LOTs. By taking inspiration from the windowing schemes used in many audio coding frameworks, we improve on earlier results in two ways. Firstly, we consider nondyadic LOTs which match the time-varying signal structures better. Secondly, we allow fora greater range of overlapping window profiles to decrease window boundary artifacts. This new scheme is benchmarked through oracle evaluations, and is shown to decrease computation time by over an order of magnitude compared to using very general schemes, whilst maintaining high separation performance and flexible signal adaptivity. As the results demonstrate, this work may find practical applications in high fidelity audio source separation.
引用
收藏
页码:37 / +
页数:2
相关论文
共 50 条
  • [31] A NEW TIME-FREQUENCY APPROACH FOR UNDERDETERMINED CONVOLUTIVE BLIND SPEECH SEPARATION
    Bouafif, Mariem
    Lachiri, Zied
    [J]. 2016 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING PROCEEDINGS, 2016, : 3226 - 3230
  • [32] On the integration of time-frequency masking speech separation and recognition in underdetermined environments
    Jafari, Ingrid
    Haque, Serajul
    Togneri, Roberto
    Nordholm, Sven
    [J]. 2012 CONFERENCE RECORD OF THE FORTY SIXTH ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS AND COMPUTERS (ASILOMAR), 2012, : 1613 - 1617
  • [33] Cycle GAN-Based Audio Source Separation Using Time-Frequency Masking
    Joseph, Sujo
    Rajan, Rajeev
    [J]. CIRCUITS SYSTEMS AND SIGNAL PROCESSING, 2023, 42 (02) : 1163 - 1180
  • [34] Blind separation of underdetermined convolutive mixtures using their time-frequency representation
    Aissa-El-Bey, Abdeldjalil
    Abed-Meraim, Karim
    Grenier, Yves
    [J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2007, 15 (05): : 1540 - 1550
  • [35] Constructing Time-Frequency Dictionaries for Source Separation via Time-Frequency Masking and Source Localisation
    de Frein, Ruairi
    Rickard, Scott T.
    Pearlmutter, Barak A.
    [J]. INDEPENDENT COMPONENT ANALYSIS AND SIGNAL SEPARATION, PROCEEDINGS, 2009, 5441 : 573 - +
  • [36] Maximum likelihood approach for blind audio source separation using time-frequency Gaussian source models
    Févotte, C
    Cardoso, JF
    [J]. 2005 WORKSHOP ON APPLICATIONS OF SIGNAL PROCESSING TO AUDIO AND ACOUSTICS (WASPAA), 2005, : 78 - 81
  • [37] UNDERDETERMINED AUDIO SOURCE SEPARATION FROM ANECHOIC MIXTURES WITH LONG TIME DELAY
    Cho, Namgook
    Kuo, C. -C Jay
    [J]. 2009 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1- 8, PROCEEDINGS, 2009, : 1557 - +
  • [38] Blind source separation using time-frequency information matrix given by several wavelet transforms
    Ashino, Ryuichi
    Fujita, Keiko
    Mandai, Takeshi
    Morimoto, Akira
    Nishihara, Kiyoaki
    [J]. INFORMATION-AN INTERNATIONAL INTERDISCIPLINARY JOURNAL, 2007, 10 (05): : 555 - 568
  • [39] An Underdetermined Convolutional Blind Separation Algorithm for Time-Frequency Overlapped Wireless Communication Signals with Unknown Source Number
    Ma, Hao
    Zheng, Xiang
    Yu, Lu
    Wu, Xinrong
    Zhang, Yu
    [J]. APPLIED SCIENCES-BASEL, 2022, 12 (13):
  • [40] A Directional Laplacian Density for Underdetermined Audio Source Separation
    Mitianoudis, Nikolaos
    [J]. ARTIFICIAL NEURAL NETWORKS-ICANN 2010, PT I, 2010, 6352 : 450 - 459