BENCHMARKING FLEXIBLE ADAPTIVE TIME-FREQUENCY TRANSFORMS FOR UNDERDETERMINED AUDIO SOURCE SEPARATION

被引:3
|
作者
Nesbit, Andrew [1 ]
Vincent, Emmanuel [2 ]
Plumbley, Mark D. [1 ]
机构
[1] Queen Mary Univ London, Elect Engn & Comp Sci, Mile End Rd, London E1 4NS, England
[2] INRIA, IRISA, METISS Grp, F-35042 Rennes, France
基金
英国工程与自然科学研究理事会;
关键词
Time-frequency analysis; Discrete cosine transforms; Source separation; Benchmark; Evaluation;
D O I
10.1109/ICASSP.2009.4959514
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
We have implemented several fast and flexible adaptive lapped orthogonal transform (LOT) schemes for underdetermined audio source separation. This is generally addressed by time-frequency masking, requiring die sources to be disjoint in the time-frequency domain. We have already shown that disjointness can be increased via adaptive dyadic LOTs. By taking inspiration from the windowing schemes used in many audio coding frameworks, we improve on earlier results in two ways. Firstly, we consider nondyadic LOTs which match the time-varying signal structures better. Secondly, we allow fora greater range of overlapping window profiles to decrease window boundary artifacts. This new scheme is benchmarked through oracle evaluations, and is shown to decrease computation time by over an order of magnitude compared to using very general schemes, whilst maintaining high separation performance and flexible signal adaptivity. As the results demonstrate, this work may find practical applications in high fidelity audio source separation.
引用
收藏
页码:37 / +
页数:2
相关论文
共 50 条
  • [1] Underdetermined source separation in the time-frequency domain
    Shan, Zeyong
    Swary, Jacob
    Aviyente, Selin
    [J]. 2007 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL III, PTS 1-3, PROCEEDINGS, 2007, : 945 - +
  • [2] ROBUST UNDERDETERMINED BLIND AUDIO SOURCE SEPARATION OF SPARSE SIGNALS IN THE TIME-FREQUENCY DOMAIN
    Sbai, Si Mohamed Aziz
    Aissa-El-Bey, Abdeldjalil
    Pastor, Dominique
    [J]. 2011 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2011, : 3716 - 3719
  • [3] Time-Frequency Approach to Underdetermined Blind Source Separation
    Xie, Shengli
    Yang, Liu
    Yang, Jun-Mei
    Zhou, Guoxu
    Xiang, Yong
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2012, 23 (02) : 306 - 316
  • [4] Oracle estimation of adaptive cosine packet transforms for underdetermined audio source separation
    Nesbit, Andrew
    Plumbley, Mark D.
    [J]. 2008 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-12, 2008, : 41 - 44
  • [5] AUDIO SOURCE SEPARATION WITH TIME-FREQUENCY VELOCITIES
    Wolf, Guy
    Mallat, Stephane
    Shamma, Shihab
    [J]. 2014 IEEE INTERNATIONAL WORKSHOP ON MACHINE LEARNING FOR SIGNAL PROCESSING (MLSP), 2014,
  • [6] Underdetermined source separation of EEG signals in the time-frequency domain
    Shan, Zeyong
    Swary, Jacob
    Aviyente, Selin
    [J]. 2008 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-12, 2008, : 3637 - 3640
  • [7] Underdetermined blind source separation by a novel time-frequency method
    Su, Qiao
    Shen, Yuehong
    Wei, Yimin
    Deng, Changliang
    [J]. AEU-INTERNATIONAL JOURNAL OF ELECTRONICS AND COMMUNICATIONS, 2017, 77 : 43 - 49
  • [8] Underdetermined Convolutive Blind Source Separation via Time-Frequency Masking
    Reju, Vaninirappuputhenpurayil Gopalan
    Koh, Soo Ngee
    Soon, Ing Yann
    [J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2010, 18 (01): : 101 - 116
  • [9] Underdetermined blind separation of audio sources from the time-frequency representation of their convolutive mixtures
    Aissa-El-Bey, Abdeldjalil
    Abed-Meraim, Karim
    Grenier, Yves
    [J]. 2007 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL I, PTS 1-3, PROCEEDINGS, 2007, : 153 - 156
  • [10] Underdetermined Blind Source Separation by Parallel Factor Analysis in Time-Frequency Domain
    Yang, Liu
    Lv, Jun
    Xiang, Yong
    [J]. COGNITIVE COMPUTATION, 2013, 5 (02) : 207 - 214