Sparse Representations in Audio and Music: From Coding to Source Separation

Cited by: 127
Authors
Plumbley, Mark D. [1]
Blumensath, Thomas [2]
Daudet, Laurent [3, 5]
Gribonval, Remi [4]
Davies, Mike E. [6, 7]
Affiliations
[1] Queen Mary Univ London, Sch Elect Engn & Comp Sci, London E1 4NS, England
[2] Univ Southampton, Sch Math, Southampton SO17 1BJ, Hants, England
[3] Univ Paris 06, Inst Jean Le Rond Alembert, LAM, F-75015 Paris, France
[4] INRIA, Ctr Inria Rennes Bretagne Atlantique, F-35042 Rennes, France
[5] Univ Denis Diderot Paris 7, Langevin Inst Waves & Images LOA, Paris, France
[6] Univ Edinburgh, Inst Digital Commun IDCOM, Sch Engn & Elect, Edinburgh EH9 3JL, Midlothian, Scotland
[7] Univ Edinburgh, Joint Res Inst Signal & Image Proc, Sch Engn & Elect, Edinburgh EH9 3JL, Midlothian, Scotland
Funding
Engineering and Physical Sciences Research Council (EPSRC), UK
Keywords
Audio coding; basis functions; discrete cosine transforms; Fourier transforms; music; signal representations; wavelet transforms; blind source separation; signal recovery; algorithms
DOI
10.1109/JPROC.2009.2030345
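Chinese Library Classification (CLC)
TM [Electrical engineering]; TN [Electronic technology, communication technology]
Discipline classification codes
0808; 0809
Abstract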
Sparse representations have proved a powerful tool in the analysis and processing of audio signals and already lie at the heart of popular coding standards such as MP3 and Dolby AAC. In this paper we give an overview of a number of current and emerging applications of sparse representations in areas from audio coding, audio enhancement and music transcription to blind source separation solutions that can solve the "cocktail party problem." In each case we will show how the prior assumption that the audio signals are approximately sparse in some time-frequency representation allows us to address the associated signal processing task.
Pages: 995-1005
Page count: 11
Related papers
50 in total
  • [31] Visually Guided Sound Source Separation With Audio-Visual Predictive Coding
    Song, Zengjie
    Zhang, Zhaoxiang
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 35 (11) : 15528 - 15542
  • [32] Learning modular representations from global sparse coding networks
    Eva L Dyer
    Don H Johnson
    Richard G Baraniuk
    BMC Neuroscience, 11 (Suppl 1)
  • [33] Audio coding and electronic distribution of music
    Brandenburg, K
    SECOND INTERNATIONAL CONFERENCE ON WEB DELIVERING OF MUSIC, PROCEEDINGS, 2002, : 3 - 5
  • [34] SPARSE SOURCE SEPARATION FROM ORTHOGONAL MIXTURES
    Mishali, Moshe
    Eldar, Yonina C.
    2009 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1-8, PROCEEDINGS, 2009, : 3145 - 3148
  • [35] Optimal sparse representations for blind source separation and blind deconvolution: A learning approach
    Bronstein, MM
    Bronstein, AM
    Zibulevsky, M
    Zeevi, YY
    ICIP: 2004 INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, VOLS 1-5, 2004, : 1815 - 1818
  • [36] Learning Discriminative Sparse Representations for Modeling, Source Separation, and Mapping of Hyperspectral Imagery
    Castrodad, Alexey
    Xing, Zhengming
    Greer, John B.
    Bosch, Edward
    Carin, Lawrence
    Sapiro, Guillermo
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2011, 49 (11) : 4263 - 4281
  • [37] Soundprism: An Online System for Score-Informed Source Separation of Music Audio
    Duan, Zhiyao
    Pardo, Bryan
    IEEE JOURNAL OF SELECTED TOPICS IN SIGNAL PROCESSING, 2011, 5 (06) : 1205 - 1215
  • [38] The Influence of Blind Source Separation on Mixed Audio Speech and Music Emotion Recognition
    Laugs, Casper
    Koops, Hendrik Vincent
    Odijk, Daan
    Kaya, Heysem
    Volk, Anja
    COMPANION PUBLICATION OF THE 2020 INTERNATIONAL CONFERENCE ON MULTIMODAL INTERACTION (ICMI '20 COMPANION), 2020, : 67 - 71
  • [39] Extension of Sparse, Adaptive Signal Decompositions to Semi-blind Audio Source Separation
    Nesbit, Andrew
    Vincent, Emmanuel
    Plumbley, Mark D.
    INDEPENDENT COMPONENT ANALYSIS AND SIGNAL SEPARATION, PROCEEDINGS, 2009, 5441 : 605 - +
  • [40] β-Divergence Two-Dimensional Sparse Nonnegative Matrix Factorization for Audio Source Separation
    Darsono, A. M.
    Haron, N. Z.
    Jaafar, A. S.
    Ahmad, M. I.
    2013 IEEE CONFERENCE ON WIRELESS SENSOR (ICWISE), 2013, : 119 - 123