Sparse audio representations using the MCLT

Cited by: 19
Authors
Davies, ME
Daudet, L
Affiliations
[1] Univ London, Queen Mary, Dept Elect Engn, DSP & Multimedia Grp, London E1 4NS, England
[2] Univ Paris 06, Lab Acoust Musicale, F-75015 Paris, France
Funding
Engineering and Physical Sciences Research Council (EPSRC), UK
Keywords
lapped transforms; overcomplete dictionaries; sparse coding
DOI
10.1016/j.sigpro.2005.05.024
Chinese Library Classification (CLC) number
TM [Electrical engineering]; TN [Electronics and communication technology]
Discipline classification codes
0808; 0809
Abstract
We consider sparse representations of audio based around the modulated complex lapped transform (MCLT) and a generalized iteratively reweighted least squares algorithm which can be interpreted as a variation of expectation maximization. We compare this mildly overcomplete representation to the more traditional modified discrete cosine transform (MDCT) in terms of coding cost and explore the possibility of extending it to a dual-resolution analysis using a pair of MCLT transforms, illustrating its potential application for audio modification. (C) 2005 Elsevier B.V. All rights reserved.
Pages: 457-470
Number of pages: 14
Related papers
50 in total
  • [1] Coding overcomplete representations of audio using the MCLT
    Yoon, Byung-Jun
    Malvar, Henrique S.
    DCC: 2008 DATA COMPRESSION CONFERENCE, PROCEEDINGS, 2008, : 152 - +
  • [2] Musical audio analysis using sparse representations
    Plumbley, Mark D.
    Abdallah, Samer A.
    Blumensath, Thomas
    Jafari, Maria G.
    Nesbit, Andrew
    Vincent, Emmanuel
    Wang, Beiming
    COMPSTAT 2006: PROCEEDINGS IN COMPUTATIONAL STATISTICS, 2006, : 105 - +
  • [3] Parallel and distributed audio concealment using nonlocal sparse representations
    Li, Xin
    2007 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, VOLS 1-5, 2007, : 775 - 778
  • [4] ADAPTIVE APPROACH FOR SPARSE REPRESENTATIONS USING THE LOCALLY COMPETITIVE ALGORITHM FOR AUDIO
    Bahadi, Soufiyan
    Rouat, Jean
    Plourde, Eric
    2021 IEEE 31ST INTERNATIONAL WORKSHOP ON MACHINE LEARNING FOR SIGNAL PROCESSING (MLSP), 2021,
  • [5] AUDIO SIGNAL REPRESENTATIONS FOR FACTORIZATION IN THE SPARSE DOMAIN
    Moussallam, Manuel
    Daudet, Laurent
    Richard, Gael
    2011 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2011, : 513 - 516
  • [6] Audio Fingerprinting Using a Robust Hash Function Based on the MCLT Peak-Pair
    Lee, Jun-Yong
    Kim, Hyoung-Gook
    JOURNAL OF THE ACOUSTICAL SOCIETY OF KOREA, 2015, 34 (02): : 157 - 162
  • [7] Sparse Representations in Audio and Music: From Coding to Source Separation
    Plumbley, Mark D.
    Blumensath, Thomas
    Daudet, Laurent
    Gribonval, Remi
    Davies, Mike E.
    PROCEEDINGS OF THE IEEE, 2010, 98 (06) : 995 - 1005
  • [8] A Real-Time Audio Watermarking Algorithm Based on MCLT
    Zhang Qiu-yu
    Huang Yi-bo
    Deng Jia-bin
    ADVANCES IN CIVIL ENGINEERING, PTS 1-6, 2011, 255-260 : 2042 - 2046
  • [9] Audio-Visual Biometric Recognition Via Joint Sparse Representations
    Primorac, Rudi
    Togneri, Roberto
    Bennamoun, Mohammed
    Sohel, Ferdous
    2016 23RD INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2016, : 3031 - 3035
  • [10] Image understanding using sparse representations
    Thiagarajan, J.J., 1600, Morgan and Claypool Publishers (15):