Sparse audio representations using the MCLT

Cited by: 19

Authors:

Davies, ME

Daudet, L

Affiliations:

[1] Univ London, Queen Mary, Dept Elect Engn, DSP & Multimedia Grp, London E1 4NS, England

[2] Univ Paris 06, Lab Acoust Musicale, F-75015 Paris, France

Source:

SIGNAL PROCESSING | 2006 / Vol. 86 / Issue 03

Funding:

Engineering and Physical Sciences Research Council (EPSRC), UK;

Keywords:

lapped transforms; overcomplete dictionaries; sparse coding;

DOI:

10.1016/j.sigpro.2005.05.024

Chinese Library Classification (CLC):

TM [Electrical engineering]; TN [Electronics and communication technology];

Discipline classification codes:

0808 ; 0809 ;

Abstract:

We consider sparse representations of audio based around the modulated complex lapped transform (MCLT) and a generalized iteratively reweighted least squares algorithm which can be interpreted as a variation of expectation maximization. We compare this mildly overcomplete representation to the more traditional modified discrete cosine transform (MDCT) in terms of coding cost and explore the possibility of extending it to a dual-resolution analysis using a pair of MCLT transforms, illustrating its potential application for audio modification. © 2005 Elsevier B.V. All rights reserved.


Pages: 457-470

Page count: 14

50 items in total

[1] Coding overcomplete representations of audio using the MCLT
Yoon, Byung-Jun
Malvar, Henrique S.
DCC: 2008 DATA COMPRESSION CONFERENCE, PROCEEDINGS, 2008, : 152 - +
[2] Musical audio analysis using sparse representations
Plumbley, Mark D.
Abdallah, Samer A.
Blumensath, Thomas
Jafari, Maria G.
Nesbit, Andrew
Vincent, Emmanuel
Wang, Beiming
COMPSTAT 2006: PROCEEDINGS IN COMPUTATIONAL STATISTICS, 2006, : 105 - +
[3] Parallel and distributed audio concealment using nonlocal sparse representations
Li, Xin
2007 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, VOLS 1-5, 2007, : 775 - 778
[4] ADAPTIVE APPROACH FOR SPARSE REPRESENTATIONS USING THE LOCALLY COMPETITIVE ALGORITHM FOR AUDIO
Bahadi, Soufiyan
Rouat, Jean
Plourde, Eric
2021 IEEE 31ST INTERNATIONAL WORKSHOP ON MACHINE LEARNING FOR SIGNAL PROCESSING (MLSP), 2021,
[5] AUDIO SIGNAL REPRESENTATIONS FOR FACTORIZATION IN THE SPARSE DOMAIN
Moussallam, Manuel
Daudet, Laurent
Richard, Gael
2011 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2011, : 513 - 516
[6] Audio Fingerprinting Using a Robust Hash Function Based on the MCLT Peak-Pair
Lee, Jun-Yong
Kim, Hyoung-Gook
JOURNAL OF THE ACOUSTICAL SOCIETY OF KOREA, 2015, 34 (02): : 157 - 162
[7] Sparse Representations in Audio and Music: From Coding to Source Separation
Plumbley, Mark D.
Blumensath, Thomas
Daudet, Laurent
Gribonval, Remi
Davies, Mike E.
PROCEEDINGS OF THE IEEE, 2010, 98 (06) : 995 - 1005
[8] A Real-Time Audio Watermarking Algorithm Based on MCLT
Zhang Qiu-yu
Huang Yi-bo
Deng Jia-bin
ADVANCES IN CIVIL ENGINEERING, PTS 1-6, 2011, 255-260 : 2042 - 2046
[9] Audio-Visual Biometric Recognition Via Joint Sparse Representations
Primorac, Rudi
Togneri, Roberto
Bennamoun, Mohammed
Sohel, Ferdous
2016 23RD INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2016, : 3031 - 3035
[10] Image understanding using sparse representations
Thiagarajan, J.J., 1600, Morgan and Claypool Publishers (15):
