Unsupervised learning of sparse and shift-invariant decompositions of polyphonic music

被引:0
|
作者
Blumensath, T [1 ]
Davies, M [1 ]
机构
[1] Univ London, Dept Elect Engn, London E1 4NS, England
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Many time-series in engineering arise from a sparse mixture of individual components. Sparse coding can be used to decompose such signals into a set of functions. Most sparse coding algorithms divide the signal into blocks. The functions learned from these blocks are, however, not independent of the temporal alignment of the blocks. We present a fast algorithm for sparse coding that does not depend on the block location. To reduce the dimensionality of the problem, a subspace selection step is used during signal decomposition. Due to this reduction an Iterative Reweighted Least Squares method can be used for the constrained optimisation. We demonstrate the algorithm's abilities by learning functions from a polyphonic piano recording. The found functions represent individual notes and a sparse signal decomposition leads to a transcription of the piano signal.
引用
收藏
页码:497 / 500
页数:4
相关论文
共 50 条
  • [1] Sparse and shift-invariant representations of music
    Blumensath, T
    Davies, M
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2006, 14 (01): : 50 - 57
  • [2] Learning Shift-Invariant Sparse Representation of Actions
    Li, Yi
    Fermuller, Cornelia
    Aloimonos, Yiannis
    Ji, Hui
    2010 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2010, : 2630 - 2637
  • [3] Sparse shift-invariant NMF
    Potluru, Vamsi K.
    Plis, Sergey M.
    Calhoun, Vince D.
    2008 IEEE SOUTHWEST SYMPOSIUM ON IMAGE ANALYSIS & INTERPRETATION, 2008, : 69 - +
  • [4] On shift-invariant sparse coding
    Blumensath, T
    Davies, M
    INDEPENDENT COMPONENT ANALYSIS AND BLIND SIGNAL SEPARATION, 2004, 3195 : 1205 - 1212
  • [5] Learning Scale and Shift-Invariant Dictionary for Sparse Representation
    Aritake, Toshimitsu
    Murata, Noboru
    MACHINE LEARNING, OPTIMIZATION, AND DATA SCIENCE, 2019, 11943 : 472 - 483
  • [6] Learning Features for Activity Recognition with Shift-Invariant Sparse Coding
    Vollmer, Christian
    Gross, Horst-Michael
    Eggert, Julian P.
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2013, 2013, 8131 : 367 - 374
  • [7] Affine frame decompositions and shift-invariant spaces
    Chui, CK
    Sun, QY
    APPLIED AND COMPUTATIONAL HARMONIC ANALYSIS, 2006, 20 (01) : 74 - 107
  • [8] Unsupervised analysis of polyphonic music by sparse coding
    Abdallah, SA
    Plumbley, MD
    IEEE TRANSACTIONS ON NEURAL NETWORKS, 2006, 17 (01): : 179 - 196
  • [9] Multiple-instrument polyphonic music transcription using a temporally constrained shift-invariant model
    Benetos, Emmanouil
    Dixon, Simon
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2013, 133 (03): : 1727 - 1741
  • [10] On learning with shift-invariant structures
    Rusu, Cristian
    DIGITAL SIGNAL PROCESSING, 2020, 99