Parameter-Free Audio Motif Discovery in Large Data Archives

被引:8
|
作者
Hao, Yuan [1 ]
Shokoohi-Yekta, Mohammad [1 ]
Papageorgiou, George [1 ]
Keogh, Eamonn [1 ]
机构
[1] Univ Calif Riverside, Riverside, CA 92521 USA
关键词
audio motif; spectrogram; anytime algorithm;
D O I
10.1109/ICDM.2013.30
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The discovery of repeated structure, i.e. motifs/near-duplicates, is often the first step in exploratory data mining. As such, the last decade has seen extensive research efforts in motif discovery algorithms for text, DNA, time series, protein sequences, graphs, images, and video. Surprisingly, there has been less attention devoted to finding repeated patterns in audio sequences, in spite of their ubiquity in science and entertainment. While there is significant work for the special case of motifs in music, virtually all this work makes many assumptions about data (often to the point of being genre specific) and thus these algorithms do not generalize to audio sequences containing animal vocalizations, industrial processes, or a host of other domains that we may wish to explore. In this work we introduce a novel technique for finding audio motifs. Our method does not require any domain-specific tuning and is essentially parameter-free. We demonstrate our algorithm on very diverse domains, finding audio motifs in laboratory mice vocalizations, wild animal sounds, music, and human speech. Our experiments demonstrate that our ideas are effective in discovering objectively correct or subjectively plausible motifs. Moreover, we show our novel probabilistic early abandoning approach is efficient, being two to three orders of magnitude faster than brute-force search, and thus faster than real-time for most problems.
引用
收藏
页码:261 / 270
页数:10
相关论文
共 50 条
  • [1] MERLIN: Parameter-Free Discovery of Arbitrary Length Anomalies in Massive Time Series Archives
    Nakamura, Takaaki
    Imamura, Makoto
    Mercer, Ryan
    Keogh, Eamonn
    [J]. 20TH IEEE INTERNATIONAL CONFERENCE ON DATA MINING (ICDM 2020), 2020, : 1190 - 1195
  • [2] Hierarchical, Parameter-Free Community Discovery
    Papadimitriou, Spiros
    Sun, Jimeng
    Faloutsos, Christos
    Yu, Philip S.
    [J]. MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, PART II, PROCEEDINGS, 2008, 5212 : 170 - +
  • [3] A Hybrid and Parameter-Free Clustering Algorithm for Large Data Sets
    Shao, Hengkang
    Zhang, Ping
    Chen, Xinye
    Li, Fang
    Du, Guanglong
    [J]. IEEE ACCESS, 2019, 7 : 24806 - 24818
  • [4] Parameter-free rendering of single-molecule localization microscopy data for parameter-free resolution estimation
    Descloux, Adrien C.
    Grussmayer, Kristin S.
    Radenovic, Aleksandra
    [J]. COMMUNICATIONS BIOLOGY, 2021, 4 (01)
  • [5] Parameter-free rendering of single-molecule localization microscopy data for parameter-free resolution estimation
    Adrien C. Descloux
    Kristin S. Grußmayer
    Aleksandra Radenovic
    [J]. Communications Biology, 4
  • [6] MERLIN++: parameter-free discovery of time series anomalies
    Takaaki Nakamura
    Ryan Mercer
    Makoto Imamura
    Eamonn Keogh
    [J]. Data Mining and Knowledge Discovery, 2023, 37 : 670 - 709
  • [7] A Parameter-Free Classification Method for Large Scale Learning
    Boulle, Marc
    [J]. JOURNAL OF MACHINE LEARNING RESEARCH, 2009, 10 : 1367 - 1385
  • [8] Parameter-free uniformisation
    Friedman, Sy-David
    [J]. PROCEEDINGS OF THE AMERICAN MATHEMATICAL SOCIETY, 2008, 136 (09) : 3327 - 3330
  • [9] Parameter-free spatial data mining using MDL
    Papadimitriou, S
    Gionis, A
    Tsaparas, P
    Väisänen, RA
    Mannila, H
    Faloutsos, C
    [J]. FIFTH IEEE INTERNATIONAL CONFERENCE ON DATA MINING, PROCEEDINGS, 2005, : 346 - 353
  • [10] Parameter-free Topology Inference and Sparsification for Data on Manifolds
    Dey, Tamal K.
    Dong, Zhe
    Wang, Yusu
    [J]. PROCEEDINGS OF THE TWENTY-EIGHTH ANNUAL ACM-SIAM SYMPOSIUM ON DISCRETE ALGORITHMS, 2017, : 2733 - 2747