Matrix Profile XIII: Time Series Snippets: A New Primitive for Time Series Data Mining

被引:28
|
作者
Imani, Shima [1 ]
Madrid, Frank [1 ]
Ding, Wei [2 ]
Crouter, Scott [3 ]
Keogh, Eamonn [1 ]
机构
[1] Univ Calif Riverside, Dept Comp Sci & Engn, Riverside, CA 92521 USA
[2] Univ Massachusetts Boston, Dept Comp Sci, Boston, MA USA
[3] Univ Tennessee, Coll Educ Hlth & Human Sci, Knoxville, TN USA
关键词
time series; motifs; sampling; diversification;
D O I
10.1109/ICBK.2018.00058
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Perhaps the most basic query made by a data analyst confronting a new data source is "Show me some representative/typical data." Answering this question is trivial in many domains, but surprisingly, it is very difficult in large time series datasets. The major difficulty is not time or space complexity, but defining what it means to be representative data in this domain. In this work, we show that the obvious candidate definitions: motifs, shapelets, cluster centers, random samples etc., are all poor choices. Thus motivated, we introduce time series snippets, a novel representation of typical time series subsequences. Beyond their utility for visualizing and summarizing massive time series collections, we show that time series snippets have utility for high-level comparison of large time series collections.
引用
收藏
页码:382 / 389
页数:8
相关论文
共 50 条
  • [31] An Efficient Time Series Data Mining Technique
    Aboalsamh, Hatim A.
    Hafez, Alaaeldin M.
    Assassa, Ghazy M. R.
    PROCEEDINGS OF THE 12TH WSEAS INTERNATIONAL CONFERENCE ON COMPUTERS , PTS 1-3: NEW ASPECTS OF COMPUTERS, 2008, : 950 - +
  • [32] Time Series Data Mining: A Unifying View
    Keogh, Eamonn
    PROCEEDINGS OF THE VLDB ENDOWMENT, 2023, 16 (12): : 3861 - 3863
  • [33] Outliers Mining in Time Series Data Sets
    Zheng Binxiang
    JournalofSystemsEngineeringandElectronics, 2002, (01) : 93 - 97
  • [34] Similarity problems in time series data mining
    Yan, XB
    Li, YJ
    Fan, B
    PROCEEDINGS OF 2003 INTERNATIONAL CONFERENCE ON MANAGEMENT SCIENCE & ENGINEERING, VOLS I AND II, 2003, : 382 - 385
  • [35] Preserving Privacy in Time Series Data Mining
    Zhu, Ye
    Fu, Yongjian
    Fu, Huirong
    INTERNATIONAL JOURNAL OF DATA WAREHOUSING AND MINING, 2011, 7 (04) : 64 - 85
  • [36] Time Series Data Mining: A Unifying View
    Keogh, Eamonn
    2024 IEEE 11TH INTERNATIONAL CONFERENCE ON DATA SCIENCE AND ADVANCED ANALYTICS, DSAA 2024, 2024, : 424 - 426
  • [37] A data mining framework for time series estimation
    Hu, Xiao
    Xu, Peng
    Wu, Shaozhi
    Asgari, Shadnaz
    Bergsneider, Marvin
    JOURNAL OF BIOMEDICAL INFORMATICS, 2010, 43 (02) : 190 - 199
  • [38] Data mining on time series of sequential patterns
    Visa, A
    DATA MINING AND KNOWLEDGE DISCOVERY: THEORY, TOOLS AND TECHNOLOGY IV, 2002, 4730 : 166 - 171
  • [39] Visual mining of spatial time series data
    Andrienko, G
    Andrienko, N
    Gatalsky, P
    KNOWLEDGE DISCOVERY IN DATABASES: PKDD 2004, PROCEEDINGS, 2004, 3202 : 524 - 527
  • [40] Research on framework of time series data mining
    Yan, XB
    Li, YJ
    Jin, SW
    PROCEEDINGS OF THE 2004 INTERNATIONAL CONFERENCE ON MANAGEMENT SCIENCE & ENGINEERING, VOLS 1 AND 2, 2004, : 197 - 200