Improving discretization based pattern discovery for multivariate time series by additional preprocessing

被引:1
|
作者
Noering, Fabian Kai-Dietrich [1 ]
Jonas, Konstantin [2 ]
Klawonn, Frank [3 ,4 ]
机构
[1] Volkswagen AG, Wolfsburg, Germany
[2] Deutsch Bahn AG, Volkswagen AG, Berlin, Germany
[3] Ostfalia Univ Appl Sci, Dept Comp Sci, Wolfenbuttel, Germany
[4] Helmholtz Ctr Infect Res, Braunschweig, Germany
关键词
Time series data mining; pattern discovery; motif discovery; variable pattern length; unsupervised; multivariate; LINEAR-TIME; MOTIFS;
D O I
10.3233/IDA-205329
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In technical systems the analysis of similar load situations is a promising technique to gain information about the system's state, its health or wearing. Very often, load situations are challenging to be defined by hand. Hence, these situations need to be discovered as recurrent patterns within multivariate time series data of the system under consideration. Unsupervised algorithms for finding such recurrent patterns in multivariate time series must be able to cope with very large data sets because the system might be observed over a very long time. In our previous work we identified discretization-based approaches to be very interesting for variable length pattern discovery because of their low computing time due to the simplification (symbolization) of the time series. In this paper we propose additional preprocessing steps for symbolic representation of time series aiming for enhanced multivariate pattern discovery. Beyond that we show the performance (quality and computing time) of our algorithms in a synthetic test data set as well as in a real life example with 100 millions of time points. We also test our approach with increasing dimensionality of the time series.
引用
收藏
页码:1051 / 1072
页数:22
相关论文
共 50 条
  • [1] Algorithm for time series based on hidden pattern discovery
    Xiang, Kui
    Jiang, Jing-Ping
    Dianzi Yu Xinxi Xuebao/Journal of Electronics and Information Technology, 2007, 29 (01): : 59 - 62
  • [2] Sensor Time Series Association Rule Discovery Based on Modified Discretization Method
    Xue, Ruidong
    Zhang, Tingting
    Chen, Dehua
    Le, Jiajin
    Lavassani, Mehrzad
    2016 FIRST IEEE INTERNATIONAL CONFERENCE ON COMPUTER COMMUNICATION AND THE INTERNET (ICCCI 2016), 2016, : 196 - 202
  • [3] Empirical Sensitivity Analysis of Discretization Parameters for Fault Pattern Extraction From Multivariate Time Series Data
    Baek, Sujeong
    Young, Duck
    IEEE TRANSACTIONS ON CYBERNETICS, 2017, 47 (05) : 1198 - 1209
  • [4] Visual-Interactive Preprocessing of Multivariate Time Series Data
    Bernard, Juergen
    Hutter, Marco
    Reinemuth, Heiko
    Pfeifer, Hendrik
    Bors, Christian
    Kohlhammer, Joern
    COMPUTER GRAPHICS FORUM, 2019, 38 (03) : 401 - 412
  • [5] Classification-driven temporal discretization of multivariate time series
    Moskovitch, Robert
    Shahar, Yuval
    DATA MINING AND KNOWLEDGE DISCOVERY, 2015, 29 (04) : 871 - 913
  • [6] Classification-driven temporal discretization of multivariate time series
    Robert Moskovitch
    Yuval Shahar
    Data Mining and Knowledge Discovery, 2015, 29 : 871 - 913
  • [7] Pattern Discovery in the Financial Time Series Based on Local Trend
    Mai Van Hoan
    Dao The Huy
    Luong Chi Mai
    ADVANCES IN INFORMATION AND COMMUNICATION TECHNOLOGY, 2017, 538 : 442 - 451
  • [8] Discovery of Temporal Associations in Multivariate Time Series
    Zhuang, Dennis E. H.
    Li, Gary C. L.
    Wong, Andrew K. C.
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2014, 26 (12) : 2969 - 2982
  • [9] Improving time series similarity measures by integrating preprocessing steps
    Frank Höppner
    Data Mining and Knowledge Discovery, 2017, 31 : 851 - 878
  • [10] Improving time series similarity measures by integrating preprocessing steps
    Hoeppner, Frank
    DATA MINING AND KNOWLEDGE DISCOVERY, 2017, 31 (03) : 851 - 878