Distribution Agnostic Symbolic Representations for Time Series Dimensionality Reduction and Online Anomaly Detection

被引:5
|
作者
Bountrogiannis, Konstantinos [1 ,2 ]
Tzagkarakis, George [2 ]
Tsakalides, Panagiotis [2 ]
机构
[1] Univ Crete, Comp Sci Dept, Iraklion 70013, Greece
[2] Fdn Res & Technol Hellas, Inst Comp Sci, GR-70013 Iraklion, Greece
关键词
Time series analysis; Data mining; Anomaly detection; Aggregates; Task analysis; Quantization (signal); Market research; dynamic clustering; kernel methods; streaming data; symbolic representations; time series analysis; AGGREGATE APPROXIMATION; MEAN SHIFT;
D O I
10.1109/TKDE.2022.3174630
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Due to the importance of the lower bounding distances and the attractiveness of symbolic representations, the family of symbolic aggregate approximations (SAX) has been used extensively for encoding time series data. However, typical SAX-based methods rely on two restrictive assumptions; the Gaussian distribution and equiprobable symbols. This paper proposes two novel data-driven SAX-based symbolic representations, distinguished by their discretization steps. The first representation, oriented for general data compaction and indexing scenarios, is based on the combination of kernel density estimation and Lloyd-Max quantization to minimize the information loss and mean squared error in the discretization step. The second method, oriented for high-level mining tasks, employs the Mean-Shift clustering method and is shown to enhance anomaly detection in the lower-dimensional space. Besides, we verify on a theoretical basis a previously observed phenomenon of the intrinsic process that results in a lower than the expected variance of the intermediate piecewise aggregate approximation. This phenomenon causes an additional information loss but can be avoided with a simple modification. The proposed representations possess all the attractive properties of the conventional SAX method. Furthermore, experimental evaluation on real-world datasets demonstrates their superiority compared to the traditional SAX and an alternative data-driven SAX variant.
引用
收藏
页码:5752 / 5766
页数:15
相关论文
共 50 条
  • [41] Informative symbolic representations as a way to qualitatively analyze time series
    Zhukova, G. N.
    Smetanin, Yu G.
    Uljanov, M., V
    2019 INTERNATIONAL CONFERENCE ON ENGINEERING TECHNOLOGIES AND COMPUTER SCIENCE (ENT): INNOVATION & APPLICATION, 2019, : 43 - 47
  • [42] Generalized Nets Model of Dimensionality Reduction in Time Series
    Krawczak, Maciej
    Szkatula, Grazyna
    INTELLIGENT SYSTEMS'2014, VOL 2: TOOLS, ARCHITECTURES, SYSTEMS, APPLICATIONS, 2015, 323 : 847 - 858
  • [43] Range Automata for Alphabetic Time Series Dimensionality Reduction
    Badhiye, Sagarkumar S.
    Chatur, P. N.
    2018 FIRST INTERNATIONAL CONFERENCE ON SECURE CYBER COMPUTING AND COMMUNICATIONS (ICSCCC 2018), 2018, : 360 - 363
  • [44] A Novel Time Series Representation Approach for Dimensionality Reduction
    Bawaneh, Mohammad
    Simon, Vilmos
    INFOCOMMUNICATIONS JOURNAL, 2022, 14 (02): : 44 - 55
  • [45] Jump-Starting Multivariate Time Series Anomaly Detection for Online Service Systems
    Ma, Minghua
    Zhang, Shenglin
    Chen, Junjie
    Xu, Jun
    Li, Haozhe
    Lin, Yongliang
    Nie, Xiaohui
    Zhou, Bo
    Wang, Yong
    Pei, Dan
    PROCEEDINGS OF THE 2021 USENIX ANNUAL TECHNICAL CONFERENCE, 2021, : 413 - 426
  • [46] Unsupervised Online Anomaly Detection on Multivariate Sensing Time Series Data for Smart Manufacturing
    Hsieh, Ruei-Jie
    Chou, Jerry
    Ho, Chih-Hsiang
    2019 IEEE 12TH CONFERENCE ON SERVICE-ORIENTED COMPUTING AND APPLICATIONS (SOCA 2019), 2019, : 90 - 97
  • [47] Unsupervised anomaly detection in multivariate time series with online evolving spiking neural networks
    Dennis Bäßler
    Tobias Kortus
    Gabriele Gühring
    Machine Learning, 2022, 111 : 1377 - 1408
  • [48] Online Anomaly Detection for Smartphone-Based Multivariate Behavioral Time Series Data
    Liu, Gang
    Onnela, Jukka-Pekka
    SENSORS, 2022, 22 (06)
  • [49] Unsupervised anomaly detection in multivariate time series with online evolving spiking neural networks
    Baessler, Dennis
    Kortus, Tobias
    Guehring, Gabriele
    MACHINE LEARNING, 2022, 111 (04) : 1377 - 1408
  • [50] Online Fault Diagnosis of Electromagnetic Launch System via Time Series Anomaly Detection
    Zeng, Delin
    Lu, Junyong
    IEEE TRANSACTIONS ON PLASMA SCIENCE, 2024, 52 (08) : 3285 - 3293