Distribution Agnostic Symbolic Representations for Time Series Dimensionality Reduction and Online Anomaly Detection

被引:5
|
作者
Bountrogiannis, Konstantinos [1 ,2 ]
Tzagkarakis, George [2 ]
Tsakalides, Panagiotis [2 ]
机构
[1] Univ Crete, Comp Sci Dept, Iraklion 70013, Greece
[2] Fdn Res & Technol Hellas, Inst Comp Sci, GR-70013 Iraklion, Greece
关键词
Time series analysis; Data mining; Anomaly detection; Aggregates; Task analysis; Quantization (signal); Market research; dynamic clustering; kernel methods; streaming data; symbolic representations; time series analysis; AGGREGATE APPROXIMATION; MEAN SHIFT;
D O I
10.1109/TKDE.2022.3174630
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Due to the importance of the lower bounding distances and the attractiveness of symbolic representations, the family of symbolic aggregate approximations (SAX) has been used extensively for encoding time series data. However, typical SAX-based methods rely on two restrictive assumptions; the Gaussian distribution and equiprobable symbols. This paper proposes two novel data-driven SAX-based symbolic representations, distinguished by their discretization steps. The first representation, oriented for general data compaction and indexing scenarios, is based on the combination of kernel density estimation and Lloyd-Max quantization to minimize the information loss and mean squared error in the discretization step. The second method, oriented for high-level mining tasks, employs the Mean-Shift clustering method and is shown to enhance anomaly detection in the lower-dimensional space. Besides, we verify on a theoretical basis a previously observed phenomenon of the intrinsic process that results in a lower than the expected variance of the intermediate piecewise aggregate approximation. This phenomenon causes an additional information loss but can be avoided with a simple modification. The proposed representations possess all the attractive properties of the conventional SAX method. Furthermore, experimental evaluation on real-world datasets demonstrates their superiority compared to the traditional SAX and an alternative data-driven SAX variant.
引用
收藏
页码:5752 / 5766
页数:15
相关论文
共 50 条
  • [1] Anomaly Detection for Symbolic Time Series Representations of Reduced Dimensionality
    Bountrogiannis, Konstantinos
    Tzagkarakis, George
    Tsakalides, Panagiotis
    28TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO 2020), 2021, : 2398 - 2402
  • [2] Exploring the Influence of Dimensionality Reduction on Anomaly Detection Performance in Multivariate Time Series
    Altin, Mahsun
    Cakir, Altan
    IEEE ACCESS, 2024, 12 : 85783 - 85794
  • [3] Dimensionality reduction and clustering of time series for anomaly detection in a supermarket heating system
    Salmina, Lorenzo
    Castello, Roberto
    Stoll, Justine
    Scartezzini, Jean-Louis
    CARBON-NEUTRAL CITIES - ENERGY EFFICIENCY AND RENEWABLES IN THE DIGITAL ERA (CISBAT 2021), 2021, 2042
  • [4] Learning the feature distribution similarities for online time series anomaly detection
    Fan, Jin
    Ge, Yan
    Zhang, Xinyi
    Wang, Zheyu
    Wu, Huifeng
    Wu, Jia
    NEURAL NETWORKS, 2024, 180
  • [5] Online Anomaly Detection of Time Series at Scale
    Mason, Andrew
    Zhao, Yifan
    He, Hongmei
    Gompelman, Raymon
    Mandava, Srikanth
    2019 INTERNATIONAL CONFERENCE ON CYBER SITUATIONAL AWARENESS, DATA ANALYTICS AND ASSESSMENT (CYBER SA), 2019,
  • [6] Dimensionality Reduction Techniques for Streaming Time Series: A New Symbolic Approach
    Balzanella, Antonio
    Irpino, Antonio
    Verde, Rosanna
    CLASSIFICATION AS A TOOL FOR RESEARCH, 2010, : 381 - 389
  • [7] Symbolic time series analysis for anomaly detection: A comparative evaluation
    Chin, SC
    Ray, A
    Rajagopalan, V
    SIGNAL PROCESSING, 2005, 85 (09) : 1859 - 1868
  • [8] Symbolic time-series analysis for anomaly detection in mechanical
    Khatkhate, Amol
    Ray, Asok
    Keller, Eric
    Gupta, Shalabh
    Chin, Shin C.
    IEEE-ASME TRANSACTIONS ON MECHATRONICS, 2006, 11 (04) : 439 - 447
  • [9] Online anomaly detection using dimensionality reduction techniques for HTTP log analysis
    Juvonen, Antti
    Sipola, Tuomo
    Hamalainen, Timo
    COMPUTER NETWORKS, 2015, 91 : 46 - 56
  • [10] Anomaly Detection in Rails Using Dimensionality Reduction
    Mitsui, Shingo
    Sasaki, Toshihiko
    Shinya, Masayoshi
    Arai, Yasuo
    Nishimura, Ryutaro
    ISIJ INTERNATIONAL, 2023, 63 (01) : 170 - 178