An evolving approach to the similarity-based modeling for online clustering in non-stationary environments

Cited by: 0
Authors
Almeida, Nayron Morais [1]
Camargos, Murilo Osorio [2]
Mariano, Denis G. B. [3]
Bomfim, Carlos H. M. [3]
Palhares, Reinaldo M. [3]
Caminhas, Walmir M. [3]
Affiliations
[1] Univ Fed Minas Gerais, Grad Program Elect Engn, Ave Antonio Carlos 6627, BR-31270901 Belo Horizonte, MG, Brazil
[2] Univ Estadual Montes Claros, Grad Program Comp Modeling & Syst, Ave Rui Braga S-N, BR-39401089 Montes Claros, MG, Brazil
[3] Univ Fed Minas Gerais, Dept Elect Engn, Ave Antonio Carlos 6627, BR-31270901 Belo Horizonte, MG, Brazil
Keywords
Clustering; Online; Evolving; Similarity-based modeling; Data stream
DOI
10.1007/s12530-024-09646-w
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
This paper proposes a novel evolving approach based on Similarity-Based Modeling (SBM), a technique widely used in industrial applications of anomaly detection and multiclass classification. Like SBM, the proposed approach uses a simple model matrix composed of historical points to represent each cluster. Its inference procedure for a given input instance consists only of generating an estimate with respect to each cluster and then assigning the input to the most similar cluster according to a novel membership function that considers approximation error and data density. The main features of the approach include a simple and intuitive learning scheme, the ability to model clusters of any shape without using micro-cluster-like procedures, robustness to noisy data, and low computational effort. We evaluate the proposed approach on fifteen datasets widely used in the literature, assessing its ability to deal with overlapping clusters, clusters of arbitrary shape, noisy data, and high dimensionality. Using the Adjusted Rand Index (ARI) and Purity metrics, the proposed algorithm was compared with eight recent state-of-the-art algorithms; it achieved the highest performance on most of the datasets and performance similar to the other methods on the remaining ones. Averaged over the fifteen datasets, our approach achieved an ARI value of 0.8872 and a Purity value of 0.9107. Considering ARI, the most competitive method achieved an average value of 0.6988; considering Purity, the most competitive method achieved an average value of 0.9257. These results indicate the effectiveness of the proposed approach.
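The two evaluation metrics named in the abstract can be illustrated with a minimal sketch. This is not the authors' evaluation code; it uses only the standard textbook definitions of Purity and ARI, and the function names and toy labels are hypothetical, chosen for illustration.

```python
from collections import Counter
from math import comb


def purity(labels_true, labels_pred):
    """Fraction of points that fall in the majority true class of their cluster."""
    clusters = {}
    for t, p in zip(labels_true, labels_pred):
        clusters.setdefault(p, Counter())[t] += 1
    return sum(max(c.values()) for c in clusters.values()) / len(labels_true)


def adjusted_rand_index(labels_true, labels_pred):
    """Adjusted Rand Index computed from the contingency table of pair counts."""
    n = len(labels_true)
    if n < 2:
        return 1.0  # trivial case: no pairs to compare
    # Pairs of points that share both the true class and the predicted cluster.
    sum_cells = sum(comb(c, 2) for c in Counter(zip(labels_true, labels_pred)).values())
    # Pairs that share a true class / a predicted cluster, respectively.
    sum_true = sum(comb(c, 2) for c in Counter(labels_true).values())
    sum_pred = sum(comb(c, 2) for c in Counter(labels_pred).values())
    expected = sum_true * sum_pred / comb(n, 2)  # agreement expected by chance
    max_index = (sum_true + sum_pred) / 2
    if max_index == expected:
        return 1.0  # degenerate partitions (e.g. a single cluster on both sides)
    return (sum_cells - expected) / (max_index - expected)


# Hypothetical toy labels: three predicted clusters for two true classes.
y_true = [0, 0, 0, 1, 1, 1]
y_pred = [0, 0, 1, 1, 2, 2]
print(purity(y_true, y_pred), adjusted_rand_index(y_true, y_pred))
```

Purity only rewards clusters that are internally homogeneous, so it does not penalize splitting a class into several clusters, whereas ARI is corrected for chance and does penalize such splits. This is one general reason why a method can score higher on Purity and lower on ARI than another.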
Pages: 30
Related papers
  • [1] A similarity-based approach to time-varying coefficient non-stationary autoregression
    Lieberman, Offer
    JOURNAL OF TIME SERIES ANALYSIS, 2012, 33 (03) : 484 - 502
  • [2] A heterogeneous online learning ensemble for non-stationary environments
    Idrees, Mobin M.
    Minku, Leandro L.
    Stahl, Frederic
    Badii, Atta
    KNOWLEDGE-BASED SYSTEMS, 2020, 188
  • [3] An Online Algorithm for Computation Offloading in Non-Stationary Environments
    Rahman, Aniq Ur
    Ghatak, Gourab
    De Domenico, Antonio
    IEEE COMMUNICATIONS LETTERS, 2020, 24 (10) : 2167 - 2171
  • [4] A Non-Stationary Online Learning Approach to Mobility Management
    Zhou, Yiming
    Shen, Cong
    van der Schaar, Mihaela
    IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2019, 18 (02) : 1434 - 1446
  • [5] A Non-Stationary Online Learning Approach to Mobility Management
    Zhou, Yiming
    Shen, Cong
    Luo, Xiliang
    van der Schaar, Mihaela
    2018 IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS (ICC), 2018,
  • [6] Non-stationary phase correction based on local similarity
    Liu, Yujin
    Li, Zhenchun
    Guo, Kai
    Shiyou Diqiu Wuli Kantan/Oil Geophysical Prospecting, 2012, 47 (06) : 887 - 893
  • [7] Online Detection and Fuzzy Clustering of Anomalies in Non-Stationary Time Series
    He, Changjiang
    Leslie, David S.
    Grant, James A.
    SIGNALS, 2024, 5 (01) : 40 - 59
  • [8] Online Clustering of Non-stationary Data Using Incremental and Decremental SVM
    Boukharouba, Khaled
    Lecoeuche, Stephane
    ARTIFICIAL NEURAL NETWORKS - ICANN 2008, PT I, 2008, 5163 : 336 - 345
  • [9] Similarity-based Attention Embedding Approach for Attributed Graph Clustering
    Weng, Wei
    Li, Tong
    Liao, Jian-Chao
    Guo, Feng
    Chen, Fen
    Wei, Bo-Wen
    Journal of Network Intelligence, 2022, 7 (04) : 848 - 861
  • [10] Incremental kernel spectral clustering for online learning of non-stationary data
    Langone, Rocco
    Agudelo, Oscar Mauricio
    De Moor, Bart
    Suykens, Johan A. K.
    NEUROCOMPUTING, 2014, 139 : 246 - 260