Supervised Adaptive Incremental Clustering for data stream of chunks

被引:9
|
作者
Zheng, Laiwen
Huo, Hong [1 ]
Guo, Yiyou
Fang, Tao
机构
[1] Shanghai Jiao Tong Univ, Dept Automat, Shanghai 200030, Peoples R China
基金
中国国家自然科学基金;
关键词
Automatic clustering; Adaptive update; Classification; Data stream of chunks; Supervised clustering; CLASSIFICATION; ALGORITHM; SELECTION;
D O I
10.1016/j.neucom.2016.09.054
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Many supervised clustering algorithms have been developed to find the optimal clusters for static datasets by presetting some parameters, but they are seldom suitable for dynamic datasets, such as the data stream of chunks. To find the optimal clusters of the data stream of chunks, a novel Supervised Adaptive Incremental Clustering (SAIC) algorithm is proposed. SAIC can cluster dynamic datasets of arbitrary shapes and sizes automatically. It includes learning and post-processing phases. In the learning phase, each cluster updates adaptively according to its learning rate that is calculated from its counter value. All data points are shuffled at each iteration in order to make SAIC insensitive to the input order of data points. In the post-processing phase, the outliers or boundary points are eliminated according to the counter value of each cluster and the number of iterations. Four synthetic datasets and fourteen UCI datasets are used to evaluate the performance of SAIC, respectively. The experiments on UCI datasets show that SAIC reaches to or outperforms some other supervised clustering algorithms and several unsupervised incremental clustering algorithms. In addition, three data stream of chunks are used to evaluate SAIC from different aspects, which shows SAIC has the scalability and incremental learning ability for the clustering of data streams of chunks.
引用
收藏
页码:502 / 517
页数:16
相关论文
共 50 条
  • [1] Incremental adaptive semi-supervised fuzzy clustering for data stream classification
    Casalino, Gabriella
    Castellano, Giovanna
    Mencar, Corrado
    [J]. PROCEEDINGS OF THE 2018 IEEE INTERNATIONAL CONFERENCE ON EVOLVING AND ADAPTIVE INTELLIGENT SYSTEMS (EAIS), 2018,
  • [2] Incremental semi-supervised clustering in a data stream with a flock of agents
    Bruneau, Pierrick
    Picarougne, Fabien
    Gelgon, Marc
    [J]. 2009 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION, VOLS 1-5, 2009, : 3067 - 3074
  • [3] Incremental and Adaptive Clustering Stream Data over Sliding Window
    Dang, Xuan Hong
    Lee, Vincent C. S.
    Ng, Wee Keong
    Ong, Kok Leong
    [J]. DATABASE AND EXPERT SYSTEMS APPLICATIONS, PROCEEDINGS, 2009, 5690 : 660 - +
  • [4] Data Stream Classification by Dynamic Incremental Semi-Supervised Fuzzy Clustering
    Casalino, Gabriella
    Castellano, Giovanna
    Mencar, Corrado
    [J]. INTERNATIONAL JOURNAL ON ARTIFICIAL INTELLIGENCE TOOLS, 2019, 28 (08)
  • [5] Data Stream Classification by Adaptive Semi-supervised Fuzzy Clustering
    Castellano, Giovanna
    Fanelli, Anna Maria
    [J]. ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING, PT II, 2017, 10614 : 770 - 771
  • [6] A Fast Incremental Kernel Principal Component Analysis for Learning Stream of Data Chunks
    Tokumoto, Takaomi
    Ozawa, Seiichi
    [J]. 2011 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2011, : 2881 - 2888
  • [7] A dynamic hierarchical incremental learning-based supervised clustering for data stream with considering concept drift
    Nikpour, Soheila
    Asadi, Shahrokh
    [J]. JOURNAL OF AMBIENT INTELLIGENCE AND HUMANIZED COMPUTING, 2022, 13 (6) : 2983 - 3003
  • [8] A dynamic hierarchical incremental learning-based supervised clustering for data stream with considering concept drift
    Soheila Nikpour
    Shahrokh Asadi
    [J]. Journal of Ambient Intelligence and Humanized Computing, 2022, 13 : 2983 - 3003
  • [9] Split incremental clustering algorithm of mixed data stream
    Siwar Gorrab
    Fahmi Ben Rejab
    Kaouther Nouira
    [J]. Progress in Artificial Intelligence, 2024, 13 : 51 - 64
  • [10] Incremental Clustering Approach for Evolving Trajectory Data Stream
    Shein, Thi Thi
    Puntheeranurak, Sutheera
    [J]. 2018 6TH INTERNATIONAL ELECTRICAL ENGINEERING CONGRESS (IEECON), 2018,