Two-Stage Sparse Representation Clustering for Dynamic Data Streams

被引:4
|
作者
Chen, Jie [1 ]
Wang, Zhu [2 ]
Yang, Shengxiang [3 ]
Mao, Hua [4 ]
机构
[1] Sichuan Univ, Coll Comp Sci, Chengdu 610065, Peoples R China
[2] Sichuan Univ, Law Sch, Chengdu 610065, Peoples R China
[3] De Montfort Univ, Sch Comp Sci & Informat, Leicester LE1 9BH, Leics, England
[4] Northumbria Univ, Dept Comp & Informat Sci, Newcastle Upon Tyne NE1 8ST, Tyne & Wear, England
基金
中国国家自然科学基金;
关键词
Clustering algorithms; Dictionaries; Heuristic algorithms; Machine learning; Streaming media; Data models; Convergence; Clustering; data stream; dictionary learning; sparse representation; ALGORITHM;
D O I
10.1109/TCYB.2022.3204894
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Data streams are a potentially unbounded sequence of data objects, and the clustering of such data is an effective way of identifying their underlying patterns. Existing data stream clustering algorithms face two critical issues: 1) evaluating the relationship among data objects with individual landmark windows of fixed size and 2) passing useful knowledge from previous landmark windows to the current landmark window. Based on sparse representation techniques, this article proposes a two-stage sparse representation clustering (TSSRC) method. The novelty of the proposed TSSRC algorithm comes from evaluating the effective relationship among data objects in the landmark windows with an accurate number of clusters. First, the proposed algorithm evaluates the relationship among data objects using sparse representation techniques. The dictionary and sparse representations are iteratively updated by solving a convex optimization problem. Second, the proposed TSSRC algorithm presents a dictionary initialization strategy that seeks representative data objects by making full use of the sparse representation results. This efficiently passes previously learned knowledge to the current landmark window over time. Moreover, the convergence and sparse stability of TSSRC can be theoretically guaranteed in continuous landmark windows under certain conditions. Experimental results on benchmark datasets demonstrate the effectiveness and robustness of TSSRC.
引用
收藏
页码:6408 / 6420
页数:13
相关论文
共 50 条
  • [31] Two-stage clustering in genotype-by-environment analyses with missing data
    Godfrey, AJR
    Wood, GR
    Ganesalingam, S
    Nichols, MA
    Qiao, CG
    JOURNAL OF AGRICULTURAL SCIENCE, 2002, 139 : 67 - 77
  • [32] A two-stage clustering algorithm for multi-type relational data
    Gao, Ying
    Liu, Da-you
    Sun, Cheng-min
    Liu, He
    PROCEEDINGS OF NINTH ACIS INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING, ARTIFICIAL INTELLIGENCE, NETWORKING AND PARALLEL/DISTRIBUTED COMPUTING, 2008, : 376 - 380
  • [33] A Nested Two-Stage Clustering Method for Structured Temporal Sequence Data
    Wang, Liang
    Narayanan, Vignesh
    Yu, Yao-Chi
    Park, Yikyung
    Li, Jr-Shin
    KNOWLEDGE AND INFORMATION SYSTEMS, 2021, 63 (07) : 1627 - 1662
  • [34] A two-stage online monitoring procedure for high-dimensional data streams
    Li, Jun
    JOURNAL OF QUALITY TECHNOLOGY, 2019, 51 (04) : 392 - 406
  • [35] A two-stage monitoring scheme for multiple high-dimensional data streams
    Wang, Tao
    Shi, Pin
    Zang, Qingpei
    Li, Zhonghua
    JOURNAL OF STATISTICAL COMPUTATION AND SIMULATION, 2024, 94 (12) : 2597 - 2617
  • [36] Two-Stage Clustering of Household Electricity Load Shapes for Improved Temporal Pattern Representation
    Afzalan, Milad
    Jazizadeh, Farrokh
    Eldardiry, Hoda
    IEEE ACCESS, 2021, 9 : 151667 - 151680
  • [37] A new sparse representation learning of complex data: Application to dynamic clustering of web navigation
    Rastin, Parisa
    Cabanes, Guenael
    Matei, Basarab
    Bennani, Younes
    Marty, Jean-Marc
    PATTERN RECOGNITION, 2019, 91 : 291 - 307
  • [38] CHRONICLE: A Two-Stage Density-Based Clustering Algorithm for Dynamic Networks
    Kim, Min-Soo
    Han, Jiawei
    DISCOVERY SCIENCE, PROCEEDINGS, 2009, 5808 : 152 - 167
  • [39] Two-Stage User Identification Based on User Topology Dynamic Community Clustering
    Zhang, Jiajing
    Yuan, Zhenhua
    Xu, Neng
    Chen, Jinlan
    Wang, Juxiang
    COMPLEXITY, 2021, 2021
  • [40] Two-Stage Multiscale Search for Sparse Targets
    Bashan, Eran
    Newstadt, Gregory
    Hero, Alfred O., III
    IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2011, 59 (05) : 2331 - 2341