An Improved Visual Assessment with Data-Dependent Kernel for Stream Clustering

被引:0
|
作者
Zhang, Baojie [1 ]
Cao, Yang [2 ]
Zhu, Ye [2 ]
Rajasegarar, Sutharshan [2 ]
Liu, Gang [3 ]
Li, Hong Xian [2 ]
Angelova, Maia [2 ]
Li, Gang [2 ]
机构
[1] Xian Shiyou Univ, Xian 710065, Shaanxi, Peoples R China
[2] Deakin Univ, Burwood, Vic 3125, Australia
[3] Harbin Engn Univ, Harbin 150001, Peoples R China
关键词
Cluster tendency assessment; VAT; Isolation kernel; Clustering; Data stream;
D O I
10.1007/978-3-031-33374-3_16
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The advances of 5G and the Internet of Things enable more devices and sensors to be interconnected. Unlike traditional data, the large amount of data generated from various sensors and devices requires real-time analysis. The data objects in a stream will change over time and only have a single access. Thus, traditional methods no longer meet the needs of fast exploratory data analysis for continuously generated data. Cluster tendency assessment is an effective method to determine the number of potential clusters. Recently, there are methods based on Visual Assessment of cluster Tendency (VAT) proposed for visualising cluster structures in streaming data using cluster heat maps. However, those heat maps rely on Euclidean distance that does not consider the data distribution characteristics. Consequently, it would be difficult to separate adjacent clusters of varied densities. In this paper, we discuss this issue for the latest inc-siVAT method, and propose to use a data-dependent kernel method to overcome it for clustering streaming data. Extensive evaluation on 7 large synthetic and real-world datasets shows the superiority of kernel-based inc-siVAT over 4 recently published state-of-the-art online and offline clustering algorithms.
引用
收藏
页码:197 / 209
页数:13
相关论文
共 50 条
  • [1] Data-Dependent Sparsity for Subspace Clustering
    Xin, Bo
    Wang, Yizhou
    Gao, Wen
    Wipf, David
    [J]. CONFERENCE ON UNCERTAINTY IN ARTIFICIAL INTELLIGENCE (UAI2017), 2017,
  • [2] Data-dependent kernel machines for Microarray data classification
    Xiong, Huilin
    Zhang, Ya
    Chen, Xue-Wen
    [J]. IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2007, 4 (04) : 583 - 595
  • [3] A criterion for learning the data-dependent kernel for classification
    Li, Jun-Bao
    Chu, Shu-Chuan
    Pan, Jeng-Shyang
    [J]. ADVANCED DATA MINING AND APPLICATIONS, PROCEEDINGS, 2007, 4632 : 365 - +
  • [4] Principal Composite Kernel Feature Analysis: Data-Dependent Kernel Approach
    Motai, Yuichi
    Yoshida, Hiroyuki
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2013, 25 (08) : 1863 - 1875
  • [5] Optimizing the data-dependent kernel under a unified kernel optimization framework
    Chen, Bo
    Liu, Hongwei
    Bao, Zheng
    [J]. PATTERN RECOGNITION, 2008, 41 (06) : 2107 - 2119
  • [6] Similarity Learning via Optimizing the Data-Dependent Kernel
    Xiong, Huilin
    Shi, Panfei
    [J]. 2009 INTERNATIONAL JOINT CONFERENCE ON BIOINFORMATICS, SYSTEMS BIOLOGY AND INTELLIGENT COMPUTING, PROCEEDINGS, 2009, : 512 - 516
  • [7] Generalized Multiple Kernel Learning With Data-Dependent Priors
    Mao, Qi
    Tsang, Ivor W.
    Gao, Shenghua
    Wang, Li
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2015, 26 (06) : 1134 - 1148
  • [8] Multiple data-dependent kernel for classification of hyperspectral images
    He, Zhi
    Li, Junbao
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2015, 42 (03) : 1118 - 1135
  • [9] Large-Scale Data-Dependent Kernel Approximation
    Ionescu, Catalin
    Popa, Alin-Ionut
    Sminchisescu, Cristian
    [J]. ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 54, 2017, 54 : 19 - 27
  • [10] Visual Process Monitoring by Data-Dependent Kernel Discriminant Analysis with t-Distributed Similarities
    Wei, Chihang
    Wen, Chenglin
    He, Jieguang
    Song, Zhihuan
    [J]. ACS OMEGA, 2023, 8 (41): : 38013 - 38024