Automatic aspect discrimination in data clustering

被引:10
|
作者
Horta, Danilo [1 ]
Campello, Ricardo J. G. B. [1 ]
机构
[1] Univ Sao Paulo, ICMC, BR-13560970 Sao Carlos, SP, Brazil
基金
巴西圣保罗研究基金会;
关键词
Clustering; Aspect discrimination; Attribute weighting; Cluster validation; FUZZY EXTENSION; RELATIONAL DATA; VALIDITY; AGGREGATION; VALIDATION; ALGORITHMS; COMPLEXITY; CRITERION; INDEXES;
D O I
10.1016/j.patcog.2012.05.011
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The attributes describing a data set may often be arranged in meaningful subsets, each of which corresponds to a different aspect of the data. An unsupervised algorithm (SCAD) that simultaneously performs fuzzy clustering and aspects weighting was proposed in the literature. However, SCAD may fail and halt given certain conditions. To fix this problem, its steps are modified and then reordered to reduce the number of parameters required to be set by the user. In this paper we prove that each step of the resulting algorithm, named ASCAD, globally minimizes its cost-function with respect to the argument being optimized. The asymptotic analysis of ASCAD leads to a time complexity which is the same as that of fuzzy c-means. A hard version of the algorithm and a novel validity criterion that considers aspect weights in order to estimate the number of clusters are also described. The proposed method is assessed over several artificial and real data sets. (C) 2012 Elsevier Ltd. All rights reserved.
引用
收藏
页码:4370 / 4388
页数:19
相关论文
共 50 条
  • [1] Automatic clustering of hyperspectral data
    Salomon, R.
    Dolberg, S.
    Rotman, S. R.
    2006 IEEE 24TH CONVENTION OF ELECTRICAL & ELECTRONICS ENGINEERS IN ISRAEL, 2006, : 334 - +
  • [2] Automatic similarity detection and clustering of data
    Einstein, Craig
    Chin, Peter
    CYBER SENSING 2017, 2017, 10185
  • [3] Automatic clustering algorithm for fuzzy data
    Hung, Wen-Liang
    Yang, Jenn-Hwai
    JOURNAL OF APPLIED STATISTICS, 2015, 42 (07) : 1503 - 1518
  • [4] Improved preprocessing and data clustering for landmine discrimination
    Mereddy, P
    Agarwal, S
    Rao, V
    DETECTION AND REMEDIATION TECHNOLOGIES FOR MINES AND MINELIKE TARGETS V, PTS 1 AND 2, 2000, 4038 : 1341 - 1351
  • [5] Automatic Smoke Detection in MODIS Satellite Data based on K-means Clustering and Fisher Linear Discrimination
    Li, Xiaolian
    Wang, Jing
    Song, Weiguo
    Ma, Jian
    Telesca, Luciano
    Zhang, Yongming
    PHOTOGRAMMETRIC ENGINEERING AND REMOTE SENSING, 2014, 80 (10): : 971 - 982
  • [6] COMPUTATIONAL APPROACHES TO AUTOMATIC DATA CLUSTERING AND CLASSIFICATION
    SUMPTER, BG
    NOID, DW
    COMPUTATIONAL POLYMER SCIENCE, 1995, 5 (03): : 121 - 134
  • [7] A Bacterial Evolutionary Algorithm for Automatic Data Clustering
    Das, Swagatam
    Chowdhury, Archana
    Abraham, Ajith
    2009 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION, VOLS 1-5, 2009, : 2403 - +
  • [8] Elastic Differential Evolution for Automatic Data Clustering
    Chen, Jun-Xian
    Gong, Yue-Jiao
    Chen, Wei-Neng
    Li, Mengting
    Zhang, Jun
    IEEE TRANSACTIONS ON CYBERNETICS, 2021, 51 (08) : 4134 - 4147
  • [9] Automatic Subspace Clustering of High Dimensional Data
    Rakesh Agrawal
    Johannes Gehrke
    Dimitrios Gunopulos
    Prabhakar Raghavan
    Data Mining and Knowledge Discovery, 2005, 11 : 5 - 33
  • [10] Automatic database clustering using data mining
    Guinepain, Sylvain
    Gruenwald, Le
    SEVENTEENTH INTERNATIONAL CONFERENCE ON DATABASE AND EXPERT SYSTEMS APPLICATIONS, PROCEEDINGS, 2006, : 124 - +