Automatic aspect discrimination in data clustering

被引:10
|
作者
Horta, Danilo [1 ]
Campello, Ricardo J. G. B. [1 ]
机构
[1] Univ Sao Paulo, ICMC, BR-13560970 Sao Carlos, SP, Brazil
基金
巴西圣保罗研究基金会;
关键词
Clustering; Aspect discrimination; Attribute weighting; Cluster validation; FUZZY EXTENSION; RELATIONAL DATA; VALIDITY; AGGREGATION; VALIDATION; ALGORITHMS; COMPLEXITY; CRITERION; INDEXES;
D O I
10.1016/j.patcog.2012.05.011
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The attributes describing a data set may often be arranged in meaningful subsets, each of which corresponds to a different aspect of the data. An unsupervised algorithm (SCAD) that simultaneously performs fuzzy clustering and aspects weighting was proposed in the literature. However, SCAD may fail and halt given certain conditions. To fix this problem, its steps are modified and then reordered to reduce the number of parameters required to be set by the user. In this paper we prove that each step of the resulting algorithm, named ASCAD, globally minimizes its cost-function with respect to the argument being optimized. The asymptotic analysis of ASCAD leads to a time complexity which is the same as that of fuzzy c-means. A hard version of the algorithm and a novel validity criterion that considers aspect weights in order to estimate the number of clusters are also described. The proposed method is assessed over several artificial and real data sets. (C) 2012 Elsevier Ltd. All rights reserved.
引用
收藏
页码:4370 / 4388
页数:19
相关论文
共 50 条
  • [21] An efficient robust automatic clustering algorithm for interval data
    Vo-Van Tai
    Ngoc, Lethikim
    Nguyen-Trang Thao
    COMMUNICATIONS IN STATISTICS-SIMULATION AND COMPUTATION, 2023, 52 (10) : 4621 - 4635
  • [22] Automatic hierarchical clustering algorithm for remote sensing data
    Sidorova V.S.
    Pattern Recognition and Image Analysis, 2011, 21 (02) : 328 - 331
  • [23] A Multi-Objective Genetic Algorithm with Fuzzy Relational Clustering for Automatic Data Clustering
    Kundu, Animesh
    Paull, Animesh Kumar
    Shill, Pintu Chandra
    Murase, Kazuyuki
    2015 2ND INTERNATIONAL CONFERENCE ON ELECTRICAL INFORMATION AND COMMUNICATION TECHNOLOGY (EICT), 2015, : 89 - 94
  • [24] DISCRIMINATION - ASPECT OF HELPING PROCESS
    HARGROVE, DS
    PORTER, TL
    JOURNAL OF RESEARCH AND DEVELOPMENT IN EDUCATION, 1971, 4 (02): : 28 - 34
  • [25] Automatic clustering algorithm for interval data based on overlap distance
    Lethikim, Ngoc
    Lehoang, Tuan
    Vovan, Tai
    COMMUNICATIONS IN STATISTICS-SIMULATION AND COMPUTATION, 2023, 52 (05) : 2194 - 2209
  • [26] Fuzzy clustering of distributional data with automatic weighting of variable components
    Irpino, Antonio
    Verde, Rosanna
    de Carvalho, Francisco de A. T.
    INFORMATION SCIENCES, 2017, 406 : 248 - 268
  • [27] Hybrid Symbiotic Organism Search algorithms for Automatic Data Clustering
    Rajah, Vidushen
    Ezugwu, Absalom E.
    2020 CONFERENCE ON INFORMATION COMMUNICATIONS TECHNOLOGY AND SOCIETY (ICTAS), 2020,
  • [28] Unsupervised Clustering of Clickthrough Data for Automatic Annotation of Multimedia Content
    Ntalianis, Klimis
    Doulamis, Anastasios
    Tsapatsoulis, Nicolas
    Doulamis, Nikolaos
    ARTIFICIAL NEURAL NETWORKS - ICANN 2009, PT II, 2009, 5769 : 895 - +
  • [29] Clustering interval-valued data with automatic variables weighting
    Rizo Rodriguez, Sara Ines
    Tenorio de Carvalho, Francisco de Assis
    2019 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2019,
  • [30] Dynamic Niching Genetic Algorithm with Data Attraction for Automatic Clustering
    常冬霞
    张贤达
    TsinghuaScienceandTechnology, 2009, 14 (06) : 718 - 724