ITERATE: A conceptual clustering algorithm for data mining

被引:34
|
作者
Biswas, G [1 ]
Weinberg, JB
Fisher, DH
机构
[1] Vanderbilt Univ, Dept Comp Sci, Nashville, TN 37235 USA
[2] So Illinois Univ, Dept Comp Sci, Edwardsville, IL 62026 USA
关键词
concept formation; conceptual clustering; criterion function; data mining; iterative redistribution; knowledge discovery; order bias;
D O I
10.1109/5326.669556
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The data exploration task can be divided into three interrelated subtasks: 1) feature selection, 2) discovery, and 3) interpretation, This paper describes an unsupervised discovery method with biases geared toward partitioning objects into clusters that improve interpretability. The algorithm ITERATE employs: 1) a data ordering scheme and 2) an iterative redistribution operator to produce maximally cohesive and distinct clusters. Cohesion or intraclass similarity is measured in terms of the match between individual objects and their assigned cluster prototype. Distinctness or interclass dissimilarity is measured by an average of the variance of the distribution match between clusters. We demonstrate that interpretability, from a problem-solving viewpoint, is addressed by the intraclass and interclass measures. Empirical results demonstrate the properties of the discovery algorithm and its applications to problem solving.
引用
收藏
页码:219 / 230
页数:12
相关论文
共 50 条
  • [1] A modified clustering algorithm for data mining
    Xu, ZJ
    Wang, LS
    Luo, JC
    Zhang, JQ
    [J]. IGARSS 2005: IEEE International Geoscience and Remote Sensing Symposium, Vols 1-8, Proceedings, 2005, : 741 - 744
  • [2] An Effective Clustering Algorithm for Data Mining
    Vijendra, Singh
    Ashwini, Kelkar
    Laxman, Sahoo
    [J]. PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON DATA STORAGE AND DATA ENGINEERING (DSDE 2010), 2010, : 250 - 253
  • [3] Distributed Clustering Algorithm for Spatial Data Mining
    Bendechache, Malika
    Kechadi, M-Tahar
    [J]. PROCEEDINGS 2015 SECOND IEEE INTERNATIONAL CONFERENCE ON SPATIAL DATA MINING AND GEOGRAPHICAL KNOWLEDGE SERVICES (ICSDM 2015), 2015, : 60 - 65
  • [4] Hierarchical Sequence Clustering Algorithm for Data Mining
    Chezhian, V. Umadevi
    Subash, Thanappan
    Samy, M. Ragavan
    [J]. WORLD CONGRESS ON ENGINEERING, WCE 2011, VOL III, 2011, : 1861 - 1864
  • [5] Dual clustering algorithm for spatial data mining
    Zhou, Jiaogen
    Guan, Jihong
    Bian, Fuling
    [J]. Journal of Computational Information Systems, 2006, 2 (04): : 1405 - 1410
  • [6] Clustering Algorithm and Its Application in Data Mining
    Zou, Hailei
    [J]. WIRELESS PERSONAL COMMUNICATIONS, 2020, 110 (01) : 21 - 30
  • [7] Clustering Algorithm and Its Application in Data Mining
    Hailei Zou
    [J]. Wireless Personal Communications, 2020, 110 : 21 - 30
  • [8] Neural Network Data Mining Clustering Optimization Algorithm
    Jiao, Guie
    Li, Wang
    [J]. IETE JOURNAL OF RESEARCH, 2021,
  • [9] The Application of Data Mining Clustering Algorithm in Fuzzy Control
    Li Guodong
    Xia Kewen
    [J]. SOFTWARE ENGINEERING AND KNOWLEDGE ENGINEERING: THEORY AND PRACTICE, VOL 1, 2012, 114 : 105 - 113
  • [10] An ant-based clustering algorithm in data mining
    Tang, Y
    Ma, YK
    [J]. SHAPING BUSINESS STRATEGY IN A NETWORKED WORLD, VOLS 1 AND 2, PROCEEDINGS, 2004, : 1101 - 1105