ITERATE: A conceptual clustering algorithm for data mining

Cited by: 34
Authors
Biswas, G [1]
Weinberg, JB
Fisher, DH
Affiliations
[1] Vanderbilt Univ, Dept Comp Sci, Nashville, TN 37235 USA
[2] So Illinois Univ, Dept Comp Sci, Edwardsville, IL 62026 USA
Keywords
concept formation; conceptual clustering; criterion function; data mining; iterative redistribution; knowledge discovery; order bias
DOI
10.1109/5326.669556
Chinese Library Classification number
TP18 [Artificial Intelligence Theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
The data exploration task can be divided into three interrelated subtasks: 1) feature selection, 2) discovery, and 3) interpretation. This paper describes an unsupervised discovery method with biases geared toward partitioning objects into clusters that improve interpretability. The algorithm ITERATE employs: 1) a data ordering scheme and 2) an iterative redistribution operator to produce maximally cohesive and distinct clusters. Cohesion or intraclass similarity is measured in terms of the match between individual objects and their assigned cluster prototype. Distinctness or interclass dissimilarity is measured by an average of the variance of the distribution match between clusters. We demonstrate that interpretability, from a problem-solving viewpoint, is addressed by the intraclass and interclass measures. Empirical results demonstrate the properties of the discovery algorithm and its applications to problem solving.
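The iterative redistribution idea mentioned in the abstract can be illustrated with a minimal sketch. This is not the ITERATE implementation: the abstract does not give the match measure or the ordering scheme, so the code assumes categorical attributes, represents each cluster prototype as per-attribute value frequencies, and scores an object by the mean probability its values receive under a prototype. The names `prototype`, `match`, and `redistribute` are hypothetical.

```python
from collections import Counter


def prototype(members):
    """Per-attribute value frequencies for a non-empty list of objects."""
    n = len(members)
    return [
        {value: count / n for value, count in Counter(column).items()}
        for column in zip(*members)
    ]


def match(obj, proto):
    """Assumed match score: mean probability the prototype gives obj's values."""
    return sum(dist.get(v, 0.0) for v, dist in zip(obj, proto)) / len(obj)


def redistribute(objects, labels, max_passes=20):
    """Reassign every object to its best-matching cluster until nothing moves."""
    labels = list(labels)
    for _ in range(max_passes):
        # Rebuild the clusters and their prototypes from the current labels.
        clusters = {}
        for obj, lab in zip(objects, labels):
            clusters.setdefault(lab, []).append(obj)
        protos = {lab: prototype(m) for lab, m in clusters.items()}
        # Ties go to the smallest label; a cluster that loses all members vanishes.
        new_labels = [
            max(sorted(protos), key=lambda lab: match(obj, protos[lab]))
            for obj in objects
        ]
        if new_labels == labels:
            break
        labels = new_labels
    return labels


objects = [
    ("red", "round", "small"),
    ("red", "round", "large"),
    ("red", "round", "small"),
    ("blue", "square", "large"),
    ("blue", "square", "small"),
    ("blue", "square", "large"),
]
# The third object starts in the wrong cluster and is moved on the first pass.
result = redistribute(objects, [0, 0, 1, 1, 1, 1])
print(result)  # → [0, 0, 0, 1, 1, 1]
```

In this toy run the partition stabilizes after one reassignment pass. The initial partition and the data ordering that produces it, which the abstract names as the other half of the method, are outside the scope of this sketch.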
Pages: 219-230
Page count: 12
Related papers
50 in total
  • [21] The Effect of Clustering in the Apriori Data Mining Algorithm: A Case Study
    Yilmaz, Nergis
    Alptekin, Gulfem Isiklar
    [J]. WORLD CONGRESS ON ENGINEERING - WCE 2013, VOL III, 2013, : 1611 - +
  • [22] LC: A conceptual clustering algorithm
    Martínez-Trinidad, JF
    Sánchez-Díaz, G
    [J]. MACHINE LEARNING AND DATA MINING IN PATTERN RECOGNITION, 2001, 2123 : 117 - 127
  • [23] A Kind of Improved Data Clustering Algorithm in Web Log Mining
    Guo, Jin
    Zhang, Shengbing
    Qiu, Zheng
    [J]. PROCEEDINGS OF THE 2015 INTERNATIONAL CONFERENCE ON INTELLIGENT SYSTEMS RESEARCH AND MECHATRONICS ENGINEERING, 2015, 121 : 2115 - 2119
  • [24] Performance Analysis of Clustering Algorithm in Data Mining in R Language
    Reddy, Avulapalli Jayaram
    Tripathy, Balakrushna
    Nimje, Seema
    Ganga, Gopalam Sree
    Varnasree, Kamireddy
    [J]. SOFT COMPUTING SYSTEMS, ICSCS 2018, 2018, 837 : 364 - 372
  • [25] Fast implementation of dual clustering algorithm for spatial data mining
    Zhou, Jiaogen
    Bian, Fuling
    Guan, Jihong
    Zhang, Meng
    [J]. FOURTH INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS AND KNOWLEDGE DISCOVERY, VOL 3, PROCEEDINGS, 2007, : 568 - +
  • [26] Research on data mining clustering algorithm in cloud computing environment
    Du, Li
    [J]. BASIC & CLINICAL PHARMACOLOGY & TOXICOLOGY, 2021, 128 : 179 - 180
  • [27] LogCluster - A Data Clustering and Pattern Mining Algorithm for Event Logs
    Vaarandi, Risto
    Pihelgas, Mauno
    [J]. 2015 11TH INTERNATIONAL CONFERENCE ON NETWORK AND SERVICE MANAGEMENT (CNSM), 2015, : 1 - 7
  • [28] A data clustering algorithm for mining patterns from event logs
    Vaarandi, R
    [J]. PROCEEDINGS OF THE 3RD IEEE WORKSHOP ON IP OPERATIONS & MANAGEMENT (IPOM2003), 2003, : 119 - 126
  • [29] Data Mining Using Clustering Algorithm as Tool for Poverty Analysis
    Talingdan, Janelyn A.
    [J]. 2019 8TH INTERNATIONAL CONFERENCE ON SOFTWARE AND COMPUTER APPLICATIONS (ICSCA 2019), 2019, : 56 - 59
  • [30] A new data mining method based on fusion clustering algorithm
    Wang, TZ
    Tang, TH
    [J]. PROCEEDINGS OF THE 2005 INTERNATIONAL CONFERENCE ON NEURAL NETWORKS AND BRAIN, VOLS 1-3, 2005, : 706 - 711