Discovering fuzzy clusters in databases using an evolutionary approach

被引:0
|
作者
Chung, LLH [1 ]
Chan, KCC [1 ]
Leung, H [1 ]
机构
[1] Hong Kong Polytech Univ, Dept Comp, Kowloon, Hong Kong, Peoples R China
关键词
data mining; fuzzy clustering; linguistic terms; and genetic algorithm;
D O I
10.1117/12.381728
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we present a fuzzy clustering technique for relational database for data mining task. Clustering task for data mining application can be performed more effective if the technique is able to handle both continuous- and discrete-valued data commonly found in real-life relational databases. However, many of fuzzy clustering techniques such as fuzzy c-means are developed only for continuous-valued data due to their distance measure defined in the Euclidean space. When attributes are also characterized by discrete-valued attribute, they are unable to perform their task. Besides, how to deal with fuzzy input data in addition to mixed continuous and discrete is not clearly discussed. Instead of using a distance measure for defining similarity between records, we propose a technique based on a genetic algorithm (GA). By representing a specific grouping of records in a chromosome and using an objective measure as a fitness measure to determine if such grouping is meaningful and interesting, our technique is able to handle continuous, discrete, and even fuzzy input data. Unlike many of the existing clustering techniques, which can only produce the result of grouping with no interpretation, our proposed algorithm is able to generate a set of rules describing the interestingness of the discovered clusters. This feature, in rum, eases the understandability of the discovered result.
引用
收藏
页码:11 / 21
页数:11
相关论文
共 50 条
  • [1] Discovering clusters in gene expression data using evolutionary approach
    Ma, PCH
    Chan, KCC
    [J]. 15TH IEEE INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 2003, : 459 - 466
  • [2] Discovering knowledge from medical databases using evolutionary algorithms
    Wong, ML
    Lam, W
    Leung, KS
    Ngan, PS
    Cheng, JCY
    [J]. IEEE ENGINEERING IN MEDICINE AND BIOLOGY MAGAZINE, 2000, 19 (04): : 45 - 55
  • [3] DISCOVERING CONCEPT CLUSTERS BY DECOMPOSING DATABASES
    ZHONG, N
    OHSUGA, S
    [J]. DATA & KNOWLEDGE ENGINEERING, 1994, 12 (02) : 223 - 244
  • [4] A new approach for discovering fuzzy quantitative sequential patterns in sequence databases
    Chen, Yen-Liang
    Huang, Tony Cheng-Kui
    [J]. FUZZY SETS AND SYSTEMS, 2006, 157 (12) : 1641 - 1661
  • [5] An effective algorithm for discovering fuzzy rules in relational databases
    Au, WH
    Chan, KCC
    [J]. 1998 IEEE INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS AT THE IEEE WORLD CONGRESS ON COMPUTATIONAL INTELLIGENCE - PROCEEDINGS, VOL 1-2, 1998, : 1314 - 1319
  • [6] DISCOVERING FUZZY RULES IN DATABASES WITH LINGUISTIC VARIABLE ELIMINATION
    Bohacik, Jan
    [J]. NEURAL NETWORK WORLD, 2010, 20 (01) : 45 - 61
  • [7] Using the Evolutionary Computation Approach in the Initial Phase of Protocol Discovering
    Palka, Dariusz
    Piekarczyk, Marcin
    Wojcik, Krzysztof
    [J]. ARTIFICIAL INTELLIGENCEAND SOFT COMPUTING, PT I, 2019, 11508 : 493 - 505
  • [8] Discovering Moving Clusters from Spatial-Temporal Databases
    Hwang, San-Yih
    Lee, Chien-Ming
    Lee, Chien-Hsiang
    [J]. ISDA 2008: EIGHTH INTERNATIONAL CONFERENCE ON INTELLIGENT SYSTEMS DESIGN AND APPLICATIONS, VOL 1, PROCEEDINGS, 2008, : 111 - 114
  • [9] Discovering fuzzy functional dependencies as semantic knowledge in large databases
    Wang, X
    Chen, GQ
    [J]. SHAPING BUSINESS STRATEGY IN A NETWORKED WORLD, VOLS 1 AND 2, PROCEEDINGS, 2004, : 1136 - 1139
  • [10] An Evolutionary Fuzzy c-Means Approach for Clustering of Bio-informatics databases
    Di Nuovo, Alessandro G.
    Catania, Vincenzo
    [J]. 2008 IEEE INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS, VOLS 1-5, 2008, : 2079 - 2084