Genetic algorithms for auto-clustering in KDD

被引:0
|
作者
Li, Minqiang [1 ]
Li, Jianwu [1 ]
Kou, Jisong [1 ]
机构
[1] Tianjin Univ, Tianjin, China
关键词
Data reduction - Database systems - Error analysis - Optimization;
D O I
暂无
中图分类号
学科分类号
摘要
In solving the clustering problem in the context of knowledge discovery in databases (KDD), the traditional methods, for example, the K-means algorithm and its variants, usually require the users to provide the number of clusters in advance based on the pro-information. Unfortunately, the number of clusters in general is unknown to the users who are usually short of pro-information. Therefore, the clustering calculation becomes a tedious trial-and-error work, and the result is often not global optimal especially when the number of clusters is large. In this paper, a new dynamic clustering method based on genetic algorithms (GA) is proposed and applied for auto-clustering of data entities in large databases. The algorithm can automatically cluster the data according to their similarities and find the exact number of clusters. Experiment results indicate that the method is of global optimization by dynamically clustering logic.
引用
收藏
页码:53 / 58
相关论文
共 50 条
  • [1] Genetic Algorithms for Auto-Clustering in KDD
    Li Minqiang
    JournalofSystemsEngineeringandElectronics, 2000, (03) : 53 - 58
  • [2] Auto-Clustering Using Particle Swarm Optimization and Bacterial Foraging
    Olesen, Jakob R.
    Cordero, Jorge H.
    Zeng, Yifeng
    AGENTS AND DATA MINING INTERACTION, 2009, 5680 : 69 - 83
  • [3] Auto-clustering Pairs Generation Method for Siamese Neural Networks Training
    Mokin, Arseniy K.
    Gayer, Alexander, V
    Sheshkus, Alexander, V
    Arlazarov, Vladimir L.
    FOURTEENTH INTERNATIONAL CONFERENCE ON MACHINE VISION (ICMV 2021), 2022, 12084
  • [4] Auto-clustering of Financial Reports Based on Formatting Style and Author's Fingerprint
    Lambruschini, Braulio C. Blanco
    Brorsson, Mats
    Zurad, Maciej
    MACHINE LEARNING AND PRINCIPLES AND PRACTICE OF KNOWLEDGE DISCOVERY IN DATABASES, ECML PKDD 2022, PT II, 2023, 1753 : 112 - 127
  • [5] Auto-Clustering Algorithm for Heterogeneous Information Network using Improved Particle Swarm Optimization
    Liu, Changping
    Liu, Yang
    Chen, Jiashi
    MEASUREMENT TECHNOLOGY AND ITS APPLICATION, PTS 1 AND 2, 2013, 239-240 : 1448 - 1455
  • [6] Mugshot database acquisition in video surveillance networks using incremental auto-clustering quality measures
    Xiong, QR
    Jaynes, C
    IEEE CONFERENCE ON ADVANCED VIDEO AND SIGNAL BASED SURVEILLANCE, PROCEEDINGS, 2003, : 191 - 198
  • [7] Genetic clustering algorithms
    Chiou, YC
    Lan, LW
    EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 2001, 135 (02) : 413 - 427
  • [8] Genetic algorithms for clustering and fuzzy clustering
    Bandyopadhyay, Sanghamitra
    WILEY INTERDISCIPLINARY REVIEWS-DATA MINING AND KNOWLEDGE DISCOVERY, 2011, 1 (06) : 524 - 531
  • [9] Metaheuristics for clustering in KDD
    Rayward-Smith, VJ
    2005 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION, VOLS 1-3, PROCEEDINGS, 2005, : 2380 - 2387
  • [10] Distributed Auto-Clustering for Residential Load Profiling Using AMI Data From the US High Plains
    Vahedi, Soroush
    Zhao, Long
    IEEE TRANSACTIONS ON SMART GRID, 2023, 14 (06) : 4530 - 4541