Enhancing principal direction divisive clustering

被引:23
|
作者
Tasoulis, S. K. [1 ]
Tasoulis, D. K. [2 ]
Plagianakos, V. P. [1 ]
机构
[1] Univ Cent Greece, Dept Comp Sci & Biomed Informat, Lamia 35100, Greece
[2] Univ London Imperial Coll Sci Technol & Med, Dept Math, London SW7 2AZ, England
关键词
Clustering; Principal component analysis; Kernel density estimation; ALGORITHM; CLASSIFICATION; SELECTION;
D O I
10.1016/j.patcog.2010.05.025
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
While data clustering has a long history and a large amount of research has been devoted to the development of numerous clustering techniques, significant challenges still remain. One of the most important of them is associated with high data dimensionality. A particular class of clustering algorithms has been very successful in dealing with such datasets, utilising information driven by the principal component analysis. In this work, we try to deepen our understanding on what can be achieved by this kind of approaches. We attempt to theoretically discover the relationship between true clusters in the data and the distribution of their projection onto the principal components. Based on such findings, we propose appropriate criteria for the various steps involved in hierarchical divisive clustering and develop compilations of them into new algorithms. The proposed algorithms require minimal user-defined parameters and have the desirable feature of being able to provide approximations for the number of clusters present in the data. The experimental results indicate that the proposed techniques are effective in simulated as well as real data scenarios. (C) 2010 Elsevier Ltd. All rights reserved.
引用
收藏
页码:3391 / 3411
页数:21
相关论文
共 50 条
  • [1] Principal Direction Divisive Partitioning
    Daniel Boley
    Data Mining and Knowledge Discovery, 1998, 2 : 325 - 344
  • [2] Principal direction divisive partitioning
    Boley, D
    DATA MINING AND KNOWLEDGE DISCOVERY, 1998, 2 (04) : 325 - 344
  • [3] Hierarchical clustering using non-greedy principal direction divisive partitioning
    Nilsson, M
    INFORMATION RETRIEVAL, 2002, 5 (04): : 311 - 321
  • [4] Hierarchical Clustering Using Non-Greedy Principal Direction Divisive Partitioning
    Martin Nilsson
    Information Retrieval, 2002, 5 : 311 - 321
  • [5] Random direction divisive clustering
    Tasoulis, S. K.
    Tasoulis, D. K.
    Plagianakos, V. P.
    PATTERN RECOGNITION LETTERS, 2013, 34 (02) : 131 - 139
  • [6] Evolutionary Principal Direction Divisive Partitioning
    Tasoulis, Sotiris K.
    Tasoulis, Dimitris K.
    Plagianakos, Vassilis P.
    2010 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION (CEC), 2010,
  • [7] Principal direction divisive partitioning with kernels and k-means steering
    Zeimpekis, Dimitrios
    Gallopoulos, Efstratios
    SURVEY OF TEXT MINING II: CLUSTERING, CLASSIFICATION, AND RETRIEVAL, 2008, : 45 - 64
  • [8] Error analysis of automatic speech recognition using Principal Direction Divisive Partitioning
    McKoskey, D
    Boley, D
    MACHINE LEARNING: ECML 2000, 2000, 1810 : 263 - 270
  • [9] Farthest Centroid Divisive Clustering
    Fang, Haw-ren
    Saad, Yousef
    SEVENTH INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS, PROCEEDINGS, 2008, : 232 - +
  • [10] Maximum Clusterability Divisive Clustering
    Hofmeyr, David
    Pavlidis, Nicos
    2015 IEEE SYMPOSIUM SERIES ON COMPUTATIONAL INTELLIGENCE (IEEE SSCI), 2015, : 780 - 786