RaCoCl: Robust Rank Correlation Based Clustering - An Exploratory Study for High-Dimensional Data

被引:0
|
作者
Krone, Martin [1 ]
Klawonn, Frank [1 ]
Jayaram, Balasubramaniam [2 ]
机构
[1] Ostfalia Univ Appl Sci, Wolfenbuettel, Germany
[2] Indian Inst Technol, Hyderabad, Andhra Pradesh, India
基金
欧洲研究理事会;
关键词
Fuzzy Gamma Rank Correlation Coefficient; Clustering; High-dimensional Data; Fuzzy C-Means; ASSOCIATION;
D O I
10.1109/FUZZ-IEEE.2013.6622463
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The curse of dimensionality, which refers to both the combinatorial explosion in dimensions and the concentration of distances or norms in high dimensions, affects most of the clustering techniques. Recent studies on the concentration of norms suggest the use of a correlation measure instead of distances to more effectively judge (dis)similarity in high dimensions. In this work, based on these observations, we propose a robust rank correlation based clustering method. Specifically, we employ the recently proposed fuzzy gamma rank correlation measure. We show that this intuitively simple algorithm has the following advantages: (i) It requires very few parameters to be set, (ii) the number of clusters need not be apriori known, (iii) while there is an indirect dependence on the underlying distance measure, its makes use of both global and local information, (iv) it can be robust to noise depending on the correlation measure employed and, (v) as it is shown, performs well with high dimensional data. We illustrate the algorithm on some datasets where the traditional Fuzzy C-Means algorithm is known to fail.
引用
收藏
页数:8
相关论文
共 50 条
  • [1] Fast Robust Correlation for High-Dimensional Data
    Raymaekers, Jakob
    Rousseeuw, Peter J.
    [J]. TECHNOMETRICS, 2021, 63 (02) : 184 - 198
  • [2] Robust landmark graph-based clustering for high-dimensional data
    Yang, Ben
    Wu, Jinghan
    Sun, Aoran
    Gao, Naying
    Zhang, Xuetao
    [J]. NEUROCOMPUTING, 2022, 496 : 72 - 84
  • [3] Clustering High-Dimensional Data: A Survey on Subspace Clustering, Pattern-Based Clustering, and Correlation Clustering
    Kriegel, Hans-Peter
    Kroeger, Peer
    Zimek, Arthur
    [J]. ACM TRANSACTIONS ON KNOWLEDGE DISCOVERY FROM DATA, 2009, 3 (01)
  • [4] Robust and compact maximum margin clustering for high-dimensional data
    Hakan Cevikalp
    Edward Chome
    [J]. Neural Computing and Applications, 2024, 36 : 5981 - 6003
  • [5] Robust and compact maximum margin clustering for high-dimensional data
    Cevikalp, Hakan
    Chome, Edward
    [J]. NEURAL COMPUTING & APPLICATIONS, 2024, 36 (11): : 5981 - 6003
  • [6] High-dimensional data clustering
    Bouveyron, C.
    Girard, S.
    Schmid, C.
    [J]. COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2007, 52 (01) : 502 - 519
  • [7] Clustering High-Dimensional Data
    Masulli, Francesco
    Rovetta, Stefano
    [J]. CLUSTERING HIGH-DIMENSIONAL DATA, CHDD 2012, 2015, 7627 : 1 - 13
  • [8] Robust Local Triangular Kernel Density-based Clustering for High-dimensional Data
    Musdholifah, Aina
    Hashim, Siti Zaiton Mohd
    [J]. 2013 5TH INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND INFORMATION TECHNOLOGY (CSIT), 2013, : 24 - 32
  • [9] Smooth Splicing: A Robust SNN-Based Method for Clustering High-Dimensional Data
    Tan, JingDong
    Wang, RuJing
    [J]. MATHEMATICAL PROBLEMS IN ENGINEERING, 2013, 2013
  • [10] Robust and sparse k-means clustering for high-dimensional data
    Brodinova, Sarka
    Filzmoser, Peter
    Ortner, Thomas
    Breiteneder, Christian
    Rohm, Maia
    [J]. ADVANCES IN DATA ANALYSIS AND CLASSIFICATION, 2019, 13 (04) : 905 - 932