Robust and compact maximum margin clustering for high-dimensional data

被引:0
|
作者
Cevikalp, Hakan [1 ]
Chome, Edward [2 ]
机构
[1] Eskisehir Osmangazi Univ, Machine Learning & Comp Vis Lab, Elect & Elect Engn Dept, TR-26480 Meselik, Eskisehir, Turkiye
[2] Eskisehir Tech Univ, Comp Engn Dept, Eskisehir, Turkiye
来源
NEURAL COMPUTING & APPLICATIONS | 2024年 / 36卷 / 11期
关键词
Maximum margin clustering; Subspace clustering; Hyperplane fitting; Large margin; Robust clustering; ALGORITHM;
D O I
10.1007/s00521-023-09388-x
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In the field of machine learning, clustering has become an increasingly popular research topic due to its critical importance. Many clustering algorithms have been proposed utilizing a variety of approaches. This study focuses on clustering of high-dimensional data using the maximum margin clustering approach. In this paper, two methods are introduced: The first method employs the classical maximum margin clustering approach, which separates data into two clusters with the greatest margin between them. The second method takes cluster compactness into account and searches for two parallel hyperplanes that best fit to the cluster samples while also being as far apart from each other as possible. Additionally, robust variants of these clustering methods are introduced to handle outliers and noise within the data samples. The stochastic gradient algorithm is used to solve the resulting optimization problems, enabling all proposed clustering methods to scale well with large-scale data. Experimental results demonstrate that the proposed methods are more effective than existing maximum margin clustering methods, particularly in high-dimensional clustering problems, highlighting the efficacy of the proposed methods.
引用
收藏
页码:5981 / 6003
页数:23
相关论文
共 50 条
  • [1] Robust and compact maximum margin clustering for high-dimensional data
    Hakan Cevikalp
    Edward Chome
    [J]. Neural Computing and Applications, 2024, 36 : 5981 - 6003
  • [2] High-dimensional data clustering
    Bouveyron, C.
    Girard, S.
    Schmid, C.
    [J]. COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2007, 52 (01) : 502 - 519
  • [3] Clustering High-Dimensional Data
    Masulli, Francesco
    Rovetta, Stefano
    [J]. CLUSTERING HIGH-DIMENSIONAL DATA, CHDD 2012, 2015, 7627 : 1 - 13
  • [4] Robust and sparse k-means clustering for high-dimensional data
    Brodinova, Sarka
    Filzmoser, Peter
    Ortner, Thomas
    Breiteneder, Christian
    Rohm, Maia
    [J]. ADVANCES IN DATA ANALYSIS AND CLASSIFICATION, 2019, 13 (04) : 905 - 932
  • [5] Robust and sparse k-means clustering for high-dimensional data
    Šárka Brodinová
    Peter Filzmoser
    Thomas Ortner
    Christian Breiteneder
    Maia Rohm
    [J]. Advances in Data Analysis and Classification, 2019, 13 : 905 - 932
  • [6] Robust landmark graph-based clustering for high-dimensional data
    Yang, Ben
    Wu, Jinghan
    Sun, Aoran
    Gao, Naying
    Zhang, Xuetao
    [J]. NEUROCOMPUTING, 2022, 496 : 72 - 84
  • [7] Robust Optimization in High-Dimensional Data Space with Support Vector Clustering
    Shang, Chao
    You, Fengqi
    [J]. IFAC PAPERSONLINE, 2018, 51 (18): : 19 - 24
  • [8] Robust estimation of the mean vector for high-dimensional data set using robust clustering
    Shahriari, Hamid
    Ahmadi, Orod
    [J]. JOURNAL OF APPLIED STATISTICS, 2015, 42 (06) : 1183 - 1205
  • [9] Clustering of High-Dimensional and Correlated Data
    McLachlan, Geoffrey J.
    Ng, Shu-Kay
    Wang, K.
    [J]. DATA ANALYSIS AND CLASSIFICATION, 2010, : 3 - 11
  • [10] Clustering in high-dimensional data spaces
    Murtagh, FD
    [J]. STATISTICAL CHALLENGES IN ASTRONOMY, 2003, : 279 - 292