Refining spherical K-means for clustering documents

被引:0
|
作者
Peng, Jiming [1 ]
Zhu, Jiaping [2 ]
机构
[1] McMaster Univ, Dept Comp & Software, Adv Optimizat Lab, Hamilton, ON L8S 4K1, Canada
[2] McMaster Univ, Dept Math & Stat, Adv Optimizat Lab, Hamilton, ON L8S 4K1, Canada
基金
加拿大自然科学与工程研究理事会;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Spherical k-means is a popular algorithm for document clustering. However, it may still yield poor performance in some circumstances. In this paper, we consider a discrete optimization model for spkmeans. By using the convexity of objective function and specific structure of constraint set, we first reformulate the discrete problem as an equivalent convex maximization problem with linear constraints. Then we characterize the local optimality of relaxed problem. Based on the characteristics, we refine the spherical k-means algorithm by alternatively performing spherical k-means and switching data points between clusters. This strategy guarantees that the refined algorithm can always attain a local optimal solution.
引用
收藏
页码:4146 / +
页数:3
相关论文
共 50 条
  • [31] K-means clustering on CGRA
    Lopes, Joao D.
    de Sousa, Jose T.
    Neto, Horacio
    Vestias, Mario
    2017 27TH INTERNATIONAL CONFERENCE ON FIELD PROGRAMMABLE LOGIC AND APPLICATIONS (FPL), 2017,
  • [32] Online k-means Clustering
    Cohen-Addad, Vincent
    Guedj, Benjamin
    Kanade, Varun
    Rom, Guy
    24TH INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS (AISTATS), 2021, 130
  • [33] A HYBRID APPROACH USING PSO AND K-MEANS FOR SEMANTIC CLUSTERING OF WEB DOCUMENTS
    Avanija, J.
    Ramar, K.
    JOURNAL OF WEB ENGINEERING, 2013, 12 (3-4): : 249 - 264
  • [34] Clustering of Image Data Using K-Means and Fuzzy K-Means
    Rahmani, Md. Khalid Imam
    Pal, Naina
    Arora, Kamiya
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2014, 5 (07) : 160 - 163
  • [35] Deep k-Means: Jointly clustering with k-Means and learning representations
    Fard, Maziar Moradi
    Thonet, Thibaut
    Gaussier, Eric
    PATTERN RECOGNITION LETTERS, 2020, 138 : 185 - 192
  • [36] Unsupervised Text Binarization in Handwritten Historical Documents Using k-Means Clustering
    Kusetogullari, Huseyin
    PROCEEDINGS OF SAI INTELLIGENT SYSTEMS CONFERENCE (INTELLISYS) 2016, VOL 2, 2018, 16 : 23 - 32
  • [37] PSO Aided k-Means Clustering: Introducing Connectivity in k-Means
    Breaban, Mihaela Elena
    Luchian, Henri
    GECCO-2011: PROCEEDINGS OF THE 13TH ANNUAL GENETIC AND EVOLUTIONARY COMPUTATION CONFERENCE, 2011, : 1227 - 1234
  • [38] Automatic generation of initial value k to apply k-means method for text documents clustering
    Gupta, Namita
    Saxena, P. C.
    Gupta, J. P.
    INTERNATIONAL JOURNAL OF DATA MINING MODELLING AND MANAGEMENT, 2011, 3 (01) : 18 - 41
  • [39] SKIFF: Spherical K-means with iterative feature filtering for text document clustering
    Sharma, Iti
    Sharma, Abhay
    Chaturvedi, Rekha
    Rajpurohit, Jitendra
    Kumar, Manoj
    JOURNAL OF INFORMATION SCIENCE, 2023,
  • [40] Spherical k-means clustering is good for interpreting multivariate species occurrence data
    Hill, Mark O.
    Harrower, Colin A.
    Preston, Christopher D.
    METHODS IN ECOLOGY AND EVOLUTION, 2013, 4 (06): : 542 - 551