SCALABLE ALGORITHMS FOR CONVEX CLUSTERING

被引:2
|
作者
Zhou, Weilian [1 ]
Yi, Haidong [2 ]
Mishne, Gal [3 ]
Chi, Eric [1 ]
机构
[1] NC State Univ, Dept Stat, Raleigh, NC 27607 USA
[2] Univ N Carolina, Dept Comp Sci, Chapel Hill, NC 27599 USA
[3] Univ Calif San Diego, Halicioglu Data Sci Inst, La Jolla, CA 92093 USA
基金
美国国家卫生研究院; 美国国家科学基金会;
关键词
Convex optimization; Parallel computing; Sparsity; Unsupervised Learning;
D O I
10.1109/DSLW51110.2021.9523411
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Convex clustering is an appealing approach to many classical clustering problems. It stands out among standard methods as it enjoys the existence of a unique global optimal solution. Despite this advantage, convex clustering has not been widely adopted, due to its computationally intensive nature. To address this obstacle, especially in the "big data" setting, we introduce a Scalable cOnvex cLustering AlgoRithm via Parallel Coordinate Descent Method (SOLAR-PCDM) that improves the algorithm's scalability by combining a parallelizable algorithm with a compression strategy. This idea is in line with the rise and ever increasing availability of high performance computing systems built around multi-core processors, GPU-accelerators, and computer clusters. SOLAR-PCDM consists of two parts. In the first part, we develop a method called weighted convex clustering to recover the solution path by formulating a sequence of smaller equivalent optimization problems. In the second part, we utilize the Parallel Coordinate Descent Method (PCDM) to solve a specific convex clustering problem. We demonstrate the correctness and scalability of our algorithm on both simulated and real data examples.
引用
收藏
页数:6
相关论文
共 50 条
  • [1] Scalable fuzzy clustering algorithms
    Hall, Lawrence O.
    [J]. 2008 ANNUAL MEETING OF THE NORTH AMERICAN FUZZY INFORMATION PROCESSING SOCIETY, VOLS 1 AND 2, 2008, : 852 - 853
  • [2] Scalable Co-clustering Algorithms
    Kwon, Bongjune
    Cho, Hyuk
    [J]. ALGORITHMS AND ARCHITECTURES FOR PARALLEL PROCESSING, PT 1, PROCEEDINGS, 2010, 6081 : 32 - +
  • [3] Scalable clustering algorithms with balancing constraints
    Banerjee, Arindam
    Ghosh, Joydeep
    [J]. DATA MINING AND KNOWLEDGE DISCOVERY, 2006, 13 (03) : 365 - 395
  • [4] Scalable Clustering Algorithms with Balancing Constraints
    Arindam Banerjee
    Joydeep Ghosh
    [J]. Data Mining and Knowledge Discovery, 2006, 13 : 365 - 395
  • [5] Scalable Algorithms for Individual Preference Stable Clustering
    Mosenzon, Ron
    Vakilian, Ali
    [J]. INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 238, 2024, 238
  • [6] Scalable Clustering Algorithms for Big Data: A Review
    Mahdi, Mahmoud A.
    Hosny, Khalid M.
    Elhenawy, Ibrahim
    [J]. IEEE ACCESS, 2021, 9 : 80015 - 80027
  • [7] Scalable Parallel Algorithms for Shared Nearest Neighbor Clustering
    Kumari, Sonal
    Maurya, Saurabh
    Goyal, Poonam
    Balasubramaniam, Sundar S.
    Goyal, Navneet
    [J]. PROCEEDINGS OF 2016 IEEE 23RD INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING (HIPC), 2016, : 72 - 81
  • [8] Scalable clustering algorithms for continuous environmental flow cytometry
    Hyrkas, Jeremy
    Clayton, Sophie
    Ribalet, Francois
    Halperin, Daniel
    Armbrust, E. Virginia
    Howe, Bill
    [J]. BIOINFORMATICS, 2016, 32 (03) : 417 - 423
  • [9] Graph Clustering With Missing Data : Convex Algorithms and Analysis
    Vinayak, Ramya Korlakai
    Oymak, Samet
    Hassibi, Babak
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 27 (NIPS 2014), 2014, 27
  • [10] Improving clustering algorithms through constrained convex optimization
    Nock, R
    Nielsen, F
    [J]. PROCEEDINGS OF THE 17TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOL 4, 2004, : 557 - +