Sliding Window Algorithms for k-Clustering Problems

被引:0
|
作者
Borassi, Michele [1 ]
Epasto, Alessandro [2 ]
Lattanzi, Silvio [3 ]
Vassilviskii, Sergei [2 ]
Zadimoghaddam, Morteza [4 ]
机构
[1] Google Zurich, Zurich, Switzerland
[2] Google Res New York, New York, NY USA
[3] Google Res Zurich, Zurich, Switzerland
[4] Google Res Cambridge, Cambridge, MA USA
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The sliding window model of computation captures scenarios in which data is arriving continuously, but only the latest w elements should be used for analysis. The goal is to design algorithms that update the solution efficiently with each arrival rather than recomputing it from scratch. In this work, we focus on k-clustering problems such as k-means and k-median. In this setting, we provide simple and practical algorithms that offer stronger performance guarantees than previous results. Empirically, we show that our methods store only a small fraction of the data, are orders of magnitude faster, and find solutions with costs only slightly higher than those returned by algorithms with access to the full dataset.
引用
收藏
页数:12
相关论文
共 50 条
  • [1] Near-Optimal k-Clustering in the Sliding Window Model
    Woodruff, David P.
    Zhong, Peilin
    Zhou, Samson
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
  • [2] Algorithms of Machine Learning for K-Clustering
    Luis Castillo, S. Jose
    Fernandez del Castillo, Jose R.
    Gonzalez Sotos, Leon
    [J]. TRENDS IN PRACTICAL APPLICATIONS OF AGENTS AND MULTIAGENT SYSTEMS, 2010, 71 : 443 - 452
  • [3] Better Algorithms for Individually Fair k-Clustering
    Chakrabarty, Deeparnab
    Negahbani, Maryam
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
  • [4] Algorithms for fair k-clustering with multiple protected attributes
    Bohm, Matteo
    Fazzone, Adriano
    Leonardi, Stefano
    Menghini, Cristina
    Schwiegelshohn, Chris
    [J]. OPERATIONS RESEARCH LETTERS, 2021, 49 (05) : 787 - 789
  • [5] ALMOST OPTIMAL SOLUTIONS TO k-CLUSTERING PROBLEMS
    Kumar, Pankaj
    Kumar, Piyush
    [J]. INTERNATIONAL JOURNAL OF COMPUTATIONAL GEOMETRY & APPLICATIONS, 2010, 20 (04) : 431 - 447
  • [6] Consistent k-Clustering
    Lattanzi, Silvio
    Vassilvitskii, Sergei
    [J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 70, 2017, 70
  • [7] Distributed k-clustering algorithms for random wireless multihop networks
    Ravelomanana, V
    [J]. NETWORKING - ICN 2005, PT 1, 2005, 3420 : 109 - 116
  • [8] On approximate geometric k-clustering
    Matousek, J
    [J]. DISCRETE & COMPUTATIONAL GEOMETRY, 2000, 24 (01) : 61 - 84
  • [9] Approximation Algorithms for Min-Sum k-Clustering and Balanced k-Median
    Behsaz, Babak
    Friggstad, Zachary
    Salavatipour, Mohammad R.
    Sivakumar, Rohit
    [J]. ALGORITHMICA, 2019, 81 (03) : 1006 - 1030
  • [10] k-Clustering with Fair Outliers
    Almanza, Matteo
    Epasto, Alessandro
    Panconesi, Alessandro
    Re, Giuseppe
    [J]. WSDM'22: PROCEEDINGS OF THE FIFTEENTH ACM INTERNATIONAL CONFERENCE ON WEB SEARCH AND DATA MINING, 2022, : 5 - 15