Accelerating one-pass clustering by cluster selection racing

被引:1
|
作者
Labroche, Nicolas [1 ]
Detyniecki, Marcin [2 ]
Baerecke, Thomas [1 ]
机构
[1] UPMC Paris 6, CNRS, UMR 7606, LIP6, Paris, France
[2] CNRS, UMR 7606, LIP6, Paris, France
关键词
D O I
10.1109/ICTAI.2013.79
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper introduces a racing mechanism in the cluster selection process for one-pass clustering algorithms. We focus on cases where data are not numerical vectors and where it is not necessarily possible to compute a mean for each cluster. In this case, the distance of each point to existing clusters can be computed exhaustively with a quadratic complexity which is not tractable in most of nowadays use cases. In this paper we first introduce a stochastic approach for estimating the distance of each new data point to existing clusters based on Hoeffding and Bernstein bounds, that reduces the number of computations by simultaneously selecting the quantity of data to be sampled and by eliminating the non-competitive clusters. Second, this paper shows that it is possible to improve the efficiency of our approach by reducing the theoretical values of the Hoeffding and Bernstein bounds. Our algorithms, tested on real data sets, provide significant acceleration of the one-pass clustering algorithms, while making less error (or any depending on parameters) than one-pass clustering algorithm with fixed number of comparisons with each cluster.
引用
收藏
页码:491 / 498
页数:8
相关论文
共 50 条
  • [1] One-Pass Clustering Superpixels
    Kesavan, Yogarajah
    Ramanan, Amirthalingam
    [J]. 2014 7TH INTERNATIONAL CONFERENCE ON INFORMATION AND AUTOMATION FOR SUSTAINABILITY (ICIAFS), 2014,
  • [2] One-pass View-unaligned Clustering
    Dong, Wenhua
    Wu, Xiao-Jun
    Feng, Zhenhua
    Ahmed, Sara Atito Ali
    Awais, Muhammad
    Kittler, Josef
    [J]. IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 26 : 9699 - 9709
  • [3] One-Pass Incomplete Multi-View Clustering
    Hu, Menglei
    Chen, Songcan
    [J]. THIRTY-THIRD AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FIRST INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / NINTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2019, : 3838 - 3845
  • [4] ONE-PASS SELECTION OF A SAMPLE WITH PROBABILITY PROPORTIONAL TO SIZE
    RICHARDSON, SC
    [J]. APPLIED STATISTICS-JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES C, 1989, 38 (03): : 517 - 520
  • [5] A One-Pass Clustering Based Sketch Method for Network Monitoring
    Fu, Yongquan
    An, Lun
    Shen, Siqi
    Chen, Kai
    Barlet-Ros, Pere
    [J]. IEEE-ACM TRANSACTIONS ON NETWORKING, 2023, 31 (06) : 2604 - 2613
  • [6] Scalable and practical One-Pass clustering algorithm for recommender system
    Khalid, Asra
    Ghazanfar, Mustansar Ali
    Zahra, Sobia
    Azam, Muhammad Awais
    [J]. INTELLIGENT DATA ANALYSIS, 2017, 21 (02) : 279 - 310
  • [7] A Scalable and Practical One-Pass Clustering Algorithm for Recommender System
    Khalid, Asra
    Chazanfar, Mustansar Ali
    Azam, Awais
    Alahmari, Saad Ali
    [J]. EIGHTH INTERNATIONAL CONFERENCE ON MACHINE VISION (ICMV 2015), 2015, 9875
  • [8] ONE-PASS CULTIVATION
    SIMS, REH
    [J]. NEW ZEALAND JOURNAL OF AGRICULTURE, 1977, 135 (03): : 38 - &
  • [9] One-Pass Reductions
    Vagvolgyi, Sandor
    [J]. ACTA CYBERNETICA, 2016, 22 (03): : 633 - 655
  • [10] ONE-PASS MODELING
    JONES, JH
    HARNE, RL
    FIRTH, KJ
    [J]. TRANSACTIONS OF THE AMERICAN NUCLEAR SOCIETY, 1985, 49 (JUN): : 471 - 473