Gradual model generator for single-pass clustering

被引:15
|
作者
Karkkainen, Ismo [1 ]
Franti, Pasi [1 ]
机构
[1] Univ Joensuu, Dept Comp Sci, Speech & Image Proc Unit, FIN-80101 Joensuu, Finland
关键词
clustering; Gaussian mixture model; single-pass; large data sets;
D O I
10.1016/j.patcog.2006.06.023
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We present an algorithm for generating a mixture model from a data set by converting the data into a model. The method is applicable when only part of the data fits in the main memory at the same time. The generated model is a Gaussian mixture model but the algorithm can be adapted to other types of models, too. The user cannot specify the size of the generated model. We also introduce a post-processing method, which can reduce the size of the model without using the original data. This will result in a more compact model with fewer components, but with approximately the same representation accuracy as the original model. Our comparisons show that the algorithm produces good results and is quite efficient. The whole process requires only 0.5-10% of the time spent by the expectation-maximization algorithm. (c) 2006 Pattern Recognition Society. Published by Elsevier Ltd. All rights reserved.
引用
收藏
页码:784 / 795
页数:12
相关论文
共 50 条
  • [1] Gradual model generator for single-pass clustering
    Kärkkäinen, I
    Fränti, P
    FIFTH IEEE INTERNATIONAL CONFERENCE ON DATA MINING, PROCEEDINGS, 2005, : 681 - 684
  • [2] Single-Pass Streaming Algorithms for Correlation Clustering
    Behnezhad, Soheil
    Charikar, Moses
    Ma, Weiyun
    Tan, Li-Yang
    PROCEEDINGS OF THE 2023 ANNUAL ACM-SIAM SYMPOSIUM ON DISCRETE ALGORITHMS, SODA, 2023, : 819 - 849
  • [3] Single-Pass Clustering Algorithm Based on Storm
    Li Fang
    Dai Longlong
    Jiang Zhiying
    Li Shunzi
    2017 INTERNATIONAL CONFERENCE ON CONTROL ENGINEERING AND ARTIFICIAL INTELLIGENCE (CCEAI 2017), 2017, 806
  • [4] Design and performance of a single-pass bubbling bioaerosol generator
    Mainelis, G
    Berry, D
    An, HR
    Yao, MS
    DeVoe, K
    Fennell, DE
    Jaeger, R
    ATMOSPHERIC ENVIRONMENT, 2005, 39 (19) : 3521 - 3533
  • [5] On-line single-pass clustering based on diffusion maps
    Allah, Fadoua Ataa
    Grosky, William I.
    Aboutajdine, Driss
    NATURAL LANGUAGE PROCESSING AND INFORMATION SYSTEMS, PROCEEDINGS, 2007, 4592 : 107 - +
  • [6] Gradual Synthesis for Static Parallelization of Single-Pass Array-Processing Programs
    Fedyukovich, Grigory
    Ahmad, Maaz Bin Safeer
    Bodik, Rastislav
    ACM SIGPLAN NOTICES, 2017, 52 (06) : 572 - 585
  • [7] SINGLE-PASS OXYGENATOR
    HIROSE, T
    EVERETT, H
    BAILEY, CP
    JOURNAL OF CARDIOVASCULAR SURGERY, 1970, 11 (01): : 74 - &
  • [8] SINGLE-PASS OXYGENATOR
    HIROSE, T
    EVERETT, H
    MARSHALL, DV
    BAILEY, CP
    TRANSACTIONS AMERICAN SOCIETY FOR ARTIFICIAL INTERNAL ORGANS, 1969, 15 : 151 - &
  • [9] Single-Pass Pivot Algorithm for Correlation Clustering. Keep it simple!
    Chakrabarty, Sayak
    Makarychev, Konstantin
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
  • [10] Hierarchical clustering based on single-pass for breaking topic detection and tracking
    李风环
    Zhao Zongfei
    Wang Zhenyu
    High Technology Letters, 2018, 24 (04) : 369 - 377