Gradual model generator for single-pass clustering

被引:15
|
作者
Karkkainen, Ismo [1 ]
Franti, Pasi [1 ]
机构
[1] Univ Joensuu, Dept Comp Sci, Speech & Image Proc Unit, FIN-80101 Joensuu, Finland
关键词
clustering; Gaussian mixture model; single-pass; large data sets;
D O I
10.1016/j.patcog.2006.06.023
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We present an algorithm for generating a mixture model from a data set by converting the data into a model. The method is applicable when only part of the data fits in the main memory at the same time. The generated model is a Gaussian mixture model but the algorithm can be adapted to other types of models, too. The user cannot specify the size of the generated model. We also introduce a post-processing method, which can reduce the size of the model without using the original data. This will result in a more compact model with fewer components, but with approximately the same representation accuracy as the original model. Our comparisons show that the algorithm produces good results and is quite efficient. The whole process requires only 0.5-10% of the time spent by the expectation-maximization algorithm. (c) 2006 Pattern Recognition Society. Published by Elsevier Ltd. All rights reserved.
引用
收藏
页码:784 / 795
页数:12
相关论文
共 50 条
  • [11] Monitoring of Public Opinion on Typhoon Disaster Using Improved Clustering Model Based on Single-Pass Approach
    Chen, Xin
    SAGE OPEN, 2023, 13 (03):
  • [12] Establishing a validation model for single-pass albumin dialysis
    Labib, A.
    Tatersall, J.
    Lewington, A.
    Bellamy, M. C.
    BRITISH JOURNAL OF ANAESTHESIA, 2009, 102 (04) : 574 - 575
  • [13] On Single-Pass Indexing with MapReduce
    McCreadie, Richard M. C.
    Macdonald, Craig
    Ounis, Iadh
    PROCEEDINGS 32ND ANNUAL INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 2009, : 742 - 743
  • [14] SINGLE-PASS LIST PARTITIONING
    Frias, Leonor
    Singler, Johannes
    Sanders, Peter
    SCALABLE COMPUTING-PRACTICE AND EXPERIENCE, 2008, 9 (03): : 179 - 196
  • [15] A Single-Pass Triclustering Algorithm
    Gnatyshak, D. V.
    AUTOMATIC DOCUMENTATION AND MATHEMATICAL LINGUISTICS, 2015, 49 (01) : 27 - 41
  • [16] Single-pass TFF processing
    Lutz, Herb
    Steen, Jonathan
    Chefer, Kate
    ABSTRACTS OF PAPERS OF THE AMERICAN CHEMICAL SOCIETY, 2011, 241
  • [17] Unsupervised Web Name Disambiguation Using Semantic Similarity and Single-Pass Clustering
    Iosif, Elias
    ARTIFICIAL INTELLIGENCE: THEORIES, MODELS AND APPLICATIONS, PROCEEDINGS, 2010, 6040 : 133 - 141
  • [19] Single-pass list partitioning
    Universitat Politècnica de Catalunya, Dep. de Llenguatges i Sistemes Informátics, Jordi Girona Salgado, 1-3, Barcelona
    08034, Spain
    不详
    76128, Germany
    Scalable Comput. Pract. Exp., 2008, 3 (179-184):
  • [20] Single-pass list partitioning
    Frias, Leonor
    Singler, Johannes
    Sanders, Peter
    CISIS 2008: THE SECOND INTERNATIONAL CONFERENCE ON COMPLEX, INTELLIGENT AND SOFTWARE INTENSIVE SYSTEMS, PROCEEDINGS, 2008, : 817 - +