Very fast EM-based mixture model clustering using multiresolution kd-trees

被引:0
|
作者
Moore, AW [1 ]
机构
[1] Carnegie Mellon Univ, Inst Robot, Pittsburgh, PA 15213 USA
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Clustering is important in many fields including manufacturing, biology, finance, and astronomy. Mixture models are a popular approach due to their statistical foundations. and EM is a very popular method for finding: mixture models. EM, however, requires many accesses of the data, and thus has been dismissed as impractical (e.g. [9]) for data milling of enormous datasets. We present a new algorithm, based on the multiresolution. kd-trees of [5], which dramatically reduces the cost of EM-based clustering, with savings rising linearly with the number of datapoints. Although presented here for maximum likelihood estimation of Gaussian mixture models, it is also applicable to non-Gaussian models (provided class densities are monotonic in Mahalanobis distance?), mixed categorical/numeric clusters, and Bayesian methods such as Autoclass [1].
引用
收藏
页码:543 / 549
页数:7
相关论文
共 50 条
  • [1] A method for initialising the K-means clustering algorithm using kd-trees
    Redmond, Stephen J.
    Heneghan, Conor
    [J]. PATTERN RECOGNITION LETTERS, 2007, 28 (08) : 965 - 973
  • [2] Distributed data stream clustering A fast EM-based approach
    Zhou, Aoying
    Cao, Feng
    Yan, Ying
    Sha, Chaofeng
    He, Xiaofeng
    [J]. 2007 IEEE 23RD INTERNATIONAL CONFERENCE ON DATA ENGINEERING, VOLS 1-3, 2007, : 711 - +
  • [3] Thematic clustering of text documents using an EM-based approach
    Sun Kim
    W John Wilbur
    [J]. Journal of Biomedical Semantics, 3 (Suppl 3)
  • [4] Clustering very large databases using EM mixture models
    Bradley, PS
    Fayyad, UM
    Reina, CA
    [J]. 15TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOL 2, PROCEEDINGS: PATTERN RECOGNITION AND NEURAL NETWORKS, 2000, : 76 - 80
  • [5] Medical image collection indexing: Shape-based retrieval using KD-trees
    Robinson, GP
    Tagare, HD
    Duncan, JS
    Jaffe, CC
    [J]. COMPUTERIZED MEDICAL IMAGING AND GRAPHICS, 1996, 20 (04) : 209 - 217
  • [6] Medical image collection indexing: Shape-based retrieval using KD-trees
    Department of Diagnostic Radiology, Yale University, New Haven, CT 06520, United States
    不详
    不详
    [J]. COMPUT. MED. IMAGING GRAPH, 4 (209-217):
  • [7] A KD-trees based method for fast radiation source representation for virtual reality dosimetry applications in nuclear safeguards and security
    Molto Caracena, Teofilo
    Vendrell Vidal, Eduardo
    Goncalves, Joao G. M.
    Peerani, Paolo
    [J]. PROGRESS IN NUCLEAR ENERGY, 2017, 95 : 78 - 83
  • [8] Resolving the latent structure of schizophrenia endophenotypes using em-based finite mixture modeling
    Lenzenweger, M. F.
    McLachlan, G.
    Rubin, D. B.
    [J]. SCHIZOPHRENIA BULLETIN, 2007, 33 (02) : 239 - 240
  • [9] Efficient personalized recommendation of mobile web content using an EM-based clustering method
    He, Ming
    Chin, Alvin
    Chen, Enhong
    Tian, Jilei
    [J]. 2014 IEEE INTERNATIONAL CONFERENCE ON COMPUTER AND INFORMATION TECHNOLOGY (CIT), 2014, : 152 - 159
  • [10] Flexible Use of Temporal and Spatial Reasoning for Fast and Scalable CPU Broad-Phase Collision Detection Using KD-Trees
    Serpa, Ygor Reboucas
    Formico Rodrigues, Maria Andreia
    [J]. COMPUTER GRAPHICS FORUM, 2019, 38 (01) : 260 - 273