Very fast EM-based mixture model clustering using multiresolution kd-trees

被引:0
|
作者
Moore, AW [1 ]
机构
[1] Carnegie Mellon Univ, Inst Robot, Pittsburgh, PA 15213 USA
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Clustering is important in many fields including manufacturing, biology, finance, and astronomy. Mixture models are a popular approach due to their statistical foundations. and EM is a very popular method for finding: mixture models. EM, however, requires many accesses of the data, and thus has been dismissed as impractical (e.g. [9]) for data milling of enormous datasets. We present a new algorithm, based on the multiresolution. kd-trees of [5], which dramatically reduces the cost of EM-based clustering, with savings rising linearly with the number of datapoints. Although presented here for maximum likelihood estimation of Gaussian mixture models, it is also applicable to non-Gaussian models (provided class densities are monotonic in Mahalanobis distance?), mixed categorical/numeric clusters, and Bayesian methods such as Autoclass [1].
引用
收藏
页码:543 / 549
页数:7
相关论文
共 50 条
  • [41] A graph-based superframework for mixture model estimation using EM: an analysis of US wholesale electricity markets
    Carlo Mari
    Cristiano Baldassari
    [J]. Neural Computing and Applications, 2023, 35 : 14867 - 14883
  • [42] Gaussian mixture model based clustering of Manual muscle testing grades using surface Electromyogram signals
    Saranya, S.
    Poonguzhali, S.
    Karunakaran, S.
    [J]. PHYSICAL AND ENGINEERING SCIENCES IN MEDICINE, 2020, 43 (03) : 837 - 847
  • [43] Gaussian mixture model based clustering of Manual muscle testing grades using surface Electromyogram signals
    S. Saranya
    S. Poonguzhali
    S. Karunakaran
    [J]. Physical and Engineering Sciences in Medicine, 2020, 43 : 837 - 847
  • [44] CopyMix: Mixture model based single-cell clustering and copy number profiling using variational inference
    Safinianaini, Negar
    De Souza, Camila P.E.
    Roth, Andrew
    Koptagel, Hazal
    Toosi, Hosein
    Lagergren, Jens
    [J]. Computational Biology and Chemistry, 2024, 113
  • [45] Physical Layer Authentication for Mission Critical Machine Type Communication using Gaussian Mixture Model based Clustering
    Weinand, Andreas
    Karrenbauer, Michael
    Ji Lianghai
    Schotten, Hans D.
    [J]. 2017 IEEE 85TH VEHICULAR TECHNOLOGY CONFERENCE (VTC SPRING), 2017,
  • [46] Speaker Clustering Based on Non-Negative Matrix Factorization Using Gaussian Mixture Model in Complementary Subspace
    Nishida, Masafumi
    Yamamoto, Seiichi
    [J]. PROCEEDINGS OF THE 15TH INTERNATIONAL WORKSHOP ON CONTENT-BASED MULTIMEDIA INDEXING (CBMI), 2017,
  • [47] Identifying urban form typologies in Seoul using a new Gaussian mixture model-based clustering framework
    Li, Na
    Quan, Steven Jige
    [J]. ENVIRONMENT AND PLANNING B-URBAN ANALYTICS AND CITY SCIENCE, 2023, 50 (09) : 2342 - 2358
  • [48] A variable clustering approach for overdispersed high-dimensional count data using a copula-based mixture model
    Brini, Alberto
    Manju, Abu
    van den Heuvel, Edwin R.
    [J]. COMMUNICATIONS IN STATISTICS-SIMULATION AND COMPUTATION, 2024,
  • [49] Evaluation of web service clustering using Dirichlet Multinomial Mixture model based approach for Dimensionality Reduction in service representation
    Agarwal, Neha
    Sikka, Geeta
    Awasthi, Lalit Kumar
    [J]. INFORMATION PROCESSING & MANAGEMENT, 2020, 57 (04)
  • [50] A Multiphase Level Set Clustering Approach Using MRF-based Student's-t Mixture Model
    Xie, Qunyi
    Pan, Xu
    Zhu, Hongqing
    [J]. 2016 8TH INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS & SIGNAL PROCESSING (WCSP), 2016,