Very fast EM-based mixture model clustering using multiresolution kd-trees

被引:0
|
作者
Moore, AW [1 ]
机构
[1] Carnegie Mellon Univ, Inst Robot, Pittsburgh, PA 15213 USA
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Clustering is important in many fields including manufacturing, biology, finance, and astronomy. Mixture models are a popular approach due to their statistical foundations. and EM is a very popular method for finding: mixture models. EM, however, requires many accesses of the data, and thus has been dismissed as impractical (e.g. [9]) for data milling of enormous datasets. We present a new algorithm, based on the multiresolution. kd-trees of [5], which dramatically reduces the cost of EM-based clustering, with savings rising linearly with the number of datapoints. Although presented here for maximum likelihood estimation of Gaussian mixture models, it is also applicable to non-Gaussian models (provided class densities are monotonic in Mahalanobis distance?), mixed categorical/numeric clusters, and Bayesian methods such as Autoclass [1].
引用
收藏
页码:543 / 549
页数:7
相关论文
共 50 条
  • [31] Clustering gene expression data analysis using an improved EM algorithm based on multivariate elliptical contoured mixture models
    Liu, Zhe
    Song, Yu-qing
    Xie, Cong-hua
    Zhu, Feng
    Bao, Xiang
    [J]. OPTIK, 2014, 125 (21): : 6388 - 6394
  • [32] Color image segmentation using a model-based clustering and a MFA-EM algorithm
    Park, JH
    [J]. IMAGE ANALYSIS, PROCEEDINGS, 2003, 2749 : 934 - 941
  • [33] Image Fusion-Based Tone Mapping Using Gaussian Mixture Model Clustering
    Lee, Wang-Un
    Park, Seung
    Ko, Sung-Jea
    [J]. 2020 IEEE INTERNATIONAL CONFERENCE ON CONSUMER ELECTRONICS (ICCE), 2020, : 153 - 156
  • [34] Motion-based video segmentation using fuzzy clustering and classical mixture model
    Nitsuwat, S
    Jin, JS
    Hudson, HM
    [J]. 2000 INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, VOL I, PROCEEDINGS, 2000, : 300 - 303
  • [35] Fast attribute-based table clustering using predicate-trees: A vertical data mining approach
    [J]. Roy, A.G. (arjun.roy@ndsu.edu), 1600, IOS Press BV (12):
  • [36] Fast attribute-based table clustering using Predicate-Trees: A vertical data mining approach
    Roy, Arjun G.
    Chatterjee, Arijit
    Hossain, Mohammad K.
    Perrizo, William
    [J]. JOURNAL OF COMPUTATIONAL METHODS IN SCIENCES AND ENGINEERING, 2012, 12 : S139 - S146
  • [37] Hybrid Recommender System using Semi-Supervised Clustering based on Gaussian Mixture Model
    Zhang, Yihao
    Liu, Xiaoyang
    Liu, Wanping
    Zhu, Changpeng
    [J]. 2016 INTERNATIONAL CONFERENCE ON CYBERWORLDS (CW), 2016, : 155 - 158
  • [38] A machine learning recommender system based on collaborative filtering using Gaussian mixture model clustering
    Fakoor, Delshad
    Maihami, Vafa
    Maihami, Reza
    [J]. MATHEMATICAL METHODS IN THE APPLIED SCIENCES, 2021,
  • [39] Identification of typical building daily electricity usage profiles using Gaussian mixture model-based clustering and hierarchical clustering
    Li, Kehua
    Ma, Zhenjun
    Robinson, Duane
    Ma, Jun
    [J]. APPLIED ENERGY, 2018, 231 : 331 - 342
  • [40] A graph-based superframework for mixture model estimation using EM: an analysis of US wholesale electricity markets
    Mari, Carlo
    Baldassari, Cristiano
    [J]. NEURAL COMPUTING & APPLICATIONS, 2023, 35 (20): : 14867 - 14883