Bayesian unsupervised classification framework based on stochastic partitions of data and a parallel search strategy

Cited by: 12
Authors
Corander J. [1 ]
Gyllenberg M. [2 ]
Koski T. [3 ]
Affiliations
[1] Department of Mathematics, Åbo Akademi University
[2] Department of Mathematics and Statistics, Rolf Nevanlinna Institute, University of Helsinki, Helsinki 00014
[3] Department of Mathematics, Royal Institute of Technology
Source
Adv. Data Anal. Classif. | 2009 / Vol. 1 / Issue 3-24
Funding
Academy of Finland
Keywords
Bayesian classification; Markov chain Monte Carlo; Statistical learning; Stochastic optimization;
DOI
10.1007/s11634-009-0036-9
Abstract
Advantages of statistical model-based unsupervised classification over heuristic alternatives have been widely demonstrated in the scientific literature. However, existing model-based approaches are often both conceptually and numerically unstable for large and complex data sets. Here we consider a Bayesian model-based method for unsupervised classification of discrete-valued vectors that has certain advantages over standard solutions based on latent class models. Our theoretical formulation defines a posterior probability measure on the space of classification solutions corresponding to stochastic partitions of observed data. To efficiently explore the classification space, we use a parallel search strategy based on non-reversible stochastic processes. A decision-theoretic approach is used to formalize the inferential process in the context of unsupervised classification. Both real and simulated data sets are used to illustrate the discussed methods. © 2009 Springer-Verlag.
Pages: 3 - 24
Number of pages: 21
Related papers
50 in total
  • [21] A stochastic quantum program synthesis framework based on Bayesian optimization
    Xiao, Yao
    Nazarian, Shahin
    Bogdan, Paul
    SCIENTIFIC REPORTS, 2021, 11 (01)
  • [22] Framework for tasks suggestion on web search based on unsupervised learning techniques
    Alsulmi, Mohammad
    Alshamarani, Reham
    JOURNAL OF KING SAUD UNIVERSITY-COMPUTER AND INFORMATION SCIENCES, 2022, 34 (08) : 5525 - 5532
  • [23] Unsupervised Texture Feature Classification Based on Cuckoo Search and Relief Algorithm
    Wang, Mingwei
    Wan, Youchuan
    Ye, Zhiwei
    Chen, Maolin
    NINTH INTERNATIONAL CONFERENCE ON DIGITAL IMAGE PROCESSING (ICDIP 2017), 2017, 10420
  • [24] A General Framework for High-Dimensional Data Reduction Using Unsupervised Bayesian Model
    Jin, Longcun
    Wan, Wanggen
    Wu, Yongliang
    Cui, Bin
    Yu, Xiaoqing
    LIFE SYSTEM MODELING AND INTELLIGENT COMPUTING, PT II, 2010, 98 : 96 - 101
  • [25] VISUAL SEARCH TIME BASED ON STOCHASTIC SERIAL AND PARALLEL PROCESSINGS
    UENO, T
    PERCEPTION & PSYCHOPHYSICS, 1968, 3 (3B) : 229 - &
  • [26] Exploratory parallel hybrid sampling framework for imbalanced data classification
    Zheng, Ming
    Zhao, Zhuo
    Wang, Fei
    Hu, Xiaowen
    Xu, Sheng
    Li, Wanggen
    Li, Tong
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2024, 138
  • [27] Unsupervised Classification of Data Streams based on Typicality and Eccentricity Data Analytics
    Jales Costa, Bruno Sielly
    Bezerra, Clauber Gomes
    Guedes, Luiz Affonso
    Parvanov Angelov, Plamen
    2016 IEEE INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS (FUZZ-IEEE), 2016 : 58 - 63
  • [28] A Survey and Recommendations for Distributed, Parallel, Single Pass, Incremental Bayesian Classification based on MapReduce for Big Data
    Shafiq, M. Omair
    Yang, Yibing
    Fekri, Maryam
    2017 IEEE 19TH INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING AND COMMUNICATIONS WORKSHOPS (HPCCWS): MULTICORE AND MULTITHREADED ARCHITECTURES AND ALGORITHMS (M2A2 2017), 2017 : 42 - 49
  • [29] Recognizing groundwater DNAPL contaminant source and aquifer parameters using parallel heuristic search strategy based on Bayesian approach
    Wang, Han
    Lu, Wenxi
    STOCHASTIC ENVIRONMENTAL RESEARCH AND RISK ASSESSMENT, 2021, 35 (04) : 813 - 830
  • [30] Recognizing groundwater DNAPL contaminant source and aquifer parameters using parallel heuristic search strategy based on Bayesian approach
    Han Wang
    Wenxi Lu
    Stochastic Environmental Research and Risk Assessment, 2021, 35 : 813 - 830