Utilizing unsupervised learning to cluster data in the Bayesian data reduction algorithm

被引:0
|
作者
Lynch, RS [1 ]
Willett, PK [1 ]
机构
[1] USN, Undersea Warfare Ctr, Signal Proc Branch, Newport, RI 02840 USA
关键词
adaptive classification; noninformative prior; discrete data; unknown data distribution;
D O I
10.1117/12.603522
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, unsupervised learning is utilized to illustrate the ability of the Bayesian Data Reduction Algorithm (BDRA) to cluster unlabeled training data. The BDRA is based on the assumption that the discrete symbol probabilities of each class are a priori uniformly Dirichlet distributed, and it employs a "greedy" approach (similar to a backward sequential feature search) for reducing irrelevant features from the training data of each class. Notice that reducing irrelevant features is synonymous here with selecting those features that provide best classification performance; the metric for making data reducing decisions is an analytic formula for the probability of error conditioned on the training data. The contribution of this work is to demonstrate how clustering performance varies depending on the method utilized for unsupervised training. To illustrate performance, results are demonstrated using simulated data. In general, the results of this work have implications for finding clusters in data mining applications.
引用
收藏
页码:158 / 167
页数:10
相关论文
共 50 条
  • [1] Bayesian Unsupervised Learning with Multiple Data Types
    Agius, Phaedra
    Ying, Yiming
    Campbell, Colin
    [J]. STATISTICAL APPLICATIONS IN GENETICS AND MOLECULAR BIOLOGY, 2009, 8 (01)
  • [2] Unsupervised learning algorithm based on data driving
    Liu, Kai-Di
    Pang, Yan-Jun
    Zhou, Shao-Ling
    Li, Wen-Guo
    [J]. Kongzhi yu Juece/Control and Decision, 2009, 24 (03): : 472 - 476
  • [3] Performance analysis of the Bayesian data reduction algorithm
    Kline, Douglas M.
    Galbraith, Craig S.
    [J]. INTERNATIONAL JOURNAL OF DATA MINING MODELLING AND MANAGEMENT, 2009, 1 (03) : 223 - 236
  • [4] A FEATURE REDUCTION AND UNSUPERVISED CLASSIFICATION ALGORITHM FOR MULTISPECTRAL DATA
    GOWDA, KC
    [J]. PATTERN RECOGNITION, 1984, 17 (06) : 667 - 676
  • [5] Data reduction algorithm for machine learning and data mining
    Czarnowski, Ireneusz
    Jedrzejowicz, Piotr
    [J]. NEW FRONTIERS IN APPLIED ARTIFICIAL INTELLIGENCE, 2008, 5027 : 276 - 285
  • [6] Unsupervised data reduction
    Okun, Oleg
    Priisalu, Helen
    [J]. SIGNAL PROCESSING, 2007, 87 (09) : 2260 - 2267
  • [7] Bayesian Cluster Enumeration Criterion for Unsupervised Learning
    Teklehaymanot, Freweyni K.
    Muma, Michael
    Zoubir, Abdelhak M.
    [J]. IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2018, 66 (20) : 5392 - 5406
  • [8] A Deep Unsupervised Learning Algorithm for Dynamic Data Clustering
    Pantula, Priyanka D.
    Miriyala, Srinivas S.
    Mitra, Kishalay
    [J]. 2021 SEVENTH INDIAN CONTROL CONFERENCE (ICC), 2021, : 147 - 152
  • [9] Clustering fMRI data with a robust unsupervised learning algorithm for neuroscience data mining
    Aljobouri, Hadeel K.
    Jaber, Hussain A.
    Kocak, Orhan M.
    Algin, Oktay
    Cankaya, Ilyas
    [J]. JOURNAL OF NEUROSCIENCE METHODS, 2018, 299 : 45 - 54
  • [10] Quantizing features independently in the Bayesian data reduction algorithm
    Lynch, RS
    Willett, PK
    [J]. 2004 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN & CYBERNETICS, VOLS 1-7, 2004, : 1336 - 1341