Bayesian Clustering of Fuzzy Feature Vectors Using a Quasi-Likelihood Approach

Cited by: 10

Authors:

Marttinen, Pekka [1]

Tang, Jing [1]

De Baets, Bernard [2]

Dawyndt, Peter [3]

Corander, Jukka [4]

Affiliations:

[1] Univ Helsinki, Dept Math & Stat, FIN-00014 Helsinki, Finland

[2] Univ Ghent, Dept Appl Math Biometr & Proc Control, B-9000 Ghent, Belgium

[3] Univ Ghent, Dept Appl Math & Comp Sci, B-9000 Ghent, Belgium

[4] Abo Akad Univ, Dept Math, SF-20500 Turku, Finland

Source:

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE | 2009 / Vol. 31 / No. 01

Funding:

Academy of Finland;

Keywords:

Bayesian clustering; fuzzy modeling; quasi-likelihood; continuous data; MODEL SELECTION; IDENTIFICATION;

DOI:

10.1109/TPAMI.2008.53

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104; 0812; 0835; 1405;

Abstract:

Bayesian model-based classifiers, both unsupervised and supervised, have been studied extensively, and their value and versatility have been demonstrated on a wide spectrum of applications within science and engineering. A majority of the classifiers are built on the assumption of intrinsic discreteness of the considered data features or on their discretization prior to the modeling. On the other hand, Gaussian mixture classifiers have also been utilized to a large extent for continuous features in the Bayesian framework. Often, the primary reason for discretization in the classification context is the simplification of the analytical and numerical properties of the Bayesian models. However, the discretization can be problematic due to its ad hoc nature and the decreased statistical power to detect the correct classes (or clusters) in the resulting procedure. Here, we introduce an unsupervised classification approach for fuzzy feature vectors that utilizes a discrete model structure while preserving the continuous characteristics of data. This goal is achieved by replacing the ordinary likelihood by a binomial quasi-likelihood to yield an analytical expression for the posterior probability of a given clustering solution. The resulting model can also be justified from an information-theoretic perspective. Our method is shown to yield highly accurate clusterings for challenging synthetic and empirical data sets and to perform favorably compared to some alternative approaches.
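
As a rough illustration of the idea in the abstract, the sketch below scores a clustering of feature vectors with entries in [0, 1] by treating each entry x as a fractional Bernoulli outcome, i.e. a quasi-likelihood term theta^x (1 - theta)^(1 - x). It is not taken from the paper: the Beta(a, b) prior on each cluster-level parameter, the uniform prior over partitions, and the function names `log_beta` and `log_quasi_marginal` are all assumptions made for this example. Under those assumptions the parameters integrate out in closed form, so the score is a ratio of Beta functions per cluster and feature.

```python
import math


def log_beta(a, b):
    """Log of the Beta function B(a, b), computed via log-gamma."""
    return math.lgamma(a) + math.lgamma(b) - math.lgamma(a + b)


def log_quasi_marginal(X, labels, a=1.0, b=1.0):
    """Log marginal quasi-likelihood of a partition (illustrative sketch).

    X      : list of feature vectors, every entry assumed to lie in [0, 1]
    labels : one cluster label per row of X
    a, b   : hyperparameters of the assumed Beta prior (hypothetical choice)
    """
    n_features = len(X[0])
    # Per cluster: running sum of each feature and the number of members.
    stats = {}
    for row, lab in zip(X, labels):
        entry = stats.setdefault(lab, [[0.0] * n_features, 0])
        for j, x in enumerate(row):
            entry[0][j] += x
        entry[1] += 1

    total = 0.0
    for sums, n in stats.values():
        for s in sums:
            # Integral of theta^s (1 - theta)^(n - s) against Beta(a, b).
            total += log_beta(a + s, b + n - s) - log_beta(a, b)
    return total


# Two clearly separated groups of fuzzy vectors.
X = [[0.9, 0.1], [0.8, 0.2], [0.1, 0.9], [0.2, 0.8]]
good = log_quasi_marginal(X, [0, 0, 1, 1])
bad = log_quasi_marginal(X, [0, 1, 0, 1])
print(good > bad)
```

With a uniform prior over partitions, the posterior probability of a clustering is proportional to the exponential of this score, so candidate partitions can be compared directly. For strictly binary data the expression reduces to the usual Beta-Bernoulli marginal likelihood, which is one way to see how a discrete model structure can be kept while the inputs stay continuous.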


Pages: 74-85

Number of pages: 12
