Clustering of fuzzy data and simultaneous feature selection: A model selection approach

Cited by: 7
Authors
Saha, Arkajyoti [1]
Das, Swagatam [2]
Affiliations
[1] Indian Stat Inst, Stat Math Unit, 203 BT Rd, Kolkata 700108, WB, India
[2] Indian Stat Inst, Elect & Commun Sci Unit, 203 BT Rd, Kolkata 700108, WB, India
Keywords
Fuzzy data; Component number selection; Model selection; Minimum Message Length (MML); Bayesian; Jeffreys prior; DNW prior; MIXTURE; PARTITIONS;
DOI
10.1016/j.fss.2017.11.015
Chinese Library Classification (CLC) number
TP301 [Theory, Methods];
Subject classification code
081202;
Abstract
Fuzzy data occur frequently in decision making, the social sciences, and control theory. We consider the problem of clustering fuzzy data together with automatic detection of the number of components and feature selection. A model selection criterion, minimum message length (MML), is used to select the number of components. A Bayesian framework can be adopted here by applying an explicit prior distribution over the parameter values. We discuss both uninformative and informative priors. For the latter, a gradient descent algorithm for automatic optimization of the prior hyper-parameters is presented. Simultaneous feature selection involves ordering the discriminative features by their relative importance while eliminating non-discriminative features. The feature selection problem is also formulated as a parameter estimation problem by extending the concept of feature saliency, so that the estimation can be computed simultaneously with the clustering steps. By combining the clustering, the cluster number detection, and the feature selection into one estimation problem, we modify the fuzzy Expectation-Maximization (EM) algorithm to perform all of the estimation. Evaluation criteria are proposed and empirical study results are reported to demonstrate the efficacy of our proposals. (C) 2017 Elsevier B.V. All rights reserved.
Pages: 1-37
Page count: 37
Related papers
  • [1] An automated parameter selection approach for simultaneous clustering and feature selection
    Kumar, Vijay
    Chhabra, Jitender K.
    Kumar, Dinesh
    [J]. JOURNAL OF ENGINEERING RESEARCH, 2016, 4 (02): 65-85
  • [2] A differential evolution approach for simultaneous clustering and feature selection
    Hancer, Emrah
    [J]. 2018 INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND DATA PROCESSING (IDAP), 2018
  • [3] Fast Simultaneous Clustering and Feature Selection for Binary Data
    Laclau, Charlotte
    Nadif, Mohamed
    [J]. ADVANCES IN INTELLIGENT DATA ANALYSIS XIII, 2014, 8819: 192-202
  • [4] Improved clustering approach based on fuzzy feature selection
    Wu, Naijun
    Li, Xiuyun
    Yang, Jie
    Liu, Peng
    [J]. 2007 INTERNATIONAL CONFERENCE ON SERVICE SYSTEMS AND SERVICE MANAGEMENT, VOLS 1-3, 2007: 479-+
  • [5] Feature Weighting and Feature Selection in Fuzzy Clustering
    Borgelt, Christian
    [J]. 2008 IEEE INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS, VOLS 1-5, 2008: 838-844
  • [6] Unified Simultaneous Clustering and Feature Selection for Unlabeled and Labeled Data
    Han, Dongyoon
    Kim, Junmo
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2018, 29 (12): 6083-6098
  • [7] An approach to feature selection based on fuzzy clustering and statistic theory
    Gao, XB
    Ji, HB
    Xie, WX
    [J]. ICEMI'99: FOURTH INTERNATIONAL CONFERENCE ON ELECTRONIC MEASUREMENT & INSTRUMENTS, VOLS 1 AND 2, CONFERENCE PROCEEDINGS, 1999: 988-992
  • [8] Feature Selection and Semisupervised Fuzzy Clustering
    Kong, Yi-qing
    Wang, Shi-tong
    [J]. FUZZY INFORMATION AND ENGINEERING, 2009, 1 (02): 179-190
  • [9] Feature selection via fuzzy clustering
    Sun, Hao-Jun
    Sun, Mei
    Mei, Zhen
    [J]. PROCEEDINGS OF 2006 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-7, 2006: 1400-+
  • [10] Simultaneous feature selection and ant colony clustering
    Akarsu, Emre
    Karahoca, Adem
    [J]. WORLD CONFERENCE ON INFORMATION TECHNOLOGY (WCIT-2010), 2011, 3