Fuzzy Set Similarity for Feature Selection in Classification

被引:0
|
作者
Cross, Valerie [1 ]
Zmuda, Michael [1 ]
Paul, Rahul [2 ]
Hall, Lawrence [2 ]
机构
[1] Miami Univ, Comp Sci & Software Engn, Oxford, OH 45056 USA
[2] Univ S Florida, Comp Sci & Engn, Tampa, FL 33620 USA
关键词
feature selection; fuzzy set similarity measures; concordance correlation coefficient; microarray data;
D O I
10.1109/fuzz48607.2020.9177820
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
A problem for machine learning research occurs when many possible features exist but the training data examples are very few. For example, microarray data typically have a much larger number of features, the genes, as compared to the number of training data examples, the patients. One approach is to first determine the best features for prediction and then to group features based on a measure of their relatedness. The concordance correlation coefficient has been used to place somewhat correlated features into disjoint groups of similar features. Multiple base classifiers are created by randomly picking one feature from each of the feature groups and then the collection of base classifiers is used in an ensemble classifier. Each classifier in the ensemble provides a vote. The majority vote is used to produce the final class prediction. This paper investigates grouping features using fuzzy set similarity measures as well as the concordance correlation coefficient as a relatedness measure. The performance of these different measures is compared in terms of accuracy, sensitivity, specificity, and F-measure using the ensemble classifiers created with the different relatedness measures. Four microarray gene expression data sets are used in the experiments to determine the usefulness of fuzzy set similarity measures and how they compare with the concordance correlation coefficient. Using the concordance correlation coefficient to guide clustering is not superior to fuzzy set similarity measures. Depending on the particular data set and performance measure being used, different fuzzy , set similarity measures perform better than or just as well as the concordance correlation coefficient.
引用
收藏
页数:8
相关论文
共 50 条
  • [1] Fuzzy Classification in Ant Feature Selection
    Vieira, S. M.
    Sousa, J. M. C.
    Runkler, T. A.
    [J]. 2008 IEEE INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS, VOLS 1-5, 2008, : 1764 - +
  • [2] Fuzzy Rough Set Based Feature Selection for Large-Scale Hierarchical Classification
    Zhao, Hong
    Wang, Ping
    Hu, Qinghua
    Zhu, Pengfei
    [J]. IEEE TRANSACTIONS ON FUZZY SYSTEMS, 2019, 27 (10) : 1891 - 1903
  • [3] An Automated Text Classification Method: Using Improved Fuzzy Set Approach for Feature Selection
    Abbasi, Bushra Zaheer
    Hussain, Shahid
    Faisal, Muhammad Imran
    [J]. PROCEEDINGS OF 2019 16TH INTERNATIONAL BHURBAN CONFERENCE ON APPLIED SCIENCES AND TECHNOLOGY (IBCAST), 2019, : 666 - 670
  • [4] Multiobjective Evolutionary Feature Selection for Fuzzy Classification
    Jimenez, Fernando
    Martinez, Carlos
    Marzano, Enrico
    Tomas Palma, Jose
    Sanchez, Gracia
    Sciavicco, Guido
    [J]. IEEE TRANSACTIONS ON FUZZY SYSTEMS, 2019, 27 (05) : 1085 - 1099
  • [5] A new feature selection criterion for fuzzy classification
    Almeida, R. J.
    Silva, C. A.
    Sousa, J. M. C.
    [J]. 2006 IEEE INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS, VOLS 1-5, 2006, : 437 - +
  • [6] PROTOTYPE CLASSIFICATION AND FEATURE SELECTION WITH FUZZY SETS
    BEZDEK, JC
    CASTELAZ, PF
    [J]. IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS, 1977, 7 (02): : 87 - 92
  • [7] Feature Subset Selection for Fuzzy Classification Methods
    Cintra, Marcos E.
    Camargo, Heloisa A.
    [J]. INFORMATION PROCESSING AND MANAGEMENT OF UNCERTAINTY IN KNOWLEDGE-BASED SYSTEMS: THEORY AND METHODS, PT 1, 2010, 80 : 318 - +
  • [8] A fuzzy set based approach for effective feature selection
    Das, Amit Kumar
    Chakraborty, Basabi
    Goswami, Saptarsi
    Chakrabarti, Amlan
    [J]. FUZZY SETS AND SYSTEMS, 2022, 449 : 187 - 206
  • [9] A combination of fuzzy similarity measures and fuzzy entropy measures for supervised feature selection
    Lohrmann, Christoph
    Luukka, Pasi
    Jablonska-Sabuka, Matylda
    Kauranne, Tuomo
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2018, 110 : 216 - 236
  • [10] Feature selection using fuzzy entropy measures with similarity classifier
    Luukka, Pasi
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2011, 38 (04) : 4600 - 4607