A combination of fuzzy similarity measures and fuzzy entropy measures for supervised feature selection

被引:33
|
作者
Lohrmann, Christoph [1 ,2 ]
Luukka, Pasi [2 ]
Jablonska-Sabuka, Matylda [1 ]
Kauranne, Tuomo [1 ]
机构
[1] Lappeenranta Univ Technol, Sch Engn Sci, Skinnarilankatu 34, Lappeenranta 53850, Finland
[2] Lappeenranta Univ Technol, Sch Business & Management, Skinnarilankatu 34, Lappeenranta 53850, Finland
关键词
Feature ranking; Filter method; Wrapper method; Machine learning; ReliefF; CLASSIFIER;
D O I
10.1016/j.eswa.2018.06.002
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Large amounts of information and various features are in many machine learning applications available, or easily obtainable. However, their quality is potentially low and greater volumes of information are not always beneficial for machine learning, for instance, when not all available features in a data set are relevant for the classification task and for understanding the studied phenomenon. Feature selection aims at determining a subset of features that represents the data well, gives accurate classification results and reduces the impact of noise on the classification performance. In this paper, we propose a filter feature ranking method for feature selection based on fuzzy similarity and entropy measures (FSAE), which is an adaptation of the idea used for the wrapper function by Luukka (2011) and has an additional scaling factor. The scaling factor to the feature and class-specific entropy values that is implemented, accounts for the distance between the ideal vectors for each class. Moreover, a wrapper version of the FSAE with a similarity classifier is presented as well. The feature selection method is tested on five medical data sets: dermatology, chronic kidney disease, breast cancer, diabetic retinopathy and horse colic. The wrapper version of FSAE is compared to the wrapper introduced by Luukka (2011) and shows at least as accurate results with often considerably fewer features. In the comparison with ReliefF, Laplacian score, Fisher score and the filter version of Luukka (2011), the FSAE filter in general achieves competitive mean accuracies and results for one medical data set, the breast cancer Wisconsin data set, together with the Laplacian score in the best results over all possible feature removals. (C) 2018 Elsevier Ltd. All rights reserved.
引用
收藏
页码:216 / 236
页数:21
相关论文
共 50 条
  • [1] Feature selection using fuzzy entropy measures with similarity classifier
    Luukka, Pasi
    EXPERT SYSTEMS WITH APPLICATIONS, 2011, 38 (04) : 4600 - 4607
  • [2] Feature selection using Yu's similarity measure and fuzzy entropy measures
    Iyakaremye, Cesar
    Luukka, Pasi
    Koloseni, David
    2012 IEEE INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS (FUZZ-IEEE), 2012,
  • [3] Similarity measures, penalty functions, and fuzzy entropy from new fuzzy subsethood measures
    Santos, Helida
    Couso, Ines
    Bedregal, Benjamin
    Takac, Zdenko
    Minarova, Maria
    Asiain, Alfredo
    Barrenechea, Edurne
    Bustince, Humberto
    INTERNATIONAL JOURNAL OF INTELLIGENT SYSTEMS, 2019, 34 (06) : 1281 - 1302
  • [4] Similarity and entropy measures for hesitant fuzzy sets
    Hu, Junhua
    Yang, Yan
    Zhang, Xiaolong
    Chen, Xiaohong
    INTERNATIONAL TRANSACTIONS IN OPERATIONAL RESEARCH, 2018, 25 (03) : 857 - 886
  • [5] Similarity and entropy measures for circular intuitionistic fuzzy sets
    Alreshidi, Nasser Aedh
    Shah, Zahir
    Khan, Muhammad Jabir
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2024, 131
  • [6] Feature subset selection based on fuzzy entropy measures for handling classification problems
    Shie, Jen-Da
    Chen, Shyi-Ming
    APPLIED INTELLIGENCE, 2008, 28 (01) : 69 - 82
  • [7] Feature subset selection based on fuzzy entropy measures for handling classification problems
    Jen-Da Shie
    Shyi-Ming Chen
    Applied Intelligence, 2008, 28 : 69 - 82
  • [8] On the entropy of fuzzy measures
    Yager, RR
    IEEE TRANSACTIONS ON FUZZY SYSTEMS, 2000, 8 (04) : 453 - 461
  • [9] FUZZY MEASURES AND THE ENTROPY OF FUZZY PARTITIONS
    DUMITRESCU, D
    JOURNAL OF MATHEMATICAL ANALYSIS AND APPLICATIONS, 1993, 176 (02) : 359 - 373
  • [10] On some measures of similarity and entropy for Pythagorean fuzzy sets with their applications
    Abdul Haseeb Ganie
    Surender Singh
    Mohammed M. Khalaf
    Mohammed M. Ali Al-Shamiri
    Computational and Applied Mathematics, 2022, 41