A combination of fuzzy similarity measures and fuzzy entropy measures for supervised feature selection

Cited by: 33
Authors
Lohrmann, Christoph [1 ,2 ]
Luukka, Pasi [2 ]
Jablonska-Sabuka, Matylda [1 ]
Kauranne, Tuomo [1 ]
Affiliations
[1] Lappeenranta Univ Technol, Sch Engn Sci, Skinnarilankatu 34, Lappeenranta 53850, Finland
[2] Lappeenranta Univ Technol, Sch Business & Management, Skinnarilankatu 34, Lappeenranta 53850, Finland
Keywords
Feature ranking; Filter method; Wrapper method; Machine learning; ReliefF; CLASSIFIER;
DOI
10.1016/j.eswa.2018.06.002
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
In many machine learning applications, large amounts of information and many features are available or easily obtainable. However, their quality is potentially low, and greater volumes of information are not always beneficial for machine learning, for instance when not all available features in a data set are relevant for the classification task and for understanding the studied phenomenon. Feature selection aims at determining a subset of features that represents the data well, gives accurate classification results and reduces the impact of noise on the classification performance. In this paper, we propose a filter feature ranking method for feature selection based on fuzzy similarity and entropy measures (FSAE), which adapts the idea used for the wrapper function by Luukka (2011) and adds a scaling factor. This scaling factor, applied to the feature- and class-specific entropy values, accounts for the distance between the ideal vectors of the classes. A wrapper version of the FSAE with a similarity classifier is presented as well. The feature selection method is tested on five medical data sets: dermatology, chronic kidney disease, breast cancer, diabetic retinopathy and horse colic. The wrapper version of FSAE is compared to the wrapper introduced by Luukka (2011) and yields results that are at least as accurate, often with considerably fewer features. In the comparison with ReliefF, Laplacian score, Fisher score and the filter version of Luukka (2011), the FSAE filter in general achieves competitive mean accuracies and, for one medical data set (the breast cancer Wisconsin data set), achieves together with the Laplacian score the best results over all possible feature removals. (C) 2018 Elsevier Ltd. All rights reserved.
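The ranking idea summarized in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the abstract does not give the formulas, so the sketch assumes min-max scaled features, class ideal vectors taken as arithmetic means, a Lukasiewicz-type similarity, and the De Luca-Termini fuzzy entropy. The scaling term `(1 - dist)` is a purely hypothetical stand-in for the paper's scaling factor, and the function name `fuzzy_entropy_ranking` and its parameters are invented for illustration.

```python
import numpy as np

def fuzzy_entropy_ranking(X, y, p=1.0, scale_by_class_distance=True):
    """Rank features by summed fuzzy entropy (lower score = ranked first).

    Illustrative sketch only; all modelling choices are assumptions.
    """
    X = np.asarray(X, dtype=float)
    y = np.asarray(y)

    # Assumption: min-max scale every feature to [0, 1].
    lo, hi = X.min(axis=0), X.max(axis=0)
    span = np.where(hi > lo, hi - lo, 1.0)  # avoid division by zero
    Xs = (X - lo) / span

    classes = np.unique(y)
    # Assumption: the ideal vector of a class is its arithmetic mean.
    ideals = np.stack([Xs[y == c].mean(axis=0) for c in classes])

    eps = 1e-12  # keeps log() finite at similarity 0 or 1
    scores = np.zeros(Xs.shape[1])
    for i in range(len(classes)):
        # Per-feature similarity of every sample to the ideal vector of class i.
        sim = (1.0 - np.abs(Xs**p - ideals[i]**p)) ** (1.0 / p)
        s = np.clip(sim, eps, 1.0 - eps)
        # De Luca-Termini fuzzy entropy, summed over samples for each feature.
        ent = -(s * np.log(s) + (1.0 - s) * np.log(1.0 - s)).sum(axis=0)
        if scale_by_class_distance and len(classes) > 1:
            # Hypothetical scaling: shrink the entropy of features whose
            # class ideal vectors lie far apart (NOT the paper's formula).
            others = np.delete(ideals, i, axis=0)
            dist = np.abs(others - ideals[i]).mean(axis=0)
            ent = ent * (1.0 - dist)
        scores += ent

    order = np.argsort(scores)  # feature indices, lowest entropy first
    return order, scores

# Toy usage: feature 0 separates the classes, feature 1 does not.
X_demo = [[0.0, 0.2], [0.0, 0.8], [1.0, 0.2], [1.0, 0.8]]
y_demo = [0, 0, 1, 1]
order_demo, scores_demo = fuzzy_entropy_ranking(X_demo, y_demo)
```

In a filter setting, the features at the end of `order_demo` would be the first candidates for removal; a wrapper variant would instead re-evaluate a classifier after each removal.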
Pages: 216-236
Page count: 21