A combination of fuzzy similarity measures and fuzzy entropy measures for supervised feature selection

被引:33
|
作者
Lohrmann, Christoph [1 ,2 ]
Luukka, Pasi [2 ]
Jablonska-Sabuka, Matylda [1 ]
Kauranne, Tuomo [1 ]
机构
[1] Lappeenranta Univ Technol, Sch Engn Sci, Skinnarilankatu 34, Lappeenranta 53850, Finland
[2] Lappeenranta Univ Technol, Sch Business & Management, Skinnarilankatu 34, Lappeenranta 53850, Finland
关键词
Feature ranking; Filter method; Wrapper method; Machine learning; ReliefF; CLASSIFIER;
D O I
10.1016/j.eswa.2018.06.002
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Large amounts of information and various features are in many machine learning applications available, or easily obtainable. However, their quality is potentially low and greater volumes of information are not always beneficial for machine learning, for instance, when not all available features in a data set are relevant for the classification task and for understanding the studied phenomenon. Feature selection aims at determining a subset of features that represents the data well, gives accurate classification results and reduces the impact of noise on the classification performance. In this paper, we propose a filter feature ranking method for feature selection based on fuzzy similarity and entropy measures (FSAE), which is an adaptation of the idea used for the wrapper function by Luukka (2011) and has an additional scaling factor. The scaling factor to the feature and class-specific entropy values that is implemented, accounts for the distance between the ideal vectors for each class. Moreover, a wrapper version of the FSAE with a similarity classifier is presented as well. The feature selection method is tested on five medical data sets: dermatology, chronic kidney disease, breast cancer, diabetic retinopathy and horse colic. The wrapper version of FSAE is compared to the wrapper introduced by Luukka (2011) and shows at least as accurate results with often considerably fewer features. In the comparison with ReliefF, Laplacian score, Fisher score and the filter version of Luukka (2011), the FSAE filter in general achieves competitive mean accuracies and results for one medical data set, the breast cancer Wisconsin data set, together with the Laplacian score in the best results over all possible feature removals. (C) 2018 Elsevier Ltd. All rights reserved.
引用
收藏
页码:216 / 236
页数:21
相关论文
共 50 条
  • [11] On some measures of similarity and entropy for Pythagorean fuzzy sets with their applications
    Ganie, Abdul Haseeb
    Singh, Surender
    Khalaf, Mohammed M. M.
    Al-Shamiri, Mohammed M. Ali
    COMPUTATIONAL & APPLIED MATHEMATICS, 2022, 41 (08):
  • [12] Cosine similarity, distance and entropy measures for fuzzy soft matrices
    Raj M.
    Tiwari P.
    Gupta P.
    International Journal of Information Technology, 2022, 14 (4) : 2219 - 2230
  • [13] Fuzzy Influence in Fuzzy Semantic Similarity Measures
    Adel, Naeemeh
    Crockett, Keeley
    Carvalho, Joao P.
    Cross, Valerie
    IEEE CIS INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS 2021 (FUZZ-IEEE), 2021,
  • [14] Measures for Unsupervised Fuzzy-Rough Feature Selection
    Mac Parthalain, Neil
    Jensen, Richard
    2009 9TH INTERNATIONAL CONFERENCE ON INTELLIGENT SYSTEMS DESIGN AND APPLICATIONS, 2009, : 560 - 565
  • [15] SIMILARITY MEASURES FOR FUZZY SETS
    Beg, Ismat
    Ashraf, Samina
    APPLIED AND COMPUTATIONAL MATHEMATICS, 2009, 8 (02) : 192 - 202
  • [16] A Hybrid Feature Selection Method Based on Fuzzy Feature Selection and Consistency Measures
    Jalali, Laleh
    Nasiri, Mahdi
    Minaei, Behrooz
    2009 IEEE INTERNATIONAL CONFERENCE ON INTELLIGENT COMPUTING AND INTELLIGENT SYSTEMS, PROCEEDINGS, VOL 1, 2009, : 718 - 722
  • [17] Entropy of discrete fuzzy measures
    Marichal, JL
    Roubens, M
    INTERNATIONAL JOURNAL OF UNCERTAINTY FUZZINESS AND KNOWLEDGE-BASED SYSTEMS, 2000, 8 (06) : 625 - 640
  • [18] Feature Selection Using Fuzzy Neighborhood Entropy-Based Uncertainty Measures for Fuzzy Neighborhood Multigranulation Rough Sets
    Sun, Lin
    Wang, Lanying
    Ding, Weiping
    Qian, Yuhua
    Xu, Jiucheng
    IEEE TRANSACTIONS ON FUZZY SYSTEMS, 2021, 29 (01) : 19 - 33
  • [19] New subsethood measures and similarity measures of fuzzy sets
    Huang, GS
    Liu, YS
    2005 INTERNATIONAL CONFERENCE ON COMMUNICATIONS, CIRCUITS AND SYSTEMS, VOLS 1 AND 2, PROCEEDINGS: VOL 1: COMMUNICATION THEORY AND SYSTEMS, 2005, : 999 - 1002
  • [20] Some novel q-rung orthopair fuzzy similarity measures and entropy measures with their applications
    Ganie, Abdul Haseeb
    Singh, Surender
    EXPERT SYSTEMS, 2023, 40 (06)