Multilabel Feature Selection Using Mutual Information and ML-ReliefF for Multilabel Classification

Cited by: 9
Authors
Shi, Enhui [1]
Sun, Lin [1]
Xu, Jiucheng [1]
Zhang, Shiguang [1, 2]
Affiliations
[1] Henan Normal Univ, Coll Comp & Informat Engn, Xinxiang 453007, Henan, Peoples R China
[2] Tianjin Univ, Sch Comp Sci & Technol, Tianjin 300072, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Mutual information; Correlation; Feature extraction; Classification algorithms; Information filters; Entropy; Feature selection; mutual information; ReliefF; multilabel classification; LABEL FEATURE-SELECTION; STREAMING FEATURE-SELECTION; NAIVE BAYES; ALGORITHM;
DOI
10.1109/ACCESS.2020.3014916
Chinese Library Classification
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Multilabel classification algorithms play an increasingly significant role in data mining and machine learning. However, some existing mutual information-based algorithms ignore the influence of label proportions on the correlation degree between features and label sets. In addition, most traditional ReliefF algorithms cannot accurately measure the correlation degree of label sets, and the division of heterogeneous neighbors leads to repeated calculation. To overcome these shortcomings, this paper proposes a multilabel feature selection method using mutual information and an improved multilabel ReliefF (ML-ReliefF). First, the proportion of each label is calculated in the label space and combined with the mutual information between features and labels to construct a novel correlation degree between features and label sets; this is used to preprocess multilabel datasets and thereby reduce the runtime of ML-ReliefF. Second, the mutual information of label sets is introduced to improve the accuracy of the correlation degree among label sets. Furthermore, two types of correlation degree for label sets based on ML-ReliefF are developed to separate similar and heterogeneous samples more clearly. Third, a method for dividing heterogeneous neighbors is presented to avoid repeated calculation in ML-ReliefF, and a novel feature weighting method based on ML-ReliefF is constructed to evaluate the importance of features. Finally, a multilabel feature selection algorithm based on mutual information and ML-ReliefF is designed to improve the performance of multilabel classification. Experiments on fourteen multilabel datasets show that the algorithm is effective and improves classification performance.
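The first step of the abstract (weighting feature-label mutual information by label proportions) can be illustrated with a minimal sketch. The abstract does not give the paper's formula, so everything here is an assumption: the proportion of a label is taken as its share of all positive label assignments, the score of a feature is the proportion-weighted sum of its mutual information with each label, and the function names `mutual_information`, `label_proportions`, and `weighted_relevance` are hypothetical. Features and labels are assumed to be discrete.

```python
import numpy as np


def mutual_information(x, y):
    """Mutual information (in bits) between two discrete 1-D arrays."""
    x = np.asarray(x)
    y = np.asarray(y)
    mi = 0.0
    for xv in np.unique(x):
        px = np.mean(x == xv)
        for yv in np.unique(y):
            py = np.mean(y == yv)
            pxy = np.mean((x == xv) & (y == yv))
            if pxy > 0:
                mi += pxy * np.log2(pxy / (px * py))
    return mi


def label_proportions(Y):
    """Share of each label among all positive assignments (an assumed definition)."""
    Y = np.asarray(Y)
    counts = Y.sum(axis=0).astype(float)
    total = counts.sum()
    if total == 0:
        # No positive labels at all: fall back to uniform weights.
        return np.full(Y.shape[1], 1.0 / Y.shape[1])
    return counts / total


def weighted_relevance(X, Y):
    """Proportion-weighted sum of feature-label mutual information, per feature."""
    X = np.asarray(X)
    Y = np.asarray(Y)
    weights = label_proportions(Y)
    scores = np.zeros(X.shape[1])
    for i in range(X.shape[1]):
        for j in range(Y.shape[1]):
            scores[i] += weights[j] * mutual_information(X[:, i], Y[:, j])
    return scores


# Toy data: feature 0 copies label 0, feature 1 is constant.
X_demo = np.array([[1, 5], [0, 5], [1, 5], [0, 5]])
Y_demo = np.array([[1, 0], [0, 1], [1, 1], [0, 0]])
demo_scores = weighted_relevance(X_demo, Y_demo)
```

Under these assumptions, features with low scores could be discarded before a costlier ReliefF-style weighting pass, which is the preprocessing role the abstract describes; the threshold or ranking rule used in the paper is not stated in this record.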
Pages: 145381-145400
Number of pages: 20
Related papers
50 in total
  • [1] Multilabel feature selection using ML-ReliefF and neighborhood mutual information for multilabel neighborhood decision systems
    Sun, Lin
    Yin, Tengyu
    Ding, Weiping
    Qian, Yuhua
    Xu, Jiucheng
    [J]. INFORMATION SCIENCES, 2020, 537 : 401 - 424
  • [2] Mutual information-based feature selection for multilabel classification
    Doquire, Gauthier
    Verleysen, Michel
    [J]. NEUROCOMPUTING, 2013, 122 : 148 - 155
  • [3] Distributed Selection of Continuous Features in Multilabel Classification Using Mutual Information
    Gonzalez-Lopez, Jorge
    Ventura, Sebastian
    Cano, Alberto
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2020, 31 (07) : 2280 - 2293
  • [4] Feature selection using Fisher score and multilabel neighborhood rough sets for multilabel classification
    Sun, Lin
    Wang, Tianxiang
    Ding, Weiping
    Xu, Jiucheng
    Lin, Yaojin
    [J]. INFORMATION SCIENCES, 2021, 578 : 887 - 912
  • [5] Streaming Feature Selection for Multilabel Learning Based on Fuzzy Mutual Information
    Lin, Yaojin
    Hu, Qinghua
    Liu, Jinghua
    Li, Jinjin
    Wu, Xindong
    [J]. IEEE TRANSACTIONS ON FUZZY SYSTEMS, 2017, 25 (06) : 1491 - 1507
  • [6] Multilabel Feature Selection Based on Fuzzy Mutual Information and Orthogonal Regression
    Dai, Jianhua
    Liu, Qi
    Chen, Wenxiang
    Zhang, Chucai
    [J]. IEEE TRANSACTIONS ON FUZZY SYSTEMS, 2024, 32 (09) : 5136 - 5148
  • [7] Multilabel all-relevant feature selection using lower bounds of conditional mutual information
    Teisseyre, Pawel
    Lee, Jaesung
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2023, 216
  • [8] Joint Feature Selection and Classification for Multilabel Learning
    Huang, Jun
    Li, Guorong
    Huang, Qingming
    Wu, Xindong
    [J]. IEEE TRANSACTIONS ON CYBERNETICS, 2018, 48 (03) : 876 - 889
  • [9] Graphical Feature Selection for Multilabel Classification Tasks
    Lastra, Gerardo
    Luaces, Oscar
    Quevedo, Jose R.
    Bahamonde, Antonio
    [J]. ADVANCES IN INTELLIGENT DATA ANALYSIS X: IDA 2011, 2011, 7014 : 246 - 257
  • [10] LEFMIFS: Label enhancement and fuzzy mutual information for robust multilabel feature selection
    Yin, Tengyu
    Chen, Hongmei
    Yuan, Zhong
    Sang, Binbin
    Horng, Shi-Jinn
    Li, Tianrui
    Luo, Chuan
    [J]. ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2024, 133