Mutual information-based label distribution feature selection for multi-label learning

被引:45
|
作者
Qian, Wenbin [1 ,2 ]
Huang, Jintao [3 ]
Wang, Yinglong [3 ]
Shu, Wenhao [4 ]
机构
[1] Jiangxi Agr Univ, Sch Software, Nanchang 330045, Jiangxi, Peoples R China
[2] Beijing Key Lab Knowledge Engn Mat Sci, Beijing 100083, Peoples R China
[3] Jiangxi Agr Univ, Sch Comp & Informat Engn, Nanchang 330045, Jiangxi, Peoples R China
[4] East China Jiaotong Univ, Sch Informat Engn, Nanchang 330013, Jiangxi, Peoples R China
基金
中国国家自然科学基金;
关键词
Feature selection; Multi-label data; Granular computing; Label enhancement; Mutual information; STREAMING FEATURE-SELECTION; ATTRIBUTE REDUCTION; MISSING LABELS; CLASSIFICATION; GRAPH; ACCELERATOR; ALGORITHM; DECISION; SPARSE;
D O I
10.1016/j.knosys.2020.105684
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Feature selection used for dimensionality reduction of the feature space plays an important role in multi-label learning where high-dimensional data are involved. Although most existing multi-label feature selection approaches can deal with the problem of label ambiguity which mainly focuses on the assumption of uniform distribution with logical labels, it cannot be applied to many practical applications where the significance of related label for every instance tends to be different. To deal with this issue, in this study, label distribution learning covered with a certain real number of labels is introduced to design a model for the labeling-significance. Nevertheless, multi-label feature selection is limited to handling only labels consisting of logical relations. In order to solve this problem, combining the random variable distribution with granular computing, we first propose a label enhancement algorithm to transform logical labels in multi-label data into label distribution with more supervised information, which can mine the hidden label significance from every instance. On this basis, to remove some redundant or irrelevant features in multi-label data, a label distribution feature selection algorithm using mutual information and label enhancement is developed. Finally, the experimental results show that the performance of the proposed method is superior to the other state-of-the-art approaches when dealing with multi-label data. (C) 2020 Elsevier B.V. All rights reserved.
引用
下载
收藏
页数:24
相关论文
共 50 条
  • [31] Feature Redundancy Based on Interaction Information for Multi-Label Feature Selection
    Gao, Wanfu
    Hu, Juncheng
    Li, Yonghao
    Zhang, Ping
    IEEE ACCESS, 2020, 8 : 146050 - 146064
  • [32] Feature selection based on label distribution and fuzzy mutual information
    Xiong, Chuanzhen
    Qian, Wenbin
    Wang, Yinglong
    Huang, Jintao
    INFORMATION SCIENCES, 2021, 574 : 297 - 319
  • [33] Distributed multi-label feature selection using individual mutual information measures
    Gonzalez-Lopez, Jorge
    Ventura, Sebastian
    Cano, Alberto
    KNOWLEDGE-BASED SYSTEMS, 2020, 188 (188)
  • [34] Multi-label feature selection based on label correlations and feature redundancy
    Fan, Yuling
    Chen, Baihua
    Huang, Weiqin
    Liu, Jinghua
    Weng, Wei
    Lan, Weiyao
    KNOWLEDGE-BASED SYSTEMS, 2022, 241
  • [35] Multi-label feature selection algorithm based on information entropy
    Zhang, Zhenhai
    Li, Shining
    Li, Zhigang
    Chen, Hao
    Jisuanji Yanjiu yu Fazhan/Computer Research and Development, 2013, 50 (06): : 1177 - 1184
  • [36] Multi-label feature selection for missing labels by granular-ball based mutual information
    Shu, Wenhao
    Hu, Yichen
    Qian, Wenbin
    APPLIED INTELLIGENCE, 2024, 54 (23) : 12589 - 12612
  • [37] Multi-label Feature Selection Method Based on Multivariate Mutual Information and Particle Swarm Optimization
    Wang, Xidong
    Zhao, Lei
    Xu, Jianhua
    NEURAL INFORMATION PROCESSING (ICONIP 2018), PT IV, 2018, 11304 : 84 - 95
  • [38] Multi-label feature selection based on correlation label enhancement
    He, Zhuoxin
    Lin, Yaojin
    Wang, Chenxi
    Guo, Lei
    Ding, Weiping
    INFORMATION SCIENCES, 2023, 647
  • [39] Multi-label feature selection based on the division of label topics
    Zhang, Ping
    Gao, Wanfu
    Hu, Juncheng
    Li, Yonghao
    INFORMATION SCIENCES, 2021, 553 : 129 - 153
  • [40] Label Construction for Multi-label Feature Selection
    Spolaor, Newton
    Monard, Maria Carolina
    Tsoumakas, Grigorios
    Lee, Huei Diana
    2014 BRAZILIAN CONFERENCE ON INTELLIGENT SYSTEMS (BRACIS), 2014, : 247 - 252