Multi-label feature selection based on minimizing feature redundancy of mutual information

Cited by: 0
Authors
Zhou, Gaozhi [2]
Li, Runxin [1,2]
Shang, Zhenhong [2]
Li, Xiaowu [2]
Jia, Lianyin [2]
Affiliations
[1] Kunming Univ Sci & Technol, Yunnan Key Lab Comp Technol Applicat, Kunming 650500, Peoples R China
[2] Kunming Univ Sci & Technol, Fac Informat Engn & Automat, Kunming 650500, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Multi-label feature selection; Mutual information; Sparse model; Redundant correlation; OPTIMIZATION ALGORITHM; SHRINKAGE; COMMON;
DOI
10.1016/j.neucom.2024.128392
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Multi-label feature selection is an indispensable step in preprocessing high-dimensional multi-label data. Approaches based on information theory and sparse models hold promise in this domain and have demonstrated strong performance. Although an extensive literature uses the ℓ1- and ℓ2,1-norms to identify label-specific features and common features in the feature space, these methods all ignore the interference of redundant information that arises when different features are learned simultaneously. Considering that features and labels in multi-label data are rarely linearly correlated, we present the MFS-MFR approach, which uses a mutual information estimator to generate a representation of the nonlinear correlation between features and labels. MFS-MFR then detects specific and common features in the feature-label mutual information space using two coefficient matrices constrained by the ℓ1- and ℓ2,1-norms, respectively. In particular, we define a nonzero correlation constraint that effectively minimizes the redundant correlation between the two matrices. Moreover, a manifold regularization term is devised to preserve the local information of the mutual information space. To solve the optimization model with a nonlinear binary regularization term, we employ a novel solution approach called S-FISTA. Extensive experiments on 15 multi-label benchmark datasets, comparing against 11 top-performing multi-label feature selection methods, demonstrate the superior performance of MFS-MFR.
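The building blocks named in the abstract can be illustrated with a minimal sketch. This is not the authors' MFS-MFR implementation: the abstract does not say which mutual information estimator is used, so a simple plug-in estimate for discrete variables is assumed here, and all function names are hypothetical. The two shrinkage functions are the standard proximal operators of the ℓ1- and ℓ2,1-norms as they appear in FISTA-type solvers; the details of S-FISTA itself are not given in the abstract.

```python
import math
from collections import Counter

def mutual_information(x, y):
    """Plug-in estimate of I(X;Y) in nats for two equal-length discrete sequences.
    Assumption: the paper's actual estimator may differ."""
    n = len(x)
    px, py, pxy = Counter(x), Counter(y), Counter(zip(x, y))
    # p(a,b) * log(p(a,b) / (p(a) p(b))), written with raw counts
    return sum((c / n) * math.log(c * n / (px[a] * py[b]))
               for (a, b), c in pxy.items())

def l1_norm(W):
    """Sum of absolute entries; penalizing it encourages entry-wise sparsity."""
    return sum(abs(v) for row in W for v in row)

def l21_norm(W):
    """Sum of row-wise Euclidean norms; penalizing it encourages whole rows to vanish."""
    return sum(math.sqrt(sum(v * v for v in row)) for row in W)

def soft_threshold(W, t):
    """Proximal operator of t * l1-norm: shrink every entry toward zero by t."""
    return [[math.copysign(max(abs(v) - t, 0.0), v) for v in row] for row in W]

def row_shrink(W, t):
    """Proximal operator of t * l2,1-norm: scale each row, zeroing rows with norm <= t."""
    out = []
    for row in W:
        norm = math.sqrt(sum(v * v for v in row))
        scale = max(1.0 - t / norm, 0.0) if norm > 0 else 0.0
        out.append([scale * v for v in row])
    return out
```

Entry-wise shrinkage keeps or drops individual feature-label coefficients, which matches the idea of label-specific features, whereas row-wise shrinkage keeps or drops a feature for all labels at once, which matches the idea of common features.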
Pages: 16