Information-theoretic partially labeled heterogeneous feature selection based on neighborhood rough sets

被引:14
|
作者
Zhang, Hongying [1 ]
Sun, Qianqian [1 ]
Dong, Kezhen [1 ]
机构
[1] Xi An Jiao Tong Univ, Sch Math & Stat, Xian 710049, Peoples R China
基金
中国国家自然科学基金;
关键词
Feature selection; Monotonic entropy; Partially labeled heterogeneous data; ATTRIBUTE REDUCTION;
D O I
10.1016/j.ijar.2022.12.010
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
With the rapid increase of large-scale, real-world datasets, it becomes critical to address the problem of partially labeled heterogeneous feature selection (i.e., some samples, which own numerical and categorical features, have no labels). Existing solutions typically adopt linear correlations between features. In this paper, three different monotonic uncertainty measures are defined on equivalence classes and neighborhood classes to study the partially labeled heterogeneous feature selection by exploring the nonlinear correlations. First, consistent entropy and monotonic neighborhood entropy, based on classical rough set theory and neighborhood rough set theory, are proposed to construct a uniform measure for feature selection in heterogeneous datasets. Furthermore, a maximal neighborhood entropy strategy is developed by considering the inconsistency of neighborhood classes described by the features and partial labels. Finally, two feature selection algorithms are presented by three novel monotonic uncertainty measures. The comparative experiments demonstrate the effectiveness and superiority of the newly proposed feature selection measures.(c) 2022 Elsevier Inc. All rights reserved.
引用
收藏
页码:200 / 217
页数:18
相关论文
共 50 条
  • [31] Fast multi-label feature selection based on information-theoretic feature ranking
    Lee, Jaesung
    Kim, Dae-Won
    PATTERN RECOGNITION, 2015, 48 (09) : 2761 - 2771
  • [32] Multi-label feature selection based on fuzzy neighborhood rough sets
    Jiucheng Xu
    Kaili Shen
    Lin Sun
    Complex & Intelligent Systems, 2022, 8 : 2105 - 2129
  • [33] Feature selection for label distribution learning based on neighborhood fuzzy rough sets
    Deng, Zhixuan
    Li, Tianrui
    Zhang, Pengfei
    Liu, Keyu
    Yuan, Zhong
    Deng, Dayong
    APPLIED SOFT COMPUTING, 2025, 169
  • [34] Multi-label feature selection based on fuzzy neighborhood rough sets
    Xu, Jiucheng
    Shen, Kaili
    Sun, Lin
    COMPLEX & INTELLIGENT SYSTEMS, 2022, 8 (03) : 2105 - 2129
  • [35] Feature selection for multi-label classification based on neighborhood rough sets
    Duan, Jie
    Hu, Qinghua
    Zhang, Lingjun
    Qian, Yuhua
    Li, Deyu
    Jisuanji Yanjiu yu Fazhan/Computer Research and Development, 2015, 52 (01): : 56 - 65
  • [36] Maximum relevance minimum redundancy-based feature selection using rough mutual information in adaptive neighborhood rough sets
    Kanglin Qu
    Jiucheng Xu
    Ziqin Han
    Shihui Xu
    Applied Intelligence, 2023, 53 : 17727 - 17746
  • [37] Maximum relevance minimum redundancy-based feature selection using rough mutual information in adaptive neighborhood rough sets
    Qu, Kanglin
    Xu, Jiucheng
    Han, Ziqin
    Xu, Shihui
    APPLIED INTELLIGENCE, 2023, 53 (14) : 17727 - 17746
  • [38] A Fast Information-Theoretic Approximation of Joint Mutual Information Feature Selection
    Liu, Heng
    Ditzler, Gregory
    2017 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2017, : 4610 - 4617
  • [39] Neighborhood rough sets with distance metric learning for feature selection
    Yang, Xiaoling
    Chen, Hongmei
    Li, Tianrui
    Wan, Jihong
    Sang, Binbin
    KNOWLEDGE-BASED SYSTEMS, 2021, 224
  • [40] Parallel Approaches to Neighborhood Rough Sets: Classification and Feature Selection
    Zhang, Junbo
    Wang, Chizheng
    Pan, Yi
    Li, Tianrui
    KNOWLEDGE ENGINEERING AND MANAGEMENT , ISKE 2013, 2014, 278 : 1 - 10