SemiFREE: Semisupervised Feature Selection With Fuzzy Relevance and Redundancy

被引:26
|
作者
Liu, Keyu [1 ,2 ,3 ]
Li, Tianrui [1 ,2 ,3 ]
Yang, Xibei [4 ]
Chen, Hongmei [1 ,2 ,3 ]
Wang, Jie [1 ,2 ,3 ]
Deng, Zhixuan [1 ,2 ,3 ]
机构
[1] Southwest Jiaotong Univ, Sch Comp & Artificial Intelligence, Chengdu 611756, Peoples R China
[2] Southwest Jiaotong Univ, Inst Artificial Intelligence, Chengdu 611756, Peoples R China
[3] Southwest Jiaotong Univ, Natl Engn Lab Integrated Transportat Big Data Appl, Chengdu 611756, Peoples R China
[4] Jiangsu Univ Sci & Technol, Sch Comp, Zhenjiang 212100, Peoples R China
基金
美国国家科学基金会;
关键词
Fuzzy feature selection; fuzzy mutual information; maximum relevance; minimal redundancy; semisupervised feature selection; SUPERVISED FEATURE-SELECTION; ATTRIBUTE REDUCTION; BAND SELECTION; SETS;
D O I
10.1109/TFUZZ.2023.3255893
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Feature selection, as an effective dimensionality reduction technique, is favored in preprocessing data. However, most existing algorithms are solely liable for labeled or unlabeled data, whereas a limited portion of real-world data is annotated with labels. In this article, we therefore propose a novel scheme named SemiFREE, i.e., semisupervised feature selection with fuzzy relevance and redundancy. First, both labeled and unlabeled samples are assigned with fuzzy decisions that allow class membership to naturally express the fuzziness or uncertainty in data labeling. Second, sample similarities in feature space and fuzzy decision are captured to induce fuzzy information measures for redefining the feature relevance and redundancy. Finally, adhering to the principle of relevance-maximization and redundancy-minimization, SemiFREE leverages the forward sequential searching strategy to identify qualified features progressively. Extensive experiments demonstrate the superiority of SemiFREE in the presence of partially labeled data against some other well-established feature selection algorithms.
引用
收藏
页码:3384 / 3396
页数:13
相关论文
共 50 条
  • [1] Semisupervised Feature Selection Based on Relevance and Redundancy Criteria
    Xu, Jin
    Tang, Bo
    He, Haibo
    Man, Hong
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2017, 28 (09) : 1974 - 1984
  • [2] Feature Selection and Semisupervised Fuzzy Clustering
    Kong, Yi-qing
    Wang, Shi-tong
    FUZZY INFORMATION AND ENGINEERING, 2009, 1 (02) : 179 - 190
  • [3] A Feature Selection Based on Relevance and Redundancy
    Lu, Yonghe
    Liu, Wenqiu
    Li, Yanfeng
    JOURNAL OF COMPUTERS, 2015, 10 (04) : 284 - 291
  • [4] Feature Selection with Integrated Relevance and Redundancy Optimization
    Xu, Linli
    Zhou, Qi
    Huang, Aiqing
    Ouyang, Wenjun
    Chen, Enhong
    2015 IEEE INTERNATIONAL CONFERENCE ON DATA MINING (ICDM), 2015, : 1063 - 1068
  • [5] Feature Selection via Analysis of Relevance and Redundancy
    王飒
    王克勇
    郑链
    JournalofBeijingInstituteofTechnology, 2008, (03) : 300 - 304
  • [6] Minimum redundancy - Maximum relevance feature selection
    Peng, HC
    Ding, C
    Long, FH
    IEEE INTELLIGENT SYSTEMS, 2005, 20 (06) : 70 - 71
  • [7] Fuzzy information-theoretic feature selection via relevance, redundancy, and complementarity criteria
    Ma, Xi-Ao
    Ju, Chunhua
    INFORMATION SCIENCES, 2022, 611 : 564 - 590
  • [8] Feature Selection With Maximal Relevance and Minimal Supervised Redundancy
    Wang, Yadi
    Li, Xiaoping
    Ruiz, Ruben
    IEEE TRANSACTIONS ON CYBERNETICS, 2023, 53 (02) : 707 - 717
  • [9] Efficient feature selection via analysis of relevance and redundancy
    Yu, L
    Liu, H
    JOURNAL OF MACHINE LEARNING RESEARCH, 2004, 5 : 1205 - 1224
  • [10] Improved Measures of Redundancy and Relevance for mRMR Feature Selection
    Jo, Insik
    Lee, Sangbum
    Oh, Sejong
    COMPUTERS, 2019, 8 (02)