A Robust Multilabel Feature Selection Approach Based on Graph Structure Considering Fuzzy Dependency and Feature Interaction

Cited by: 14
Authors
Yin, Tengyu [1, 2, 3, 4]
Chen, Hongmei [1, 2, 3, 4]
Yuan, Zhong [5]
Wan, Jihong [1, 2, 3, 4]
Liu, Keyu [1, 2, 3, 4]
Horng, Shi-Jinn [6, 7]
Li, Tianrui [1, 2, 3, 4]
Affiliations
[1] Southwest Jiaotong Univ, Inst Artificial Intelligence, Sch Comp & Artificial Intelligence, Chengdu 611756, Peoples R China
[2] Southwest Jiaotong Univ, Natl Engn Lab Integrated Transportat Big Data Appl, Chengdu 611756, Peoples R China
[3] Southwest Jiaotong Univ, Engn Res Ctr Sustainable Urban Intelligent Transpor, Minist Educ, Chengdu 611756, Peoples R China
[4] Southwest Jiaotong Univ, Mfg Ind Chains Collaborat & Informat Support Techn, Chengdu 611756, Peoples R China
[5] Sichuan Univ, Coll Comp Sci, Chengdu 610065, Peoples R China
[6] Asia Univ, Dept Comp Sci & Informat Engn, Taichung 41354, Taiwan
[7] Sichuan Univ, China Med Univ Hosp, China Med Univ, Dept Med Res, Taichung 404327, Taiwan
Funding
National Natural Science Foundation of China
Keywords
Feature interaction; fuzzy rough sets; graph structure; multilabel feature selection; robustness; neighborhood rough sets; mutual information
DOI
10.1109/TFUZZ.2023.3287193
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
The performance of multilabel learning depends heavily on the quality of the input features. A large number of irrelevant and redundant features can seriously degrade that performance, and feature selection is an effective technique for addressing this problem. However, most multilabel feature selection methods emphasize removing such useless features and ignore feature interaction. Moreover, real-world data are often uncertain, ambiguous, and noisy, which limits the performance of feature selection. To this end, this work designs an efficient and robust multilabel feature selection scheme. First, the distribution characteristics of multilabel data are analyzed to generate robust fuzzy multineighborhood granules. By exploring the classification information implied in the data under this granularity structure, a robust multilabel k-nearest neighbor fuzzy rough set model is constructed, and the concept of fuzzy dependency is studied. Second, a series of fuzzy multineighborhood uncertainty measures in k-nearest neighbor fuzzy rough approximation spaces are studied to analyze the correlations of feature pairs, including interactivity. Third, using the uncertainty measures between features and labels and between pairs of features, multilabel data are modeled as a complete weighted graph. The vertices of this graph are then assessed iteratively to guide the assignment of feature weights. Finally, a graph structure-based robust multilabel feature selection algorithm (GRMFS) is designed. Experiments on 15 multilabel datasets verify the superior performance of GRMFS compared with nine representative feature selection methods.
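The graph-based weighting step described in the abstract can be illustrated with a minimal sketch. This is not the GRMFS algorithm: the abstract does not give its formulas, so the sketch assumes absolute Pearson correlation as a stand-in for the fuzzy rough uncertainty measures and a PageRank-style update as a stand-in for the iterative vertex assessment. The names `abs_corr` and `rank_features`, and the parameters `damping`, `iters`, and `tol`, are hypothetical.

```python
import math


def abs_corr(x, y):
    """Absolute Pearson correlation (assumed stand-in for the paper's measures)."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    if sxx == 0 or syy == 0:
        return 0.0  # a constant column carries no correlation information
    return abs(sxy) / math.sqrt(sxx * syy)


def rank_features(features, labels, damping=0.85, iters=200, tol=1e-10):
    """Rank features on a complete weighted graph (illustrative only).

    features: list of feature columns; labels: list of label columns.
    Returns (indices sorted by descending weight, list of weights).
    """
    m = len(features)

    # Vertex prior: mean feature-label relevance, normalized to sum to 1.
    relevance = [sum(abs_corr(f, y) for y in labels) / len(labels)
                 for f in features]
    total = sum(relevance)
    prior = [r / total for r in relevance] if total > 0 else [1.0 / m] * m

    # Edge weights: 1 - |corr|, so less redundant feature pairs get heavier
    # edges (an assumption; the paper's edges also encode interaction).
    edge = [[0.0 if i == j
             else max(0.0, 1.0 - abs_corr(features[i], features[j]))
             for j in range(m)] for i in range(m)]
    out = [sum(row) for row in edge]

    # Iteratively reassess vertices until the weights stabilize.
    score = prior[:]
    for _ in range(iters):
        new = [(1.0 - damping) * p for p in prior]
        for j in range(m):
            for i in range(m):
                if out[j] > 0:
                    new[i] += damping * score[j] * edge[j][i] / out[j]
                else:
                    # No outgoing weight: fall back to the prior.
                    new[i] += damping * score[j] * prior[i]
        delta = sum(abs(a - b) for a, b in zip(new, score))
        score = new
        if delta < tol:
            break

    order = sorted(range(m), key=lambda i: score[i], reverse=True)
    return order, score
```

Calling `rank_features(columns, label_columns)` returns the feature indices from highest to lowest weight, and a selection step would keep the top-ranked ones. The prior anchors each weight to label relevance, while the edge term lets complementary features reinforce one another.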
Pages: 4516-4528
Page count: 13