Feature Selection for Multi-label Classification Using Neighborhood Preservation

被引:30
|
作者
Cai, Zhiling [1 ]
Zhu, William [1 ]
机构
[1] Univ Elect Sci & Technol China, Inst Fundamental & Frontier Sci, Lab Granular Comp & AI, Chengdu 610054, Sichuan, Peoples R China
基金
中国国家自然科学基金;
关键词
Feature selection; multi-label learning; neighborhood relationship preserving; sample similarity; TRANSFORMATION; REDUCTION;
D O I
10.1109/JAS.2017.7510781
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Multi-label learning deals with data associated with a set of labels simultaneously. Dimensionality reduction is an important but challenging task in multi-label learning. Feature selection is an efficient technique for dimensionality reduction to search an optimal feature subset preserving the most relevant information. In this paper, we propose an effective feature evaluation criterion for multi-label feature selection, called neighborhood relationship preserving score. This criterion is inspired by similarity preservation, which is widely used in single-label feature selection. It evaluates each feature subset by measuring its capability in preserving neighborhood relationship among samples. Unlike similarity preservation, we address the order of sample similarities which can well express the neighborhood relationship among samples, not just the pairwise sample similarity. With this criterion, we also design one ranking algorithm and one greedy algorithm for feature selection problem. The proposed algorithms are validated in six publicly available data sets from machine learning repository. Experimental results demonstrate their superiorities over the compared state-of-the-art methods.
引用
收藏
页码:320 / 330
页数:11
相关论文
共 50 条
  • [1] Feature Selection for Multi-label Classification Using Neighborhood Preservation
    Zhiling Cai
    William Zhu
    [J]. IEEE/CAA Journal of Automatica Sinica, 2018, 5 (01) : 320 - 330
  • [2] Feature Selection for Hierarchical Multi-label Classification
    da Silva, Luan V. M.
    Cerri, Ricardo
    [J]. ADVANCES IN INTELLIGENT DATA ANALYSIS XIX, IDA 2021, 2021, 12695 : 196 - 208
  • [3] Feature Selection for Multi-label Classification Problems
    Doquire, Gauthier
    Verleysen, Michel
    [J]. ADVANCES IN COMPUTATIONAL INTELLIGENCE, IWANN 2011, PT I, 2011, 6691 : 9 - 16
  • [4] Multi-label feature selection based on neighborhood mutual information
    Lin, Yaojin
    Hu, Qinghua
    Liu, Jinghua
    Chen, Jinkun
    Duan, Jie
    [J]. APPLIED SOFT COMPUTING, 2016, 38 : 244 - 256
  • [5] Feature selection for multi-label classification using multivariate mutual information
    Lee, Jaesung
    Kim, Dae-Won
    [J]. PATTERN RECOGNITION LETTERS, 2013, 34 (03) : 349 - 357
  • [6] Feature selection for multi-label naive Bayes classification
    Zhang, Min-Ling
    Pena, Jose M.
    Robles, Victor
    [J]. INFORMATION SCIENCES, 2009, 179 (19) : 3218 - 3229
  • [7] A lazy feature selection method for multi-label classification
    Pereira, Rafael B.
    Plastino, Alexandre
    Zadrozny, Bianca
    Merschmann, Luiz H. C.
    [J]. INTELLIGENT DATA ANALYSIS, 2021, 25 (01) : 21 - 34
  • [8] Categorizing feature selection methods for multi-label classification
    Pereira, Rafael B.
    Plastino, Alexandre
    Zadrozny, Bianca
    Merschmann, Luiz H. C.
    [J]. ARTIFICIAL INTELLIGENCE REVIEW, 2018, 49 (01) : 57 - 78
  • [9] Optimization approach for feature selection in multi-label classification
    Lim, Hyunki
    Lee, Jaesung
    Kim, Dae-Won
    [J]. PATTERN RECOGNITION LETTERS, 2017, 89 : 25 - 30
  • [10] Memetic feature selection algorithm for multi-label classification
    Lee, Jaesung
    Kim, Dae-Won
    [J]. INFORMATION SCIENCES, 2015, 293 : 80 - 96