Feature Selection for Multi-label Classification Using Neighborhood Preservation

被引:30
|
作者
Cai, Zhiling [1 ]
Zhu, William [1 ]
机构
[1] Univ Elect Sci & Technol China, Inst Fundamental & Frontier Sci, Lab Granular Comp & AI, Chengdu 610054, Sichuan, Peoples R China
基金
中国国家自然科学基金;
关键词
Feature selection; multi-label learning; neighborhood relationship preserving; sample similarity; TRANSFORMATION; REDUCTION;
D O I
10.1109/JAS.2017.7510781
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Multi-label learning deals with data associated with a set of labels simultaneously. Dimensionality reduction is an important but challenging task in multi-label learning. Feature selection is an efficient technique for dimensionality reduction to search an optimal feature subset preserving the most relevant information. In this paper, we propose an effective feature evaluation criterion for multi-label feature selection, called neighborhood relationship preserving score. This criterion is inspired by similarity preservation, which is widely used in single-label feature selection. It evaluates each feature subset by measuring its capability in preserving neighborhood relationship among samples. Unlike similarity preservation, we address the order of sample similarities which can well express the neighborhood relationship among samples, not just the pairwise sample similarity. With this criterion, we also design one ranking algorithm and one greedy algorithm for feature selection problem. The proposed algorithms are validated in six publicly available data sets from machine learning repository. Experimental results demonstrate their superiorities over the compared state-of-the-art methods.
引用
收藏
页码:320 / 330
页数:11
相关论文
共 50 条
  • [21] A many-objective feature selection for multi-label classification
    Dong, Hongbin
    Sun, Jing
    Sun, Xiaohang
    Ding, Rui
    [J]. KNOWLEDGE-BASED SYSTEMS, 2020, 208
  • [22] Embedded Feature Selection for Multi-label Classification of Music Emotions
    You, Mingyu
    Liu, Jiaming
    Li, Guo-Zheng
    Chen, Yan
    [J]. INTERNATIONAL JOURNAL OF COMPUTATIONAL INTELLIGENCE SYSTEMS, 2012, 5 (04) : 668 - 678
  • [23] Improving Multi-Label Medical Text Classification by Feature Selection
    Glinka, Kinga
    Wozniak, Rafal
    Zakrzewska, Danuta
    [J]. 2017 IEEE 26TH INTERNATIONAL CONFERENCE ON ENABLING TECHNOLOGIES - INFRASTRUCTURE FOR COLLABORATIVE ENTERPRISES (WETICE), 2017, : 176 - 181
  • [24] gMLC: a multi-label feature selection framework for graph classification
    Kong, Xiangnan
    Yu, Philip S.
    [J]. KNOWLEDGE AND INFORMATION SYSTEMS, 2012, 31 (02) : 281 - 305
  • [25] Bayesian Chain Classifier with Feature Selection for Multi-label Classification
    Benitez Jimenez, Ricardo
    Morales, Eduardo F.
    Jair Escalante, Hugo
    [J]. ADVANCES IN SOFT COMPUTING, MICAI 2018, PT I, 2018, 11288 : 232 - 243
  • [26] Embedded Feature Selection for Multi-label Classification of Music Emotions
    Mingyu You
    Jiaming Liu
    Guo-Zheng Li
    Yan Chen
    [J]. International Journal of Computational Intelligence Systems, 2012, 5 : 668 - 678
  • [27] PS-MLC: Feature selection for multi-label classification using clustering in feature space
    Mishra, Nitin Kumar
    Singh, Pramod Kumar
    [J]. INFORMATION PROCESSING & MANAGEMENT, 2020, 57 (04)
  • [28] Label Construction for Multi-label Feature Selection
    Spolaor, Newton
    Monard, Maria Carolina
    Tsoumakas, Grigorios
    Lee, Huei Diana
    [J]. 2014 BRAZILIAN CONFERENCE ON INTELLIGENT SYSTEMS (BRACIS), 2014, : 247 - 252
  • [29] Joint multi-label classification and label correlations with missing labels and feature selection
    He, Zhi-Fen
    Yang, Ming
    Gao, Yang
    Liu, Hui-Dong
    Yin, Yilong
    [J]. KNOWLEDGE-BASED SYSTEMS, 2019, 163 : 145 - 158
  • [30] Multi-Label Feature Selection using Correlation Information
    Braytee, Ali
    Liu, Wei
    Catchpoole, Daniel R.
    Kennedy, Paul J.
    [J]. CIKM'17: PROCEEDINGS OF THE 2017 ACM CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, 2017, : 1649 - 1656