gMLC: a multi-label feature selection framework for graph classification

Cited by: 0
Authors
Xiangnan Kong
Philip S. Yu
Affiliation
[1] University of Illinois at Chicago, Department of Computer Science
Source
Knowledge and Information Systems, 2012, 31 (02)
Keywords
Feature selection; Graph classification; Multi-label learning; Subgraph pattern; Label correlation
DOI
Not available
Abstract
Graph classification is critically important in a wide variety of applications, e.g. drug activity prediction and toxicology analysis. Current research on graph classification focuses on single-label settings. However, in many applications, each graph can be assigned a set of multiple labels simultaneously. Extracting good features using the multiple labels of the graphs therefore becomes an important step before graph classification. In this paper, we study the problem of multi-label feature selection for graph classification and propose a novel solution, called gMLC, to efficiently search for optimal subgraph features for graph objects with multiple labels. Unlike existing feature selection methods in vector spaces, which assume the feature set is given, we perform multi-label feature selection for graph data progressively, together with the subgraph feature mining process. We derive an evaluation criterion to estimate the dependence between subgraph features and the multiple labels of graphs. Then, a branch-and-bound algorithm is proposed to efficiently search for optimal subgraph features by judiciously pruning the subgraph search space using multiple labels. Empirical studies demonstrate that our feature selection approach effectively boosts multi-label graph classification performance and is more efficient because it prunes the subgraph search space using multiple labels.
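The abstract does not spell out the evaluation criterion or the pruning bound, so the following is only a minimal sketch under stated assumptions, not the authors' implementation. It assumes a Hilbert-Schmidt-style dependence score with a linear kernel over label vectors, and it assumes subgraph features are given as 0/1 occurrence vectors. The function names `dependence_scores` and `pruning_bounds` are hypothetical. The bound relies on one fact: a supergraph can only occur in a subset of the graphs that contain its subgraph, so summing only the non-negative entries of the label matrix over the subgraph's support cannot be exceeded by any supergraph's score.

```python
import numpy as np

def centered_label_kernel(Y):
    """Return M = H L H, with L = Y Y^T (linear label kernel, an assumption)
    and H the n x n centering matrix."""
    Y = np.asarray(Y, dtype=float)
    n = Y.shape[0]
    H = np.eye(n) - np.ones((n, n)) / n
    return H @ (Y @ Y.T) @ H

def dependence_scores(X, Y):
    """Score each feature column x_j of X by x_j^T M x_j.
    X: (n_graphs, n_features) 0/1 occurrence matrix.
    Y: (n_graphs, n_labels) 0/1 label matrix."""
    X = np.asarray(X, dtype=float)
    M = centered_label_kernel(Y)
    return np.einsum("ij,ik,kj->j", X, M, X)

def pruning_bounds(X, Y):
    """Upper bound on the score of any feature whose occurrences are a
    subset of x_j's occurrences: x_j^T max(M, 0) x_j."""
    X = np.asarray(X, dtype=float)
    M_pos = np.maximum(centered_label_kernel(Y), 0.0)
    return np.einsum("ij,ik,kj->j", X, M_pos, X)

# Toy data: 4 graphs, 1 label, 3 candidate subgraph features.
Y = np.array([[1], [1], [0], [0]])
X = np.array([
    [1, 1, 1],
    [1, 1, 0],
    [0, 1, 1],
    [0, 1, 0],
])
scores = dependence_scores(X, Y)
bounds = pruning_bounds(X, Y)
best = int(np.argmax(scores))  # index of the highest-scoring feature
```

In a branch-and-bound search of this kind, a candidate subgraph whose bound falls below the score of the worst feature currently selected could be discarded together with all of its supergraphs, which is the sort of pruning the abstract describes.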
Pages: 281 - 305
Page count: 24
Related papers
50 in total
  • [1] gMLC: a multi-label feature selection framework for graph classification
    Kong, Xiangnan
    Yu, Philip S.
    [J]. KNOWLEDGE AND INFORMATION SYSTEMS, 2012, 31 (02) : 281 - 305
  • [2] Feature Selection for Hierarchical Multi-label Classification
    da Silva, Luan V. M.
    Cerri, Ricardo
    [J]. ADVANCES IN INTELLIGENT DATA ANALYSIS XIX, IDA 2021, 2021, 12695 : 196 - 208
  • [3] Feature Selection for Multi-label Classification Problems
    Doquire, Gauthier
    Verleysen, Michel
    [J]. ADVANCES IN COMPUTATIONAL INTELLIGENCE, IWANN 2011, PT I, 2011, 6691 : 9 - 16
  • [4] Label generation with consistency on the graph for multi-label feature selection
    Hao, Pingting
    Zhang, Ping
    Feng, Qi
    Gao, Wanfu
    [J]. INFORMATION SCIENCES, 2024, 677
  • [5] Online Feature Selection for Multi-label Classification in Multi-objective Optimization Framework
    Paul, Dipanjyoti
    Kumar, Rahul
    Saha, Sriparna
    Mathew, Jimson
    [J]. PROCEEDINGS OF THE 2019 IEEE/ACM INTERNATIONAL CONFERENCE ON ADVANCES IN SOCIAL NETWORKS ANALYSIS AND MINING (ASONAM 2019), 2019, : 530 - 531
  • [6] Feature selection for multi-label naive Bayes classification
    Zhang, Min-Ling
    Pena, Jose M.
    Robles, Victor
    [J]. INFORMATION SCIENCES, 2009, 179 (19) : 3218 - 3229
  • [7] Categorizing feature selection methods for multi-label classification
    Pereira, Rafael B.
    Plastino, Alexandre
    Zadrozny, Bianca
    Merschmann, Luiz H. C.
    [J]. ARTIFICIAL INTELLIGENCE REVIEW, 2018, 49 (01) : 57 - 78
  • [8] A lazy feature selection method for multi-label classification
    Pereira, Rafael B.
    Plastino, Alexandre
    Zadrozny, Bianca
    Merschmann, Luiz H. C.
    [J]. INTELLIGENT DATA ANALYSIS, 2021, 25 (01) : 21 - 34
  • [9] Feature Selection in Multi-label classification through MLQPFS
    Soheili, Majid
    Moghadam, Amir-Massoud Eftekhari
    [J]. 2016 4TH INTERNATIONAL CONFERENCE ON CONTROL, INSTRUMENTATION, AND AUTOMATION (ICCIA), 2016, : 430 - 434
  • [10] Categorizing feature selection methods for multi-label classification
    Pereira, Rafael B.
    Plastino, Alexandre
    Zadrozny, Bianca
    Merschmann, Luiz H. C.
    [J]. ARTIFICIAL INTELLIGENCE REVIEW, 2018, 49 : 57 - 78