gMLC: a multi-label feature selection framework for graph classification

Authors
Xiangnan Kong
Philip S. Yu
Affiliation
[1] University of Illinois at Chicago, Department of Computer Science
Keywords
Feature selection; Graph classification; Multi-label learning; Subgraph pattern; Label correlation
Abstract
Graph classification is critically important in a wide variety of applications, e.g., drug activity prediction and toxicology analysis. Current research on graph classification focuses on single-label settings. However, in many applications, each graph can be assigned a set of multiple labels simultaneously. Extracting good features using the multiple labels of the graphs is therefore an important step before graph classification. In this paper, we study the problem of multi-label feature selection for graph classification and propose a novel solution, called gMLC, to efficiently search for optimal subgraph features for graph objects with multiple labels. Unlike existing feature selection methods in vector spaces, which assume the feature set is given, we perform multi-label feature selection for graph data progressively, together with the subgraph feature mining process. We derive an evaluation criterion to estimate the dependence between subgraph features and the multiple labels of graphs. Then, a branch-and-bound algorithm is proposed to efficiently search for optimal subgraph features by judiciously pruning the subgraph search space using multiple labels. Empirical studies demonstrate that our feature selection approach effectively boosts multi-label graph classification performance and is more efficient because it prunes the subgraph search space using multiple labels.
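The abstract does not state the evaluation criterion itself, so the following is only an illustrative sketch of how a dependence score between one subgraph feature and several labels could be computed. It assumes an HSIC-style score with a linear label kernel, which is an assumption and is not taken from the text above. The names `dependence_score` and `top_k`, and the toy data, are hypothetical. The sketch ranks a fixed candidate set and does not implement the branch-and-bound pruning or the progressive subgraph mining described in the abstract.

```python
def dependence_score(feature, labels):
    """HSIC-style dependence between one binary feature and a label matrix.

    feature: list of 0/1 values, one per graph (1 = subgraph occurs in the graph).
    labels:  list of rows, one per graph; each row is a list of 0/1 label values.
    Returns ||Y^T H f||^2, where H centers the feature vector f.
    This choice of criterion is an assumption made for illustration.
    """
    n = len(feature)
    mean = sum(feature) / n
    centered = [x - mean for x in feature]          # H f
    num_labels = len(labels[0])
    score = 0.0
    for q in range(num_labels):
        # Projection of the centered feature onto label column q.
        proj = sum(centered[i] * labels[i][q] for i in range(n))
        score += proj * proj
    return score


def top_k(candidates, labels, k):
    """Rank a given dict of candidate features by score and keep the best k."""
    ranked = sorted(
        candidates.items(),
        key=lambda item: dependence_score(item[1], labels),
        reverse=True,
    )
    return [name for name, _ in ranked[:k]]


# Toy data (hypothetical): four graphs, two labels, two candidate subgraphs.
labels = [[1, 0], [1, 1], [0, 1], [0, 0]]
candidates = {
    "a": [1, 1, 0, 0],  # occurs exactly in the graphs carrying the first label
    "b": [1, 0, 1, 0],  # occurrence unrelated to either label
}
best = top_k(candidates, labels, 1)
```

In this toy setting, feature "a" co-occurs with the first label and receives the higher score, so it is the one selected. A feature that occurs in every graph would score zero after centering, since it carries no information about any label.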
Pages: 281-305
Page count: 24