Multi-label classification with label clusters

被引:0
|
作者
Gatto, Elaine Cecilia [1 ]
Ferrandin, Mauri [2 ]
Cerri, Ricardo [3 ]
机构
[1] Univ Fed Sao Carlos, Dept Comp Sci, BR-13565905 Sao Carlos, SP, Brazil
[2] Univ Fed Santa Catarina, Dept Control Automat & Comp Engn, BR-89036002 Blumenau, SC, Brazil
[3] Univ Sao Paulo, Inst Math & Comp Sci, Ave Trabalhador Sao Carlense,400 Ctr, BR-13566590 Sao Carlos, SP, Brazil
关键词
Multi-label correlations; Multi-label partitioning; Multi-label clustering; Multi-label classification; Multi-label learning; CLASSIFIERS; DEPENDENCE; ENSEMBLES;
D O I
10.1007/s10115-024-02270-9
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Multi-label classification is the task of simultaneously predicting a set of labels for an instance, with global and local being the two predominant approaches. The global approach trains a single classifier to handle all classes simultaneously, while the local approach breaks down the problem into multiple binary problems. Despite extensive research, effectively capturing label correlations remains a challenge in both methods. In this paper, we introduce an approach that clusters the label space to create hybrid partitions (disjoint correlated label clusters), striking a balance between global and local strategies while leveraging both advantages. Our approach consists of (i) clustering the label space based on correlations, (ii) generating and validating the resulting hybrid partitions, (iii) selecting the best partitions, and (iv) evaluating their performance. We also compare our approach against an oracle, exhaustive search, and random search to assess how closely our hybrid partitions approximate the best possible partitions. The oracle selects the best partition using the test set, while the exhaustive approach relies on validation data. Experiments conducted on multiple multi-label datasets demonstrate that our method, along with random partitions, achieves results that are superior or competitive compared to traditional global and local approaches, as well as the state-of-the-art Ensemble of Classifier Chains. These findings suggest that conventional methods may not fully capture label correlations, and clustering the label space offers a promising solution.
引用
收藏
页码:1741 / 1785
页数:45
相关论文
共 50 条
  • [41] An efficient stacking model with label selection for multi-label classification
    Yan-Nan Chen
    Wei Weng
    Shun-Xiang Wu
    Bai-Hua Chen
    Yu-Ling Fan
    Jing-Hua Liu
    Applied Intelligence, 2021, 51 : 308 - 325
  • [42] Multi-label relational classification via node and label correlation
    Zhang, Zan
    Wang, Hao
    Liu, Lin
    Li, Jiuyong
    NEUROCOMPUTING, 2018, 292 : 72 - 81
  • [43] Multi-label Image Classification with A Probabilistic Label Enhancement Model
    Li, Xin
    Zhao, Feipeng
    Guo, Yuhong
    UNCERTAINTY IN ARTIFICIAL INTELLIGENCE, 2014, : 430 - 439
  • [44] Multi-label Classification via Label-Topic Pairs
    Chen, Gang
    Peng, Yue
    Wang, Chongjun
    WEB AND BIG DATA (APWEB-WAIM 2018), PT I, 2018, 10987 : 32 - 44
  • [45] Label Embedding for Multi-label Classification Via Dependence Maximization
    Yachong Li
    Youlong Yang
    Neural Processing Letters, 2020, 52 : 1651 - 1674
  • [46] Multi-label Iterated Learning for Image Classification with Label Ambiguity
    Rajeswar, Sai
    Rodriguez, Pau
    Singhal, Soumye
    Vazquez, David
    Courville, Aaron
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 4773 - 4783
  • [47] An efficient stacking model with label selection for multi-label classification
    Chen, Yan-Nan
    Weng, Wei
    Wu, Shun-Xiang
    Chen, Bai-Hua
    Fan, Yu-Ling
    Liu, Jing-Hua
    APPLIED INTELLIGENCE, 2021, 51 (01) : 308 - 325
  • [48] Label Clustering for a Novel Problem Transformation in Multi-label Classification
    Sellah, Small
    Hilaire, Vincent
    JOURNAL OF UNIVERSAL COMPUTER SCIENCE, 2020, 26 (01) : 71 - 88
  • [49] PLM: Partial Label Masking for Imbalanced Multi-label Classification
    Duarte, Kevin
    Rawat, Yogesh
    Shah, Mubarak
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, CVPRW 2021, 2021, : 2733 - 2742
  • [50] Label Embedding for Multi-label Classification Via Dependence Maximization
    Li, Yachong
    Yang, Youlong
    NEURAL PROCESSING LETTERS, 2020, 52 (02) : 1651 - 1674