Multi-label classification with label clusters

Cited by: 0
Authors
Gatto, Elaine Cecilia [1]
Ferrandin, Mauri [2]
Cerri, Ricardo [3]
Affiliations
[1] Univ Fed Sao Carlos, Dept Comp Sci, BR-13565905 Sao Carlos, SP, Brazil
[2] Univ Fed Santa Catarina, Dept Control Automat & Comp Engn, BR-89036002 Blumenau, SC, Brazil
[3] Univ Sao Paulo, Inst Math & Comp Sci, Ave Trabalhador Sao Carlense,400 Ctr, BR-13566590 Sao Carlos, SP, Brazil
Keywords
Multi-label correlations; Multi-label partitioning; Multi-label clustering; Multi-label classification; Multi-label learning; CLASSIFIERS; DEPENDENCE; ENSEMBLES
DOI
10.1007/s10115-024-02270-9
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Multi-label classification is the task of simultaneously predicting a set of labels for an instance, with global and local being the two predominant approaches. The global approach trains a single classifier to handle all classes simultaneously, while the local approach breaks down the problem into multiple binary problems. Despite extensive research, effectively capturing label correlations remains a challenge in both methods. In this paper, we introduce an approach that clusters the label space to create hybrid partitions (disjoint correlated label clusters), striking a balance between global and local strategies while leveraging both advantages. Our approach consists of (i) clustering the label space based on correlations, (ii) generating and validating the resulting hybrid partitions, (iii) selecting the best partitions, and (iv) evaluating their performance. We also compare our approach against an oracle, exhaustive search, and random search to assess how closely our hybrid partitions approximate the best possible partitions. The oracle selects the best partition using the test set, while the exhaustive approach relies on validation data. Experiments conducted on multiple multi-label datasets demonstrate that our method, along with random partitions, achieves results that are superior or competitive compared to traditional global and local approaches, as well as the state-of-the-art Ensemble of Classifier Chains. These findings suggest that conventional methods may not fully capture label correlations, and clustering the label space offers a promising solution.
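Step (i) of the abstract, clustering the label space based on correlations, can be illustrated with a small sketch. The abstract does not specify the similarity measure or the clustering algorithm, so everything here is an assumption made for illustration: the Jaccard index as the label-correlation measure, average-linkage agglomerative clustering, and the hypothetical helper names `jaccard` and `cluster_labels`. It is not the authors' implementation.

```python
from itertools import combinations


def jaccard(Y, a, b):
    """Jaccard similarity between label columns a and b of a binary label matrix Y."""
    both = sum(1 for row in Y if row[a] and row[b])
    either = sum(1 for row in Y if row[a] or row[b])
    return both / either if either else 0.0


def cluster_labels(Y, n_clusters):
    """Group labels into n_clusters disjoint clusters (assumed: average linkage)."""
    n_labels = len(Y[0])
    # Pairwise label similarities, keyed by (smaller index, larger index).
    sim = {(a, b): jaccard(Y, a, b) for a, b in combinations(range(n_labels), 2)}
    clusters = [[j] for j in range(n_labels)]  # start with one cluster per label

    def linkage(c1, c2):
        # Average similarity over all cross-cluster label pairs.
        pairs = [(min(a, b), max(a, b)) for a in c1 for b in c2]
        return sum(sim[p] for p in pairs) / len(pairs)

    while len(clusters) > n_clusters:
        # Merge the two most similar clusters.
        i, j = max(
            combinations(range(len(clusters)), 2),
            key=lambda ij: linkage(clusters[ij[0]], clusters[ij[1]]),
        )
        merged = sorted(clusters[i] + clusters[j])
        clusters = [c for k, c in enumerate(clusters) if k not in (i, j)] + [merged]
    return sorted(clusters)


# Toy label matrix (made up): rows are instances, columns are labels 0..3.
Y = [
    [1, 1, 0, 0],
    [1, 1, 0, 0],
    [0, 0, 1, 1],
    [0, 0, 1, 1],
    [1, 0, 0, 1],
]

print(cluster_labels(Y, n_clusters=2))  # a hybrid partition: [[0, 1], [2, 3]]
```

In this sketch, `n_clusters=1` puts every label in one cluster, which corresponds to the global approach, and `n_clusters` equal to the number of labels puts each label in its own cluster, which corresponds to the local approach. Intermediate values give the hybrid partitions the abstract describes, with one multi-label classifier trained per cluster.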
Pages: 1741-1785
Page count: 45