Towards Class-Imbalance Aware Multi-Label Learning

被引:47
|
作者
Zhang, Min-Ling [1 ,2 ]
Li, Yu-Kun [1 ,3 ]
Yang, Hao [1 ,2 ]
Liu, Xu-Ying [1 ,2 ]
机构
[1] Southeast Univ, Sch Comp Sci & Engn, Nanjing 210096, Peoples R China
[2] Southeast Univ, Minist Educ, Key Lab Comp Network & Informat Integrat, Nanjing, Peoples R China
[3] Baidu Inc, Business Grp Nat Language Proc, Beijing 100085, Peoples R China
基金
美国国家科学基金会;
关键词
Training; Correlation; Predictive models; Labeling; Task analysis; Couplings; Technological innovation; Class-imbalance; cross-coupling aggregation (COCOA); machine learning; multi-label learning; CLASSIFICATION; CLASSIFIERS; ENSEMBLE;
D O I
10.1109/TCYB.2020.3027509
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Multi-label learning deals with training examples each represented by a single instance while associated with multiple class labels. Due to the exponential number of possible label sets to be considered by the predictive model, it is commonly assumed that label correlations should be well exploited to design an effective multi-label learning approach. On the other hand, class-imbalance stands as an intrinsic property of multi-label data which significantly affects the generalization performance of the multi-label predictive model. For each class label, the number of training examples with positive labeling assignment is generally much less than those with negative labeling assignment. To deal with the class-imbalance issue for multi-label learning, a simple yet effective class-imbalance aware learning strategy called cross-coupling aggregation (COCOA) is proposed in this article. Specifically, COCOA works by leveraging the exploitation of label correlations as well as the exploration of class-imbalance simultaneously. For each class label, a number of multiclass imbalance learners are induced by randomly coupling with other labels, whose predictions on the unseen instance are aggregated to determine the corresponding labeling relevancy. Extensive experiments on 18 benchmark datasets clearly validate the effectiveness of COCOA against state-of-the-art multi-label learning approaches especially in terms of imbalance-specific evaluation metrics.
引用
收藏
页码:4459 / 4471
页数:13
相关论文
共 50 条
  • [21] Multi-label Learning with Incomplete Class Assignments
    Bucak, Serhat Selcuk
    Jin, Rong
    Jain, Anil K.
    2011 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2011,
  • [22] Multi-fairness Under Class-Imbalance
    Roy, Arjun
    Iosifidis, Vasileios
    Ntoutsi, Eirini
    DISCOVERY SCIENCE (DS 2022), 2022, 13601 : 286 - 301
  • [23] Measuring the class-imbalance extent of multi-class problems
    Ortigosa-Hernandez, Jonathan
    Inza, Inaki
    Lozano, Jose A.
    PATTERN RECOGNITION LETTERS, 2017, 98 : 32 - 38
  • [24] An Empirical Study for Class Imbalance in Extreme Multi-label Text Classification
    Han, Sangwoo
    Lim, Chan
    Cha, Bonggeon
    Lee, Jongwuk
    2021 IEEE INTERNATIONAL CONFERENCE ON BIG DATA AND SMART COMPUTING (BIGCOMP 2021), 2021, : 338 - 341
  • [25] Adaptive Sampling with Optimal Cost for Class-Imbalance Learning
    Peng, Yuxin
    PROCEEDINGS OF THE TWENTY-NINTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2015, : 2921 - 2927
  • [26] Generating Counterfactual Instances for Explainable Class-Imbalance Learning
    Chen, Zhi
    Duan, Jiang
    Kang, Li
    Xu, Hongyan
    Chen, Rui
    Qiu, Guoping
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2024, 36 (03) : 1130 - 1144
  • [27] Online Anomaly Detection via Class-Imbalance Learning
    Maurya, Chandresh Kumar
    Toshniwal, Durga
    Venkoparao, Gopalan Vijendran
    2015 EIGHTH INTERNATIONAL CONFERENCE ON CONTEMPORARY COMPUTING (IC3), 2015, : 30 - 35
  • [28] Discovering Latent Class Labels for Multi-Label Learning
    Huang, Jun
    Xu, Linchuan
    Wang, Jing
    Feng, Lei
    Yamanishi, Kenji
    PROCEEDINGS OF THE TWENTY-NINTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2020, : 3058 - 3064
  • [29] Multi-label sampling based on local label imbalance
    Liu, Bin
    Blekas, Konstantinos
    Tsoumakas, Grigorios
    PATTERN RECOGNITION, 2022, 122
  • [30] Towards Interpretable Deep Extreme Multi-label Learning
    Kang, Yihuang
    Cheng, I-Ling
    Mao, Wenjui
    Kuo, Bowen
    Lee, Pei-Ju
    2019 IEEE 20TH INTERNATIONAL CONFERENCE ON INFORMATION REUSE AND INTEGRATION FOR DATA SCIENCE (IRI 2019), 2019, : 69 - 74