Residual diverse ensemble for long-tailed multi-label text classification

被引:1
|
作者
Shi, Jiangxin [1 ,2 ]
Wei, Tong [3 ,4 ]
Li, Yufeng [1 ,2 ]
机构
[1] Nanjing Univ, Natl Key Lab Novel Software Technol, Nanjing 210023, Peoples R China
[2] Nanjing Univ, Sch Artificial Intelligence, Nanjing 210023, Peoples R China
[3] Southeast Univ, Sch Comp Sci & Engn, Nanjing 210096, Peoples R China
[4] Southeast Univ, Key Lab Comp Network & Informat Integrat, Minist Educ, Nanjing 210096, Peoples R China
基金
国家重点研发计划; 中国国家自然科学基金;
关键词
multi-label learning; extreme multi-label learning; long-tailed distribution; multi-label text classification; ensemble learning;
D O I
10.1007/s11432-022-3915-6
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Long-tailed multi-label text classification aims to identify a subset of relevant labels from a large candidate label set, where the training datasets usually follow long-tailed label distributions. Many of the previous studies have treated head and tail labels equally, resulting in unsatisfactory performance for identifying tail labels. To address this issue, this paper proposes a novel learning method that combines arbitrary models with two steps. The first step is the "diverse ensemble" that encourages diverse predictions among multiple shallow classifiers, particularly on tail labels, and can improve the generalization of tail labels. The second is the "error correction" that takes advantage of accurate predictions on head labels by the base model and approximates its residual errors for tail labels. Thus, it enables the "diverse ensemble" to focus on optimizing the tail label performance. This overall procedure is called residual diverse ensemble (RDE). RDE is implemented via a single-hidden-layer perceptron and can be used for scaling up to hundreds of thousands of labels. We empirically show that RDE consistently improves many existing models with considerable performance gains on benchmark datasets, especially with respect to the propensity-scored evaluation metrics. Moreover, RDE converges in less than 30 training epochs without increasing the computational overhead.
引用
收藏
页数:14
相关论文
共 50 条
  • [1] Residual diverse ensemble for long-tailed multi-label text classification
    Jiangxin SHI
    Tong WEI
    Yufeng LI
    Science China(Information Sciences), 2024, 67 (11) : 92 - 105
  • [2] Does Head Label Help for Long-Tailed Multi-Label Text Classification
    Xiao, Lin
    Zhang, Xiangliang
    Jing, Liping
    Huang, Chi
    Song, Mingyang
    THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 14103 - 14111
  • [3] Exploring Contrastive Learning for Long-Tailed Multi-label Text Classification
    Audibert, Alexandre
    Gauffre, Aurelien
    Amini, Massih-Reza
    MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES: RESEARCH TRACK, PT VII, ECML PKDD 2024, 2024, 14947 : 245 - 261
  • [4] Label-Specific Feature Augmentation for Long-Tailed Multi-Label Text Classification
    Xu, Pengyu
    Xiao, Lin
    Liu, Bing
    Lu, Sijin
    Jing, Liping
    Yu, Jian
    THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 9, 2023, : 10602 - 10610
  • [5] Balancing Methods for Multi-label Text Classification with Long-Tailed Class Distribution
    Huang, Yi
    Giledereli, Buse
    Koksal, Abdullatif
    Ozgur, Arzucan
    Ozkirimli, Elif
    2021 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2021), 2021, : 8153 - 8161
  • [6] Long-tailed Extreme Multi-label Text Classification by the Retrieval of Generated Pseudo Label Descriptions
    Zhang, Ruohong
    Wang, Yau-Shian
    Yang, Yiming
    Yu, Donghan
    Vu, Tom
    Lei, Likun
    17TH CONFERENCE OF THE EUROPEAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, EACL 2023, 2023, : 1092 - 1106
  • [7] Triple Alliance Prototype Orthotist Network for Long-Tailed Multi-Label Text Classification
    Xiao, Lin
    Xu, Pengyu
    Song, Mingyang
    Liu, Huafeng
    Jing, Liping
    Zhang, Xiangliang
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2023, 31 : 2616 - 2628
  • [8] Distributionally Robust Loss for Long-Tailed Multi-label Image Classification
    Lin, Dekun
    Peng, Tailai
    Chen, Rui
    Xie, Xinran
    Qin, Xiaolin
    Cui, Zhe
    COMPUTER VISION - ECCV 2024, PT XXXIII, 2025, 15091 : 417 - 433
  • [9] Probability Guided Loss for Long-Tailed Multi-Label Image Classification
    Lin, Dekun
    THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 2, 2023, : 1577 - 1585
  • [10] Effect of Stage Training for Long-Tailed Multi-Label Image Classification
    Yamagishi, Yosuke
    Hanaoka, Shohei
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS, ICCVW, 2023, : 2713 - 2720