CascadeXML: Rethinking Transformers for End-to-end Multi-resolution Training in Extreme Multi-label Classification

被引:0
|
作者
Kharbanda, Siddhant [1 ]
Banerjee, Atmadeep [1 ]
Schultheis, Erik [1 ]
Babbar, Rohit [1 ]
机构
[1] Aalto Univ, Dept Comp Sci, Espoo, Finland
基金
芬兰科学院;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Extreme Multi-label Text Classification (XMC) involves learning a classifier that can assign an input with a subset of most relevant labels from millions of label choices. Recent approaches, such as XR-Transformer and LightXML, leverage a transformer instance to achieve state-of-the-art performance. However, in this process, these approaches need to make various trade-offs between performance and computational requirements. A major shortcoming, as compared to the Bi-LSTM based AttentionXML, is that they fail to keep separate feature representations for each resolution in a label tree. We thus propose CascadeXML, an end-to-end multi-resolution learning pipeline, which can harness the multi-layered architecture of a transformer model for attending to different label resolutions with separate feature representations. CascadeXML significantly outperforms all existing approaches with non-trivial gains obtained on benchmark datasets consisting of up to three million labels. Code for CascadeXML will be made publicly available at https://github.com/xmc-aalto/cascadexml.
引用
下载
收藏
页数:14
相关论文
共 50 条
  • [41] Long-tail Mixup for Extreme Multi-label Classification
    Han, Sangwoo
    Choi, Eunseong
    Lim, Chan
    Shim, Hyunjung
    Lee, Jongwuk
    PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, CIKM 2022, 2022, : 3998 - 4002
  • [42] Ranking-Based Autoencoder for Extreme Multi-label Classification
    Wang, Bingyu
    Chen, Li
    Sun, Wei
    Qin, Kechen
    Li, Kefeng
    Zhou, Hui
    2019 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL HLT 2019), VOL. 1, 2019, : 2820 - 2830
  • [43] Multi-Label Classification Method Based on Extreme Learning Machines
    Venkatesan, Rajasekar
    Er, Meng Joo
    2014 13TH INTERNATIONAL CONFERENCE ON CONTROL AUTOMATION ROBOTICS & VISION (ICARCV), 2014, : 619 - 624
  • [44] Extreme Multi-Label Text Classification Based on Balance Function
    Chen, Zhaohong
    Hong, Zhiyong
    Yu, Wenhua
    Zhang, Xin
    Computer Engineering and Applications, 2024, 60 (04) : 163 - 172
  • [45] Bonsai: diverse and shallow trees for extreme multi-label classification
    Sujay Khandagale
    Han Xiao
    Rohit Babbar
    Machine Learning, 2020, 109 : 2099 - 2119
  • [46] A Comparative Analysis on Various Extreme Multi-Label Classification Algorithms
    Kumar, Puneet
    Dubey, Vikash Kumar
    Showrov, Md Imran Hossain
    2019 4TH INTERNATIONAL CONFERENCE ON ELECTRICAL, ELECTRONICS, COMMUNICATION, COMPUTER TECHNOLOGIES AND OPTIMIZATION TECHNIQUES (ICEECCOT), 2019, : 265 - 268
  • [47] Bonsai: diverse and shallow trees for extreme multi-label classification
    Khandagale, Sujay
    Xiao, Han
    Babbar, Rohit
    MACHINE LEARNING, 2020, 109 (11) : 2099 - 2119
  • [48] Visual Transformers with Primal Object Queries for Multi-Label Image Classification
    Yazici, Vacit Oguz
    Van De Weijer, Joost
    Yu, Longlong
    2022 26TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2022, : 3014 - 3020
  • [49] Multi-label dimensionality reduction and classification with extreme learning machines
    Feng, Lin
    Wang, Jing
    Liu, Shenglan
    Xiao, Yao
    JOURNAL OF SYSTEMS ENGINEERING AND ELECTRONICS, 2014, 25 (03) : 502 - 513
  • [50] DiSMEC - Distributed Sparse Machines for Extreme Multi-label Classification
    Babbar, Rohit
    Schoelkopf, Bernhard
    WSDM'17: PROCEEDINGS OF THE TENTH ACM INTERNATIONAL CONFERENCE ON WEB SEARCH AND DATA MINING, 2017, : 721 - 729