CascadeXML: Rethinking Transformers for End-to-end Multi-resolution Training in Extreme Multi-label Classification

被引:0
|
作者
Kharbanda, Siddhant [1 ]
Banerjee, Atmadeep [1 ]
Schultheis, Erik [1 ]
Babbar, Rohit [1 ]
机构
[1] Aalto Univ, Dept Comp Sci, Espoo, Finland
基金
芬兰科学院;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Extreme Multi-label Text Classification (XMC) involves learning a classifier that can assign an input with a subset of most relevant labels from millions of label choices. Recent approaches, such as XR-Transformer and LightXML, leverage a transformer instance to achieve state-of-the-art performance. However, in this process, these approaches need to make various trade-offs between performance and computational requirements. A major shortcoming, as compared to the Bi-LSTM based AttentionXML, is that they fail to keep separate feature representations for each resolution in a label tree. We thus propose CascadeXML, an end-to-end multi-resolution learning pipeline, which can harness the multi-layered architecture of a transformer model for attending to different label resolutions with separate feature representations. CascadeXML significantly outperforms all existing approaches with non-trivial gains obtained on benchmark datasets consisting of up to three million labels. Code for CascadeXML will be made publicly available at https://github.com/xmc-aalto/cascadexml.
引用
下载
收藏
页数:14
相关论文
共 50 条
  • [1] End-to-End Probabilistic Label-Specific Feature Learning for Multi-Label Classification
    Hang, Jun-Yi
    Zhang, Min-Ling
    Feng, Yanghe
    Song, Xiaocheng
    THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / THE TWELVETH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022, : 6847 - 6855
  • [2] Fast Multi-Resolution Transformer Fine-tuning for Extreme Multi-label Text Classification
    Zhang, Jiong
    Chang, Wei-cheng
    Yu, Hsiang-fu
    Dhillon, Inderjit S.
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
  • [3] An end-to-end multi-resolution feature fusion defogging network
    Ping Xue
    ShiXiong Deng
    Signal, Image and Video Processing, 2023, 17 : 4189 - 4197
  • [4] An end-to-end multi-resolution feature fusion defogging network
    Xue, Ping
    Deng, ShiXiong
    SIGNAL IMAGE AND VIDEO PROCESSING, 2023, 17 (08) : 4189 - 4197
  • [5] Vertebrae Labeling via End-to-End Integral Regression Localization and Multi-Label Classification Network
    Qin, Chunli
    Zhou, Ji
    Yao, Demin
    Zhuang, Han
    Wang, Hui
    Chen, Shiyao
    Shi, Yonghong
    Song, Zhijian
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2022, 33 (06) : 2726 - 2736
  • [6] Taming Pretrained Transformers for Extreme Multi-label Text Classification
    Chang, Wei-Cheng
    Yu, Hsiang-Fu
    Zhong, Kai
    Yang, Yiming
    Dhillon, Inderjit S.
    KDD '20: PROCEEDINGS OF THE 26TH ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY & DATA MINING, 2020, : 3163 - 3171
  • [7] Focal Modulation Based End-to-End Multi-Label Classification for Chest X-ray Image Classification
    Ozturk, Saban
    Cukur, Tolga
    2023 31ST SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE, SIU, 2023,
  • [8] An End-to-End Multi-label classification model for Arrhythmia based on varied-length ECG signals
    Dong, Yanfang
    Cai, Wenqiang
    Zhu, Wenliang
    Wang, Lirong
    2022 IEEE INTERNATIONAL CONFERENCE ON TRUST, SECURITY AND PRIVACY IN COMPUTING AND COMMUNICATIONS, TRUSTCOM, 2022, : 1149 - 1154
  • [9] An End-to-End Deep Learning Pipeline for Emphysema Quantification Using Multi-label Learning
    Negahdar, Mohammadreza
    Coy, Adam
    Beymer, David
    2019 41ST ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY (EMBC), 2019, : 929 - 932
  • [10] General Multi-label Image Classification with Transformers
    Lanchantin, Jack
    Wang, Tianlu
    Ordonez, Vicente
    Qi, Yanjun
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 16473 - 16483