CascadeXML: Rethinking Transformers for End-to-end Multi-resolution Training in Extreme Multi-label Classification

被引:0
|
作者
Kharbanda, Siddhant [1 ]
Banerjee, Atmadeep [1 ]
Schultheis, Erik [1 ]
Babbar, Rohit [1 ]
机构
[1] Aalto Univ, Dept Comp Sci, Espoo, Finland
基金
芬兰科学院;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Extreme Multi-label Text Classification (XMC) involves learning a classifier that can assign an input with a subset of most relevant labels from millions of label choices. Recent approaches, such as XR-Transformer and LightXML, leverage a transformer instance to achieve state-of-the-art performance. However, in this process, these approaches need to make various trade-offs between performance and computational requirements. A major shortcoming, as compared to the Bi-LSTM based AttentionXML, is that they fail to keep separate feature representations for each resolution in a label tree. We thus propose CascadeXML, an end-to-end multi-resolution learning pipeline, which can harness the multi-layered architecture of a transformer model for attending to different label resolutions with separate feature representations. CascadeXML significantly outperforms all existing approaches with non-trivial gains obtained on benchmark datasets consisting of up to three million labels. Code for CascadeXML will be made publicly available at https://github.com/xmc-aalto/cascadexml.
引用
下载
收藏
页数:14
相关论文
共 50 条
  • [21] End-to-End Emotion-Cause Pair Extraction based on Sliding Window Multi-Label Learning
    Ding, Zixiang
    Rui Xia
    Yu, Jianfei
    PROCEEDINGS OF THE 2020 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP), 2020, : 3574 - 3583
  • [22] Deep Learning with a Rethinking Structure for Multi-label Classification
    Yang, Yao-Yuan
    Lin, Yi-An
    Chu, Hong-Min
    Lin, Hsuan-Tien
    ASIAN CONFERENCE ON MACHINE LEARNING, VOL 101, 2019, 101 : 125 - 140
  • [23] Exploiting Label Dependencies for Multi-Label Document Classification Using Transformers
    Fallah, Haytame
    Bruno, Emmanuel
    Bellot, Patrice
    Murisasco, Elisabeth
    PROCEEDINGS OF THE 2023 ACM SYMPOSIUM ON DOCUMENT ENGINEERING, DOCENG 2023, 2023,
  • [24] Multi-task multi-resolution char-to-BPE cross-attention decoder for end-to-end speech recognition
    Gowda, Dhananjaya
    Garg, Abhinav
    Kim, Kwangyoun
    Kumar, Mehul
    Kim, Chanwoo
    INTERSPEECH 2019, 2019, : 2783 - 2787
  • [25] CNN BASED TWO-STAGE MULTI-RESOLUTION END-TO-END MODEL FOR SINGING MELODY EXTRACTION
    Chen, Ming-Tso
    Li, Bo-Jun
    Chi, Tai-Shih
    2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 1005 - 1009
  • [26] Deep Learning for Extreme Multi-label Text Classification
    Liu, Jingzhou
    Chang, Wei-Cheng
    Wu, Yuexin
    Yang, Yiming
    SIGIR'17: PROCEEDINGS OF THE 40TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 2017, : 115 - 124
  • [27] Data scarcity, robustness and extreme multi-label classification
    Rohit Babbar
    Bernhard Schölkopf
    Machine Learning, 2019, 108 : 1329 - 1351
  • [28] Correlation Networks for Extreme Multi-label Text Classification
    Xun, Guangxu
    Jha, Kishlay
    Sun, Jianhui
    Zhang, Aidong
    KDD '20: PROCEEDINGS OF THE 26TH ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY & DATA MINING, 2020, : 1074 - 1082
  • [29] Sparse Local Embeddings for Extreme Multi-label Classification
    Bhatia, Kush
    Jain, Himanshu
    Kar, Purushottam
    Varma, Manik
    Jain, Prateek
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 28 (NIPS 2015), 2015, 28
  • [30] Data scarcity, robustness and extreme multi-label classification
    Babbar, Rohit
    Schoelkopf, Bernhard
    MACHINE LEARNING, 2019, 108 (8-9) : 1329 - 1351