CascadeXML: Rethinking Transformers for End-to-end Multi-resolution Training in Extreme Multi-label Classification

被引:0
|
作者
Kharbanda, Siddhant [1 ]
Banerjee, Atmadeep [1 ]
Schultheis, Erik [1 ]
Babbar, Rohit [1 ]
机构
[1] Aalto Univ, Dept Comp Sci, Espoo, Finland
基金
芬兰科学院;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Extreme Multi-label Text Classification (XMC) involves learning a classifier that can assign an input with a subset of most relevant labels from millions of label choices. Recent approaches, such as XR-Transformer and LightXML, leverage a transformer instance to achieve state-of-the-art performance. However, in this process, these approaches need to make various trade-offs between performance and computational requirements. A major shortcoming, as compared to the Bi-LSTM based AttentionXML, is that they fail to keep separate feature representations for each resolution in a label tree. We thus propose CascadeXML, an end-to-end multi-resolution learning pipeline, which can harness the multi-layered architecture of a transformer model for attending to different label resolutions with separate feature representations. CascadeXML significantly outperforms all existing approaches with non-trivial gains obtained on benchmark datasets consisting of up to three million labels. Code for CascadeXML will be made publicly available at https://github.com/xmc-aalto/cascadexml.
引用
下载
收藏
页数:14
相关论文
共 50 条
  • [31] Transformers for Multi-label Classification of Medical Text: An Empirical Comparison
    Yogarajan, Vithya
    Montiel, Jacob
    Smith, Tony
    Pfahringer, Bernhard
    ARTIFICIAL INTELLIGENCE IN MEDICINE (AIME 2021), 2021, : 114 - 123
  • [32] Exploring Transformers for Multi-Label Classification of Java']Java Vulnerabilities
    Mamede, Claudia
    Pinconschi, Eduard
    Abreu, Rui
    Campos, Jose
    2022 IEEE 22ND INTERNATIONAL CONFERENCE ON SOFTWARE QUALITY, RELIABILITY AND SECURITY, QRS, 2022, : 43 - 52
  • [33] End-to-End Multi-Perspective Matching for Entity Resolution
    Fu, Cheng
    Han, Xianpei
    Sun, Le
    Chen, Bo
    Zhang, Wei
    Wu, Suhui
    Kong, Hao
    PROCEEDINGS OF THE TWENTY-EIGHTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2019, : 4961 - 4967
  • [34] Cluster-Guided Label Generation in Extreme Multi-Label Classification
    Jung, Taehee
    Kim, Joo-Kyung
    Lee, Sungjin
    Kang, Dongyeop
    17TH CONFERENCE OF THE EUROPEAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, EACL 2023, 2023, : 1670 - 1685
  • [35] Multi-task and multi-view training for end-to-end relation extraction
    Zhang, Junchi
    Zhang, Yue
    Ji, Donghong
    Liu, Mengchi
    NEUROCOMPUTING, 2019, 364 : 245 - 253
  • [36] Rethinking Zero-shot Video Classification: End-to-end Training for Realistic Applications
    Brattoli, Biagio
    Tighe, Joseph
    Zhdanov, Fedor
    Perona, Pietro
    Chalupka, Krzysztof
    2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, : 4612 - 4622
  • [37] Multi-label dimensionality reduction and classification with extreme learning machines
    Lin Feng
    Jing Wang
    Shenglan Liu
    Yao Xiao
    Journal of Systems Engineering and Electronics, 2014, 25 (03) : 502 - 513
  • [38] Combining instance and feature neighbours for extreme multi-label classification
    Feremans, Len
    Cule, Boris
    Vens, Celine
    Goethals, Bart
    INTERNATIONAL JOURNAL OF DATA SCIENCE AND ANALYTICS, 2020, 10 (03) : 215 - 231
  • [39] Cold Start Thread Recommendation as Extreme Multi-label Classification
    Halder, Kishaloy
    Poddar, Lahari
    Kan, Min-Yen
    COMPANION PROCEEDINGS OF THE WORLD WIDE WEB CONFERENCE 2018 (WWW 2018), 2018, : 1911 - 1918
  • [40] Combining instance and feature neighbours for extreme multi-label classification
    Len Feremans
    Boris Cule
    Celine Vens
    Bart Goethals
    International Journal of Data Science and Analytics, 2020, 10 : 215 - 231