Taming Pretrained Transformers for Extreme Multi-label Text Classification

被引:116
|
作者
Chang, Wei-Cheng [1 ]
Yu, Hsiang-Fu [2 ]
Zhong, Kai [2 ]
Yang, Yiming [1 ]
Dhillon, Inderjit S. [2 ,3 ]
机构
[1] Carnegie Mellon Univ, Pittsburgh, PA 15213 USA
[2] Amazon, Bellevue, WA USA
[3] UT Austin, Austin, TX USA
关键词
Transformer models; eXtreme Multi-label text classification;
D O I
10.1145/3394486.3403368
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We consider the extreme multi-label text classification (XMC) problem: given an input text, return the most relevant labels from a large label collection. For example, the input text could be a product description on Amazon.com and the labels could be product categories. XMC is an important yet challenging problem in the NLP community. Recently, deep pretrained transformer models have achieved state-of-the-art performance on many NLP tasks including sentence classification, albeit with small label sets. However, naively applying deep transformer models to the XMC problem leads to sub-optimal performance due to the large output space and the label sparsity issue. In this paper, we propose X-Transformer, the first scalable approach to fine-tuning deep transformer models for the XMC problem. The proposed method achieves new state-of-the-art results on four XMC benchmark datasets. In particular, on a Wiki dataset with around 0.5 million labels, the prec@1 of X-Transformer is 77.28%, a substantial improvement over state-of-the-art XMC approaches Parabel (linear) and AttentionXML (neural), which achieve 68.70% and 76.95% precision@1, respectively. We further apply X-Transformer to a product2query dataset from Amazon and gained 10.7% relative improvement on prec@1 over Parabel.
引用
收藏
页码:3163 / 3171
页数:9
相关论文
共 50 条
  • [1] Deep Learning for Extreme Multi-label Text Classification
    Liu, Jingzhou
    Chang, Wei-Cheng
    Wu, Yuexin
    Yang, Yiming
    [J]. SIGIR'17: PROCEEDINGS OF THE 40TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 2017, : 115 - 124
  • [2] Correlation Networks for Extreme Multi-label Text Classification
    Xun, Guangxu
    Jha, Kishlay
    Sun, Jianhui
    Zhang, Aidong
    [J]. KDD '20: PROCEEDINGS OF THE 26TH ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY & DATA MINING, 2020, : 1074 - 1082
  • [3] Transformers for Multi-label Classification of Medical Text: An Empirical Comparison
    Yogarajan, Vithya
    Montiel, Jacob
    Smith, Tony
    Pfahringer, Bernhard
    [J]. ARTIFICIAL INTELLIGENCE IN MEDICINE (AIME 2021), 2021, : 114 - 123
  • [4] MatchXML: An Efficient Text-Label Matching Framework for Extreme Multi-Label Text Classification
    Ye, Hui
    Sunderraman, Rajshekhar
    Ji, Shihao
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2024, 36 (09) : 4781 - 4793
  • [5] An Empirical Study for Class Imbalance in Extreme Multi-label Text Classification
    Han, Sangwoo
    Lim, Chan
    Cha, Bonggeon
    Lee, Jongwuk
    [J]. 2021 IEEE INTERNATIONAL CONFERENCE ON BIG DATA AND SMART COMPUTING (BIGCOMP 2021), 2021, : 338 - 341
  • [6] Deep Learning Method with Attention for Extreme Multi-label Text Classification
    Chen, Si
    Wang, Liangguo
    Li, Wan
    Zhang, Kun
    [J]. PRICAI 2019: TRENDS IN ARTIFICIAL INTELLIGENCE, PT III, 2019, 11672 : 179 - 190
  • [7] Deep neural network for hierarchical extreme multi-label text classification
    Gargiulo, Francesco
    Silvestri, Stefano
    Ciampi, Mario
    De Pietro, Giuseppe
    [J]. APPLIED SOFT COMPUTING, 2019, 79 : 125 - 138
  • [8] Label prompt for multi-label text classification
    Song, Rui
    Liu, Zelong
    Chen, Xingbing
    An, Haining
    Zhang, Zhiqi
    Wang, Xiaoguang
    Xu, Hao
    [J]. APPLIED INTELLIGENCE, 2023, 53 (08) : 8761 - 8775
  • [9] Label prompt for multi-label text classification
    Rui Song
    Zelong Liu
    Xingbing Chen
    Haining An
    Zhiqi Zhang
    Xiaoguang Wang
    Hao Xu
    [J]. Applied Intelligence, 2023, 53 : 8761 - 8775
  • [10] TLC-XML: Transformer with Label Correlation for Extreme Multi-label Text Classification
    Zhao, Fei
    Ai, Qing
    Li, Xiangna
    Wang, Wenhui
    Gao, Qingyun
    Liu, Yichun
    [J]. NEURAL PROCESSING LETTERS, 2024, 56 (01)