TLC-XML: Transformer with Label Correlation for Extreme Multi-label Text Classification

被引:1
|
作者
Zhao, Fei [1 ]
Ai, Qing [1 ]
Li, Xiangna [2 ]
Wang, Wenhui [3 ,4 ]
Gao, Qingyun [1 ]
Liu, Yichun [1 ]
机构
[1] Univ Sci & Technol Liaoning, Sch Comp Sci & Software Engn, Anshan 114051, Peoples R China
[2] State Grid Corp China, State Grid Informat & Telecommun Grp Co Ltd, Beijing 100053, Peoples R China
[3] Chinese Acad Sci, Beijing Synchrotron Radiat Facil, Beijing 100049, Peoples R China
[4] Chinese Acad Sci, Chinese Spallat Neutron Source Sci Ctr, Dongguan 523808, Peoples R China
关键词
Extreme multi-label text classification; Label correlation; Graph convolutional network; Transformer model;
D O I
10.1007/s11063-024-11460-z
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Extreme multi-label text classification (XMTC) annotates related labels for unknown text from large-scale label sets. Transformer-based methods have become the dominant approach for solving the XMTC task due to their effective text representation capabilities. However, the existing Transformer-based methods fail to effectively exploit the correlation between labels in the XMTC task. To address this shortcoming, we propose a novel model called TLC-XML, i.e., a Transformer with label correlation for extreme multi-label text classification. TLC-XML comprises three modules: Partition, Matcher and Ranker. In the Partition module, we exploit the semantic and co-occurrence information of labels to construct the label correlation graph, and further partition the strongly correlated labels into the same cluster. In the Matcher module, we propose cluster correlation learning, which uses the graph convolutional network (GCN) to extract the correlation between clusters. We then introduce these valuable correlations into the classifier to match related clusters. In the Ranker module, we propose label interaction learning, which aggregates the raw label prediction with the information of the neighboring labels. The experimental results on benchmark datasets show that TLC-XML significantly outperforms state-of-the-art XMTC methods.
引用
收藏
页数:21
相关论文
共 50 条
  • [1] TLC-XML: Transformer with Label Correlation for Extreme Multi-label Text Classification
    Fei Zhao
    Qing Ai
    Xiangna Li
    Wenhui Wang
    Qingyun Gao
    Yichun Liu
    [J]. Neural Processing Letters, 56
  • [2] Correlation Networks for Extreme Multi-label Text Classification
    Xun, Guangxu
    Jha, Kishlay
    Sun, Jianhui
    Zhang, Aidong
    [J]. KDD '20: PROCEEDINGS OF THE 26TH ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY & DATA MINING, 2020, : 1074 - 1082
  • [3] Deep Learning for Extreme Multi-label Text Classification
    Liu, Jingzhou
    Chang, Wei-Cheng
    Wu, Yuexin
    Yang, Yiming
    [J]. SIGIR'17: PROCEEDINGS OF THE 40TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 2017, : 115 - 124
  • [4] Multi-label text classification based on the label correlation mixture model
    He, Zhiyang
    Wu, Ji
    Lv, Ping
    [J]. INTELLIGENT DATA ANALYSIS, 2017, 21 (06) : 1371 - 1392
  • [5] Label prompt for multi-label text classification
    Song, Rui
    Liu, Zelong
    Chen, Xingbing
    An, Haining
    Zhang, Zhiqi
    Wang, Xiaoguang
    Xu, Hao
    [J]. APPLIED INTELLIGENCE, 2023, 53 (08) : 8761 - 8775
  • [6] Label prompt for multi-label text classification
    Rui Song
    Zelong Liu
    Xingbing Chen
    Haining An
    Zhiqi Zhang
    Xiaoguang Wang
    Hao Xu
    [J]. Applied Intelligence, 2023, 53 : 8761 - 8775
  • [7] MatchXML: An Efficient Text-Label Matching Framework for Extreme Multi-Label Text Classification
    Ye, Hui
    Sunderraman, Rajshekhar
    Ji, Shihao
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2024, 36 (09) : 4781 - 4793
  • [8] BGNN-XML: Bilateral Graph Neural Networks for Extreme Multi-Label Text Classification
    Zong, Daoming
    Sun, Shiliang
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2023, 35 (07) : 6698 - 6709
  • [9] Label Correlation Based Graph Convolutional Network for Multi-label Text Classification
    Huy-The Vu
    Minh-Tien Nguyen
    Van-Chien Nguyen
    Manh-Tran Tien
    Van-Hau Nguyen
    [J]. 2022 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2022,
  • [10] Taming Pretrained Transformers for Extreme Multi-label Text Classification
    Chang, Wei-Cheng
    Yu, Hsiang-Fu
    Zhong, Kai
    Yang, Yiming
    Dhillon, Inderjit S.
    [J]. KDD '20: PROCEEDINGS OF THE 26TH ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY & DATA MINING, 2020, : 3163 - 3171