Multi-label sequence generating model via label semantic attention mechanism

被引:2
|
作者
Zhang, Xiuling [1 ]
Tan, Xiaofei [1 ]
Luo, Zhaoci [1 ]
Zhao, Jun [1 ]
机构
[1] Yanshan Univ, Engn Res Ctr, Minist Educ Intelligent Control Syst & Intelligen, Qinhuangdao 066000, Hebei, Peoples R China
关键词
Multi-label text classification; Seq2Seq; Label semantic attention mechanism; Policy gradient; NEURAL-NETWORKS; TEXT;
D O I
10.1007/s13042-022-01722-4
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In recent years, a new attempt has been made to capture label co-occurrence by applying the sequence-to-sequence (Seq2Seq) model to multi-label text classification (MLTC). However, existing approaches frequently ignore the semantic information contained in the labels themselves. Besides, the Seq2Seq model is susceptible to the negative impact of label sequence order. Furthermore, it has been demonstrated that the traditional attention mechanism underperforms in MLTC. Therefore, we propose a novel Seq2Seq model with a different label semantic attention mechanism (S2S-LSAM), which generates fused information containing label and text information through the interaction of label semantics and text features in the label semantic attention mechanism. With the fused information, our model can select the text features that are most relevant to the labels more effectively. A combination of the cross-entropy loss function and the policy gradient-based loss function is employed to reduce the label sequence order effect. The experiments show that our model outperforms the baseline models.
引用
收藏
页码:1711 / 1723
页数:13
相关论文
共 50 条
  • [31] Weakly Supervised Multi-Label Learning via Label Enhancement
    Lv, Jia-Qi
    Xu, Ning
    Zheng, Ren-Yi
    Geng, Xin
    PROCEEDINGS OF THE TWENTY-EIGHTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2019, : 3101 - 3107
  • [32] Partial Multi-Label Learning via Credible Label Elicitation
    Zhang, Min-Ling
    Fang, Jun-Peng
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2021, 43 (10) : 3587 - 3599
  • [33] Partial Multi-Label Learning via Credible Label Elicitation
    Fang, Jun-Peng
    Zhang, Min-Ling
    THIRTY-THIRD AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FIRST INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / NINTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2019, : 3518 - 3525
  • [34] All is attention for multi-label text classification
    Liu, Zhi
    Huang, Yunjie
    Xia, Xincheng
    Zhang, Yihao
    KNOWLEDGE AND INFORMATION SYSTEMS, 2025, 67 (02) : 1249 - 1270
  • [35] Visual Attention in Multi-Label Image Classification
    Luo, Yan
    Jiang, Ming
    Zhao, Qi
    2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW 2019), 2019, : 820 - 827
  • [36] Double Attention for Multi-Label Image Classification
    Zhao, Haiying
    Zhou, Wei
    Hou, Xiaogang
    Zhu, Hui
    IEEE ACCESS, 2020, 8 : 225539 - 225550
  • [37] Label-Aware Document Representation via Hybrid Attention for Extreme Multi-Label Text Classification
    Huang, Xin
    Chen, Boli
    Xiao, Lin
    Yu, Jian
    Jing, Liping
    NEURAL PROCESSING LETTERS, 2022, 54 (05) : 3601 - 3617
  • [38] Label-Aware Document Representation via Hybrid Attention for Extreme Multi-Label Text Classification
    Xin Huang
    Boli Chen
    Lin Xiao
    Jian Yu
    Liping Jing
    Neural Processing Letters, 2022, 54 : 3601 - 3617
  • [39] Multi-Label Text Classification Model Based on Multi-Level Constraint Augmentation and Label Association Attention
    Wei, Xiao
    Huang, Jianbao
    Zhao, Rui
    Yu, Hang
    Xu, Zheng
    ACM TRANSACTIONS ON ASIAN AND LOW-RESOURCE LANGUAGE INFORMATION PROCESSING, 2024, 23 (01)
  • [40] A multi-scale semantic attention representation for multi-label image recognition with graph networks
    Liang, Jun
    Xu, Feiteng
    Yu, Songsen
    NEUROCOMPUTING, 2022, 491 : 14 - 23