Hierarchical Gated Convolutional Networks with Multi-Head Attention for Text Classification

被引:0
|
作者
Du, Haizhou [1 ]
Qian, Jingu [1 ]
机构
[1] Shanghai Univ Elect Power, Sch Comp Sci & Technol, Shanghai 200090, Peoples R China
关键词
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Text classification is a fundamental problem in natural language processing. Recently, neural network models have been demonstrated to be capable of achieving remarkable performance in this domain. However, none of existing method can achieve excellent classification accuracy while concerning of computational cost. To solve this problem, we proposed hierarchical gated convolutional networks with multi-head attention which reduces computational cost through its two distinctive characteristics to save considerable model parameters. First, it has a hierarchical structure the same as the hierarchical structure of documents that has word-level and sentence-level, which not only benefits to classification performance but also reduces computational cost significantly by reusing parameters of the model in each sentence. Second, we apply gated convolutional network on both levels that enables our model achieved comparable performance to very deep networks with relatively shallow network depth. To further improve the performance of our model, multi-head attention mechanism is employed to differentiate more or less importance of words or sentences for better construction of document representation. Experiments conducted on the commonly used Yelp reviews datasets demonstrate that the proposed architecture obtains competitive performance against the state-of-the-art methods.
引用
收藏
页码:1170 / 1175
页数:6
相关论文
共 50 条
  • [1] Graph convolutional networks with hierarchical multi-head attention for aspect-level sentiment classification
    Li, Xiaowen
    Lu, Ran
    Liu, Peiyu
    Zhu, Zhenfang
    [J]. JOURNAL OF SUPERCOMPUTING, 2022, 78 (13): : 14846 - 14865
  • [2] Graph convolutional networks with hierarchical multi-head attention for aspect-level sentiment classification
    Xiaowen Li
    Ran Lu
    Peiyu Liu
    Zhenfang Zhu
    [J]. The Journal of Supercomputing, 2022, 78 : 14846 - 14865
  • [3] Hierarchical Hybrid Neural Networks With Multi-Head Attention for Document Classification
    Huang, Weihao
    Chen, Jiaojiao
    Cai, Qianhua
    Liu, Xuejie
    Zhang, Yudong
    Hu, Xiaohui
    [J]. INTERNATIONAL JOURNAL OF DATA WAREHOUSING AND MINING, 2022, 18 (01)
  • [4] Hierarchical Convolutional Attention Networks for Text Classification
    Gao, Shang
    Ramanathan, Arvind
    Tourassi, Georgia
    [J]. REPRESENTATION LEARNING FOR NLP, 2018, : 11 - 23
  • [5] Multi-head self-attention based gated graph convolutional networks for aspect-based sentiment classification
    Luwei Xiao
    Xiaohui Hu
    Yinong Chen
    Yun Xue
    Bingliang Chen
    Donghong Gu
    Bixia Tang
    [J]. Multimedia Tools and Applications, 2022, 81 : 19051 - 19070
  • [6] Multi-head self-attention based gated graph convolutional networks for aspect-based sentiment classification
    Xiao, Luwei
    Hu, Xiaohui
    Chen, Yinong
    Xue, Yun
    Chen, Bingliang
    Gu, Donghong
    Tang, Bixia
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2022, 81 (14) : 19051 - 19070
  • [7] Multi-head Attention Induced Dynamic Hypergraph Convolutional Networks
    Peng, Xu
    Lin, Wei
    Jin, Taisong
    [J]. PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2023, PT IX, 2024, 14433 : 256 - 268
  • [8] Attention-Enhanced Graph Convolutional Networks for Aspect-Based Sentiment Classification with Multi-Head Attention
    Xu, Guangtao
    Liu, Peiyu
    Zhu, Zhenfang
    Liu, Jie
    Xu, Fuyong
    [J]. APPLIED SCIENCES-BASEL, 2021, 11 (08):
  • [9] An Ensemble of Text Convolutional Neural Networks and Multi-Head Attention Layers for Classifying Threats in Network Packets
    Kim, Hyeonmin
    Yoon, Young
    [J]. ELECTRONICS, 2023, 12 (20)
  • [10] Chinese Short Text Classification with Hybrid Features and Multi-Head Attention
    Jiang, Jielin
    Zhu, Yongwei
    Xu, Xiaolong
    Cui, Yan
    Zhao, Yingnan
    [J]. Computer Engineering and Applications, 60 (09): : 237 - 243