Hierarchical Multiple Granularity Attention Network for Long Document Classification

被引:1
|
作者
Hu, Yongli [1 ]
Ding, Wen [1 ]
Liu, Tengfei [1 ]
Gao, Junbin [2 ]
Sun, Yanfeng [1 ]
Yin, Baocai [1 ]
机构
[1] Beijing Univ Technol, Fac Informat Technol, Beijing Inst Artificial Intelligence, Beijing Key Lab Multimedia & Intelligent Software, Beijing 100124, Peoples R China
[2] Univ Sydney, Sch Business, Discipline Business Analyt, Sydney, NSW, Australia
基金
中国国家自然科学基金;
关键词
D O I
10.1109/IJCNN55064.2022.9892046
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Long document classification has aroused tremendous attention in the field of Nature Language Processing, due to the exponential increasing of publications. Although the common text classification methods can be extended for long document classification, they are confined to the length of text and do not have enough expressiveness to model the structure of the long document. To solve these problems, we proposed a hierarchical multiple granularity attention network for long document classification, in which the word and section level features are extracted and fused to represent the complex structure of the long document. Furthermore, a feature-based section pooling module is adopted to eliminate redundant text information and accelerate the computing. A series of experiments are conducted to evaluate the proposed method. The experimental results verify that our method is effective, efficient and competitive compared with the related state-of-the-art methods.
引用
收藏
页数:7
相关论文
共 50 条
  • [1] Hierarchical Multi-Granularity Interaction Graph Convolutional Network for Long Document Classification
    Liu, Tengfei
    Hu, Yongli
    Gao, Junbin
    Sun, Yanfeng
    Yin, Baocai
    [J]. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2024, 32 : 1762 - 1775
  • [2] Cross-modal Multiple Granularity Interactive Fusion Network for Long Document Classification
    Liu, Tengfei
    Hu, Yongli
    Gao, Junbin
    Sun, Yanfeng
    Yin, Baocai
    [J]. ACM TRANSACTIONS ON KNOWLEDGE DISCOVERY FROM DATA, 2024, 18 (04)
  • [3] Hierarchical Attention Transformer Networks for Long Document Classification
    Hu, Yongli
    Chen, Puman
    Liu, Tengfei
    Gao, Junbin
    Sun, Yanfeng
    Yin, Baocai
    [J]. 2021 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2021,
  • [4] DOCUMENT CLASSIFICATION BASED ON CONVOLUTIONAL NEURAL NETWORK AND HIERARCHICAL ATTENTION NETWORK
    Cheng, Y.
    Ye, Z.
    Wang, M.
    Zhang, Q.
    [J]. NEURAL NETWORK WORLD, 2019, 29 (02) : 83 - 98
  • [5] Hierarchical Recurrent and Convolutional Neural Network Based on Attention for Chinese Document Classification
    Lei, Lianxin
    Lu, Junguo
    Ruan, Shaohui
    [J]. PROCEEDINGS OF THE 2019 31ST CHINESE CONTROL AND DECISION CONFERENCE (CCDC 2019), 2019, : 809 - 814
  • [6] Hierarchical Multi-Granularity Attention- Based Hybrid Neural Network for Text Classification
    Liu, Zhenyu
    Lu, Chaohong
    Huang, Haiwei
    Lyu, Shengfei
    Tao, Zhenchao
    [J]. IEEE Access, 2020, 8 : 149362 - 149371
  • [7] HIERARCHICAL TRANSFORMERS FOR LONG DOCUMENT CLASSIFICATION
    Pappagari, Raghavendra
    Zelasko, Piotr
    Villalba, Jesus
    Carmiel, Yishay
    Dehak, Najim
    [J]. 2019 IEEE AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING WORKSHOP (ASRU 2019), 2019, : 838 - 844
  • [8] Hierarchical Inter-Attention Network for Document Classification with Multi-Task Learning
    Tian, Bing
    Zhang, Yong
    Wang, Jin
    Xing, Chunxiao
    [J]. PROCEEDINGS OF THE TWENTY-EIGHTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2019, : 3569 - 3575
  • [9] Hierarchical multi-attention networks for document classification
    Yingren Huang
    Jiaojiao Chen
    Shaomin Zheng
    Yun Xue
    Xiaohui Hu
    [J]. International Journal of Machine Learning and Cybernetics, 2021, 12 : 1639 - 1647
  • [10] Hierarchical multi-attention networks for document classification
    Huang, Yingren
    Chen, Jiaojiao
    Zheng, Shaomin
    Xue, Yun
    Hu, Xiaohui
    [J]. INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2021, 12 (06) : 1639 - 1647