Exploring Hierarchical Multi-Label Text Classification Models using Attention-Based Approaches for Vietnamese language

Authors
Lam, Van [1 ,2 ]
Quach, Khoi [1 ,2 ]
Nguyen, Long [1 ,2 ]
Dinh, Dien [1 ,2 ]
Affiliations
[1] Univ Sci Ho Chi Minh City, Fac Informat Technol, Ho Chi Minh City, Vietnam
[2] Vietnam Natl Univ, Ho Chi Minh City, Vietnam
Keywords
Hierarchical Attention-based Recurrent Neural Network; Word Embedding; Vietnamese articles
DOI
10.1145/3639233.3639244
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
The Hierarchical Attention-based Recurrent Neural Network (HARNN) is a system designed to categorize documents efficiently, taking into account both the content of the texts and their hierarchical category structure. The system comprises three primary components: the Document Representation Layer (DRL), which performs semantic encoding; the Hierarchical Attention-based Recurrent Layer (HARL), which models dependencies between different hierarchical levels; and the Hybrid Predicting Layer (HPL), which is responsible for accurate category predictions. In this research, we evaluate HARNN on a dataset of Vietnamese articles from VnExpress and compare the performance of four word embeddings (Word2Vec, FastText, PhoBERT, and multilingual BERT). Additionally, we introduce a domain-based approach for the HARNN model and compare its accuracy with that of the original approach. Experimental findings indicate that HARNN performs effectively on Vietnamese text and that our domain-based approach can be advantageous for hierarchical multi-label text classification (HMTC) tasks in specific domains.
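As an illustration of what "hierarchical category structure" means for the labels in an HMTC task, the sketch below encodes a document's category paths as one multi-hot vector per hierarchy level, plus a flattened vector over all levels. It is not taken from the paper: the two-level hierarchy, the category names, and the helper names `encode_paths` and `flatten` are hypothetical, and using per-level and flattened targets is only one plausible way such a model could be supervised.

```python
from typing import List, Sequence

# Hypothetical two-level hierarchy; the category names are illustrative only
# and are not the VnExpress categories used in the paper.
LEVELS: List[List[str]] = [
    ["news", "sports"],                              # level 1
    ["politics", "economy", "football", "tennis"],   # level 2
]


def encode_paths(paths: Sequence[Sequence[str]]) -> List[List[int]]:
    """Return one multi-hot vector per hierarchy level for a single document."""
    vectors = [[0] * len(level) for level in LEVELS]
    for path in paths:
        for depth, name in enumerate(path):
            vectors[depth][LEVELS[depth].index(name)] = 1
    return vectors


def flatten(vectors: List[List[int]]) -> List[int]:
    """Concatenate the per-level vectors into one vector over all categories."""
    return [bit for level in vectors for bit in level]


# A document labelled with two root-to-leaf paths (multi-label).
doc_paths = [("news", "economy"), ("sports", "football")]
local_targets = encode_paths(doc_paths)   # one target vector per level
global_target = flatten(local_targets)    # one target vector over all levels
```

Here a single document carries labels at every level of the hierarchy, which is what distinguishes hierarchical multi-label classification from flat multi-label classification.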
Pages: 38-43
Number of pages: 6