Exploring Hierarchical Multi-Label Text Classification Models using Attention-Based Approaches for Vietnamese language

被引:0
|
作者
Lam, Van [1 ,2 ]
Quach, Khoi [1 ,2 ]
Nguyen, Long [1 ,2 ]
Dinh, Dien [1 ,2 ]
机构
[1] Univ Sci Ho Chi Minh City, Fac Informat Technol, Ho Chi Minh City, Vietnam
[2] Vietnam Natl Univ, Ho Chi Minh City, Vietnam
关键词
Hierarchical Attention-based Recurrent Neural Network; Word Embedding; Vietnamese articles;
D O I
10.1145/3639233.3639244
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The Hierarchical Attention-based Recurrent Neural Network (HARNN) is a system designed to categorize documents efficiently, taking into account both the content of the texts and their hierarchical category structure. This system is comprised of three primary components: the Document Representation Layer (DRL), which is used for semantic encoding, the Hierarchical Attention-based Recurrent Layer (HARL), that models dependencies between different hierarchical levels, and the Hybrid Predicting Layer (HPL), which is responsible for accurate category predictions. In this research, we put HARNN to the test, using a dataset of Vietnamese articles from VnExpress. We then contrast the performance of four different word embeddings (Word2Vec, FastText, PhoBERT, and BERT multilingual). Additionally, we introduce a domain-based approach for the HARNN model to compare the accuracy with the original manner. Experimental findings indicate that HARNN performs effectively in the context of Vietnamese language and that our domain-based approach can be advantageous in specific domains HMTC task.
引用
收藏
页码:38 / 43
页数:6
相关论文
共 50 条
  • [41] Improving Multi-label Text Classification Models with Knowledge Graphs
    Prabhu, Divya
    Rajabi, Enayat
    Ganta, Mohan Kumar
    Thomas, Tressy
    SERVICE-ORIENTED COMPUTING, ICSOC 2021 WORKSHOPS, 2022, 13236 : 117 - 124
  • [42] Multi-label Classification for Clinical Text with Feature-level Attention
    Pan, Disheng
    Zheng, Xizi
    Liu, Weijie
    Li, Mengya
    Ma, Meng
    Zhou, Ying
    Yang, Li
    Wang, Ping
    2020 IEEE 6TH INT CONFERENCE ON BIG DATA SECURITY ON CLOUD (BIGDATASECURITY) / 6TH IEEE INT CONFERENCE ON HIGH PERFORMANCE AND SMART COMPUTING, (HPSC) / 5TH IEEE INT CONFERENCE ON INTELLIGENT DATA AND SECURITY (IDS), 2020, : 186 - 191
  • [43] Online multi-label dependency topic models for text classification
    Burkhardt, Sophie
    Kramer, Stefan
    MACHINE LEARNING, 2018, 107 (05) : 859 - 886
  • [44] Deep Learning Method with Attention for Extreme Multi-label Text Classification
    Chen, Si
    Wang, Liangguo
    Li, Wan
    Zhang, Kun
    PRICAI 2019: TRENDS IN ARTIFICIAL INTELLIGENCE, PT III, 2019, 11672 : 179 - 190
  • [45] HGBL: A Fine Granular Hierarchical Multi-Label Text Classification ModelHGBL: A Fine Granular Hierarchical Multi-Label Text Classification ModelC. Zhang et al.
    Chaoqun Zhang
    Linlin Dai
    Chengxing Liu
    Longhao Zhang
    Neural Processing Letters, 57 (1)
  • [46] Label-text bi-attention capsule networks model for multi-label text classification
    Wang, Gang
    Du, Yajun
    Jiang, Yurui
    Liu, Jia
    Li, Xianyong
    Chen, Xiaoliang
    Gao, Hongmei
    Xie, Chunzhi
    Lee, Yan-li
    NEUROCOMPUTING, 2024, 588
  • [47] Text Classification Based on Natural Language Processing and Machine Learning in Multi-Label Corpus
    Yu, Haitao
    Xiong, Feng
    Chen, Zuh ui
    ACM TRANSACTIONS ON ASIAN AND LOW-RESOURCE LANGUAGE INFORMATION PROCESSING, 2024, 23 (08)
  • [48] Multi-Label Emotion Classification of Online Learners' Reviews Using Machine Learning Text-Based Multi-Label Classification Approach
    Makhoukhi, Hajar
    Roubi, Sarra
    2024 5TH INTERNATIONAL CONFERENCE ON EDUCATION DEVELOPMENT AND STUDIES, ICEDS 2024, 2024, : 59 - 64
  • [49] Multi-label Text Classification Method Based on Label Semantic Information
    Xiao L.
    Chen B.-L.
    Huang X.
    Liu H.-F.
    Jing L.-P.
    Yu J.
    Ruan Jian Xue Bao/Journal of Software, 2020, 31 (04): : 1079 - 1089
  • [50] Multi-label text classification based on the label correlation mixture model
    He, Zhiyang
    Wu, Ji
    Lv, Ping
    INTELLIGENT DATA ANALYSIS, 2017, 21 (06) : 1371 - 1392