Hierarchical Transfer Learning for Multi-label Text Classification

被引:0
|
作者
Banerjee, Siddhartha [1 ]
Akkaya, Cem [1 ]
Perez-Sorrosal, Francisco [1 ]
Tsioutsiouliklis, Kostas [1 ]
机构
[1] Yahoo Res, 701 First Ave, Sunnyvale, CA 94089 USA
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Multi-Label Hierarchical Text Classification (MLHTC) is the task of categorizing documents into one or more topics organized in an hierarchical taxonomy. MLHTC can be formulated by combining multiple binary classification problems with an independent classifier for each category. We propose a novel transfer learning based strategy, HTrans, where binary classifiers at lower levels in the hierarchy are initialized using parameters of the parent classifier and fine-tuned on the child category classification task. In HTrans, we use a Gated Recurrent Unit (GRU)-based deep learning architecture coupled with attention. Compared to binary classifiers trained from scratch, our HTrans approach results in significant improvements of 1% on micro-F1 and 3% on macro-F1 on the RCV1 dataset. Our experiments also show that binary classifiers trained from scratch are significantly better than single multi-label models.
引用
收藏
页码:6295 / 6300
页数:6
相关论文
共 50 条
  • [21] Hierarchical Multi-label Text Classification with Horizontal and Vertical Category Correlations
    Xu, Linli
    Teng, Sijie
    Zhao, Ruoyu
    Guo, Junliang
    Xiao, Chi
    Jiang, Deqiang
    Ren, Bo
    2021 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2021), 2021, : 2459 - 2468
  • [22] Deep neural network for hierarchical extreme multi-label text classification
    Gargiulo, Francesco
    Silvestri, Stefano
    Ciampi, Mario
    De Pietro, Giuseppe
    APPLIED SOFT COMPUTING, 2019, 79 : 125 - 138
  • [23] HGBL: A Fine Granular Hierarchical Multi-Label Text Classification ModelHGBL: A Fine Granular Hierarchical Multi-Label Text Classification ModelC. Zhang et al.
    Chaoqun Zhang
    Linlin Dai
    Chengxing Liu
    Longhao Zhang
    Neural Processing Letters, 57 (1)
  • [24] EnML: Multi-label Ensemble Learning for Urdu Text Classification
    Mehmood, Faiza
    Shahzadi, Rehab
    Ghafoor, Hina
    Asim, Muhammad Nabeel
    Ghani, Muhammad Usman
    Mahmood, Waqar
    Dengel, Andreas
    ACM TRANSACTIONS ON ASIAN AND LOW-RESOURCE LANGUAGE INFORMATION PROCESSING, 2023, 22 (09)
  • [25] An Effective Deployment of Contrastive Learning in Multi-label Text Classification
    Lin, Nankai
    Qin, Guanqiu
    Wang, Jigang
    Zhou, Dong
    Yang, Aimin
    Proceedings of the Annual Meeting of the Association for Computational Linguistics, 2023, : 8730 - 8744
  • [26] A Survey of Multi-label Text Classification Based on Deep Learning
    Chen, Xiaolong
    Cheng, Jieren
    Liu, Jingxin
    Xu, Wenghang
    Hua, Shuai
    Tang, Zhu
    Sheng, Victor S.
    ARTIFICIAL INTELLIGENCE AND SECURITY, ICAIS 2022, PT I, 2022, 13338 : 443 - 456
  • [27] Multi-Label Classification of Text Documents Using Deep Learning
    Mohammed, Hamza Haruna
    Dogdu, Erdogan
    Gorur, Abdul Kadir
    Choupani, Roya
    2020 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2020, : 4681 - 4689
  • [28] Multi-Label Arabic Text Classification Based On Deep Learning
    Alsukhni, Batool
    2021 12TH INTERNATIONAL CONFERENCE ON INFORMATION AND COMMUNICATION SYSTEMS (ICICS), 2021, : 475 - 477
  • [29] Multi-Label Text Classification Based on Contrastive and Correlation Learning
    Yang, Shuo
    Gao, Shu
    PROCEEDINGS OF 2024 3RD INTERNATIONAL CONFERENCE ON CYBER SECURITY, ARTIFICIAL INTELLIGENCE AND DIGITAL ECONOMY, CSAIDE 2024, 2024, : 325 - 330
  • [30] Hybrid embedding-based text representation for hierarchical multi-label text classification
    Ma, Yinglong
    Liu, Xiaofeng
    Zhao, Lijiao
    Liang, Yue
    Zhang, Peng
    Jin, Beihong
    EXPERT SYSTEMS WITH APPLICATIONS, 2022, 187