Hybrid Feature-Based Multi-label Text Classification-A Framework

被引:0
|
作者
Agarwal, Nancy [1 ]
Wani, Mudasir Ahmad [2 ]
ELAffendi, Mohammed [2 ]
机构
[1] Norwegian Univ Sci & Technol, N-2814 Gjovik, Norway
[2] Prince Sultan Univ PSU, Coll Comp & Informat Sci CCIS, Riyadh 11586, Saudi Arabia
关键词
Multi-label text classification; Natural language processing; Ensemble learning; Deep learning; SYSTEM;
D O I
10.1007/978-3-031-21101-0_17
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Multi-label Text Classification (MLTC) as a problem is a scenario in which a text document can belong to one or more classes simultaneously. Such classification tasks pose several general as well as specific research challenges. The general challenges include dependency among classes, imbalanced data, and scalability in the presence of an excessive number of labels. On the other hand, the MLTC-specific challenges include high dimensional feature space, obtaining contextual and semantic knowledge from the text, and understanding content diversity. This paper provides a brief description of the multi-label classification approaches such as problem transformation, algorithm adaptation, and ensemble learning along with their strengths and weaknesses. Furthermore, we proposed an MLTC framework referred to as HMTCS (Hybrid feature-based Multi-label Text Classification System) that handles both general multi-labeling issues and text categorization-specific issues. The proposed framework has three modules, namely, Labels Knowledge Base, Hybrid Feature Extraction, and Ensemble Learning.
引用
收藏
页码:211 / 221
页数:11
相关论文
共 50 条
  • [21] Multi-Label Text Classification Based on Label Combination and Fusion of Attentions
    Wu, Xinke
    Sun, Jun
    Li, Zhihua
    [J]. Computer Engineering and Applications, 2023, 59 (06) : 125 - 133
  • [22] Feature distribution-based label correlation in multi-label classification
    Xiaoya Che
    Degang Chen
    Jusheng Mi
    [J]. International Journal of Machine Learning and Cybernetics, 2021, 12 : 1705 - 1719
  • [23] Feature distribution-based label correlation in multi-label classification
    Che, Xiaoya
    Chen, Degang
    Mi, Jusheng
    [J]. INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2021, 12 (06) : 1705 - 1719
  • [24] LABEL-AWARE TEXT REPRESENTATION FOR MULTI-LABEL TEXT CLASSIFICATION
    Guo, Hao
    Li, Xiangyang
    Zhang, Lei
    Liu, Jia
    Chen, Wei
    [J]. 2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 7728 - 7732
  • [25] Independent Feature and Label Components for Multi-label Classification
    Zhong, Yongjian
    Xu, Chang
    Du, Bo
    Zhang, Lefei
    [J]. 2018 IEEE INTERNATIONAL CONFERENCE ON DATA MINING (ICDM), 2018, : 827 - 836
  • [26] Multi-label text classification model based on semantic embedding
    Yan Danfeng
    Ke Nan
    Gu Chao
    Cui Jianfei
    Ding Yiqi
    [J]. The Journal of China Universities of Posts and Telecommunications, 2019, 26 (01) : 95 - 104
  • [27] An Ensemble Embedded Feature Selection Method for Multi-Label Clinical Text Classification
    Guo, Yumeng
    Chung, Fulai
    Li, Guozheng
    [J]. 2016 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE (BIBM), 2016, : 823 - 826
  • [28] Extreme Multi-Label Text Classification Based on Balance Function
    Chen, Zhaohong
    Hong, Zhiyong
    Yu, Wenhua
    Zhang, Xin
    [J]. Computer Engineering and Applications, 2024, 60 (04) : 163 - 172
  • [29] A Survey of Multi-label Text Classification Based on Deep Learning
    Chen, Xiaolong
    Cheng, Jieren
    Liu, Jingxin
    Xu, Wenghang
    Hua, Shuai
    Tang, Zhu
    Sheng, Victor S.
    [J]. ARTIFICIAL INTELLIGENCE AND SECURITY, ICAIS 2022, PT I, 2022, 13338 : 443 - 456
  • [30] Multi-Label Text Classification Based on Shared Semantic Space
    Sun, Kun
    Qin, Bowen
    Sang, Jitao
    Yu, Jian
    [J]. Computer Engineering and Applications, 2023, 59 (12): : 100 - 105