Hybrid Feature-Based Multi-label Text Classification-A Framework

被引:0
|
作者
Agarwal, Nancy [1 ]
Wani, Mudasir Ahmad [2 ]
ELAffendi, Mohammed [2 ]
机构
[1] Norwegian Univ Sci & Technol, N-2814 Gjovik, Norway
[2] Prince Sultan Univ PSU, Coll Comp & Informat Sci CCIS, Riyadh 11586, Saudi Arabia
关键词
Multi-label text classification; Natural language processing; Ensemble learning; Deep learning; SYSTEM;
D O I
10.1007/978-3-031-21101-0_17
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Multi-label Text Classification (MLTC) as a problem is a scenario in which a text document can belong to one or more classes simultaneously. Such classification tasks pose several general as well as specific research challenges. The general challenges include dependency among classes, imbalanced data, and scalability in the presence of an excessive number of labels. On the other hand, the MLTC-specific challenges include high dimensional feature space, obtaining contextual and semantic knowledge from the text, and understanding content diversity. This paper provides a brief description of the multi-label classification approaches such as problem transformation, algorithm adaptation, and ensemble learning along with their strengths and weaknesses. Furthermore, we proposed an MLTC framework referred to as HMTCS (Hybrid feature-based Multi-label Text Classification System) that handles both general multi-labeling issues and text categorization-specific issues. The proposed framework has three modules, namely, Labels Knowledge Base, Hybrid Feature Extraction, and Ensemble Learning.
引用
收藏
页码:211 / 221
页数:11
相关论文
共 50 条
  • [1] Hybrid Feature Extraction for Multi-Label Emotion Classification in English Text Messages
    Ahanin, Zahra
    Ismail, Maizatul Akmar
    Singh, Narinderjit Singh Sawaran
    AL-Ashmori, Ammar
    Syafrudin, Muhammad
    Alfian, Ganjar
    Fitriyani, Norma Latif
    Anshari, Muhammad
    [J]. SUSTAINABILITY, 2023, 15 (16)
  • [2] Multi-label text classification with an ensemble feature space
    Tandon, Kushagri
    Chatterjee, Niladri
    [J]. JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2022, 42 (05) : 4425 - 4436
  • [3] Multi-label text classification with an ensemble feature space
    Tandon, Kushagri
    Chatterjee, Niladri
    [J]. Journal of Intelligent and Fuzzy Systems, 2022, 42 (05): : 4425 - 4436
  • [4] A lightweight filter based feature selection approach for multi-label text classification
    Dhal P.
    Azad C.
    [J]. Journal of Ambient Intelligence and Humanized Computing, 2023, 14 (09) : 12345 - 12357
  • [5] A COPRAS-based Approach to Multi-Label Feature Selection for Text Classification
    Mohanrasu, S. S.
    Janani, K.
    Rakkiyappan, R.
    [J]. MATHEMATICS AND COMPUTERS IN SIMULATION, 2024, 222 : 3 - 23
  • [6] Reasearch on Feature Mapping Based on Labels Information in Multi-label Text Classification
    Wang, Tao
    Luo, Tao
    Li, Jianfeng
    Wang, Cong
    [J]. PROCEEDINGS OF 2017 IEEE 7TH INTERNATIONAL CONFERENCE ON ELECTRONICS INFORMATION AND EMERGENCY COMMUNICATION (ICEIEC), 2017, : 452 - 456
  • [7] Hybrid embedding-based text representation for hierarchical multi-label text classification
    Ma, Yinglong
    Liu, Xiaofeng
    Zhao, Lijiao
    Liang, Yue
    Zhang, Peng
    Jin, Beihong
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2022, 187
  • [8] Improving Multi-Label Medical Text Classification by Feature Selection
    Glinka, Kinga
    Wozniak, Rafal
    Zakrzewska, Danuta
    [J]. 2017 IEEE 26TH INTERNATIONAL CONFERENCE ON ENABLING TECHNOLOGIES - INFRASTRUCTURE FOR COLLABORATIVE ENTERPRISES (WETICE), 2017, : 176 - 181
  • [9] An Efficient Framework by Topic Model for Multi-label Text Classification
    Sun, Wei
    Ran, Xiangying
    Luo, Xiangyang
    Wang, Chongjun
    [J]. 2019 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2019,
  • [10] gMLC: a multi-label feature selection framework for graph classification
    Xiangnan Kong
    Philip S. Yu
    [J]. Knowledge and Information Systems, 2012, 31 : 281 - 305