Deep label relevance and label ambiguity based multi-label feature selection for text classification

Authors
Verma, Gurudatta [1]
Sahu, Tirath Prasad [1]
Affiliations
[1] Natl Inst Technol Raipur, Dept Informat Technol, Raipur, India
Keywords
Grey Relational Analysis; Feature selection; Multi-label learning; Particle Swarm Optimization; Text classification; Multi-Label K-Nearest Neighbors; MISSING LABELS; ALGORITHM;
DOI
10.1016/j.engappai.2025.110403
Chinese Library Classification
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Multi-label text classification, in which each document can be associated with several labels simultaneously, complicates feature selection because of the complex relationships between features and labels. In this paper, we propose a novel Deep Label Relevance and Label Ambiguity (DLRLA) based feature selection method designed for multi-label text data. Our approach constructs a quasi-relevance matrix that integrates low-order feature-label relevance, high-order feature-label relevance, and label ambiguity. The low-order relevance captures the direct association between individual features and labels, while the high-order relevance accounts for the interactions between feature combinations and labels; together these are termed deep label relevance. Label ambiguity, measured using information entropy, quantifies the uncertainty associated with each label. The quasi-relevance matrix is then evaluated using Grey Relation Optimization to rank and select the most informative features based on multiple relevance criteria. Additionally, feature-feature relevance is incorporated to reduce the candidate set of high-order features, which lowers computational complexity. Elastic Net Regression, a regularized linear model, estimates feature-label relevance, enabling efficient feature selection while addressing multicollinearity. For multi-label classification, we use the Multi-Label K-Nearest Neighbors algorithm, whose key parameters (number of neighbours k and smoothing factor s) are optimized using Particle Swarm Optimization. The proposed DLRLA method is evaluated on ten multi-label text benchmark datasets using six performance evaluation metrics and compared with seven state-of-the-art methods. Furthermore, a stability analysis of DLRLA is performed across all datasets and evaluation metrics, demonstrating its robustness and consistency.
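
Two of the ingredients named in the abstract can be illustrated with a minimal sketch: label ambiguity as the information entropy of a label, and ranking features from a multi-criteria matrix by grey relational grade. This is not the authors' implementation. It uses the textbook form of grey relational analysis, which may differ from the paper's Grey Relation Optimization, and the function names, the toy matrix, and the distinguishing coefficient `rho = 0.5` are all assumptions made for illustration.

```python
import math


def label_entropy(label_column):
    """Binary Shannon entropy (in bits) of one label's 0/1 assignments.

    Assumption: ambiguity of a label is taken as the entropy of its
    positive rate; the paper may define it differently.
    """
    p = sum(label_column) / len(label_column)
    if p in (0.0, 1.0):
        return 0.0  # a label that is always on or always off is unambiguous
    return -(p * math.log2(p) + (1 - p) * math.log2(1 - p))


def grey_relational_grades(matrix, rho=0.5):
    """Grey relational grade of each row (feature) of a criteria matrix.

    Each column is one criterion where larger values are better.
    rho is the distinguishing coefficient (0.5 is a common default).
    """
    # Min-max normalise every criterion column to [0, 1].
    norm_cols = []
    for col in zip(*matrix):
        lo, hi = min(col), max(col)
        span = hi - lo
        norm_cols.append([(v - lo) / span if span else 1.0 for v in col])
    norm_rows = list(zip(*norm_cols))

    # Deviation of each entry from the ideal reference value 1.0.
    deltas = [[abs(1.0 - v) for v in row] for row in norm_rows]
    d_min = min(min(row) for row in deltas)
    d_max = max(max(row) for row in deltas)
    if d_max == 0:
        return [1.0] * len(matrix)  # all rows already match the reference

    grades = []
    for row in deltas:
        coeffs = [(d_min + rho * d_max) / (d + rho * d_max) for d in row]
        grades.append(sum(coeffs) / len(coeffs))
    return grades


# Hypothetical scores for three features on two criteria.
scores = [[0.9, 0.8], [0.1, 0.2], [0.5, 0.5]]
grades = grey_relational_grades(scores)
ranking = sorted(range(len(grades)), key=lambda i: grades[i], reverse=True)
```

In this toy case `ranking` orders the features from highest to lowest grade, so the top entries would be the ones kept. In a DLRLA-style pipeline the columns of the matrix would instead hold the low-order relevance, high-order relevance, and ambiguity-based scores described above.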
Pages: 21