Deep label relevance and label ambiguity based multi-label feature selection for text classification

被引：0

作者：

Verma, Gurudatta ^{[1
]}

Sahu, Tirath Prasad ^{[1
]}

机构：

[1] Natl Inst Technol Raipur, Dept Informat Technol, Raipur, India

来源：

ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE | 2025年 / 148卷

关键词：

Grey Relational Analysis; Feature selection; Multi-label learning; Particle Swarm Optimization; Text classification; Multi-Label K-Nearest Neighbors; MISSING LABELS; ALGORITHM;

D O I：

10.1016/j.engappai.2025.110403

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Multi-label text classification, where each document can be associated with multiple labels simultaneously, poses unique challenges in feature selection due to the complex relationships between features and labels. In this paper, we propose a novel Deep Label Relevance and Label Ambiguity (DLRLA) based multi-label feature selection method designed for multi-label text data. Our approach constructs a quasi-relevance matrix integrating low-order, high-order feature-label relevance and label ambiguity. The low-order relevance captures the direct association between individual features and labels, while the high-order relevance accounts for the interactions between feature combinations and labels, collectively termed as deep label relevance. Label ambiguity, measured using information entropy, quantifies the uncertainty associated with each label. The quasi-relevance matrix is then evaluated using Grey Relation Optimization to rank and select the most informative features based on multiple relevance criteria. Additionally, feature-feature relevance is incorporated to reduce the candidate set of high-order features, mitigating computational complexity. Elastic Net Regression, a linear regularized model, estimates feature-label relevance, enabling efficient feature selection while addressing multicollinearity. For multi-label classification, we leverage the Multi-Label K-Nearest Neighbors algorithm, where the key parameters (number of neighbours k and smoothing factor s) are optimized using Particle Swarm Optimization. The proposed DLRLA method is extensively evaluated on ten multi-label text benchmark datasets, considering six performance evaluation metrics. Comparative analyses with seven state-of-the-art methods are conducted. Furthermore, a stability analysis of DLRLA is performed across all datasets and evaluation metrics, showcasing its robustness and consistency.

引用

页数：21

共 50 条

[1] Multi-Label Feature Selection Based on Min-Relevance Label
Gao, Wanfu
Pan, Hanlin
IEEE ACCESS, 2023, 11 : 410 - 420
[2] Multi-label feature selection based on stable label relevance and label-specific features
Yang, Yong
Chen, Hongmei
Mi, Yong
Luo, Chuan
Horng, Shi-Jinn
Li, Tianrui
INFORMATION SCIENCES, 2023, 648
[3] Improving Multi-Label Medical Text Classification by Feature Selection
Glinka, Kinga
Wozniak, Rafal
Zakrzewska, Danuta
2017 IEEE 26TH INTERNATIONAL CONFERENCE ON ENABLING TECHNOLOGIES - INFRASTRUCTURE FOR COLLABORATIVE ENTERPRISES (WETICE), 2017, : 176 - 181
[4] Label prompt for multi-label text classification
Song, Rui
Liu, Zelong
Chen, Xingbing
An, Haining
Zhang, Zhiqi
Wang, Xiaoguang
Xu, Hao
APPLIED INTELLIGENCE, 2023, 53 (08) : 8761 - 8775
[5] Label prompt for multi-label text classification
Rui Song
Zelong Liu
Xingbing Chen
Haining An
Zhiqi Zhang
Xiaoguang Wang
Hao Xu
Applied Intelligence, 2023, 53 : 8761 - 8775
[6] A lightweight filter based feature selection approach for multi-label text classification
Dhal P.
Azad C.
Journal of Ambient Intelligence and Humanized Computing, 2023, 14 (09) : 12345 - 12357
[7] A COPRAS-based Approach to Multi-Label Feature Selection for Text Classification
Mohanrasu, S. S.
Janani, K.
Rakkiyappan, R.
MATHEMATICS AND COMPUTERS IN SIMULATION, 2024, 222 : 3 - 23
[8] Label Construction for Multi-label Feature Selection
Spolaor, Newton
Monard, Maria Carolina
Tsoumakas, Grigorios
Lee, Huei Diana
2014 BRAZILIAN CONFERENCE ON INTELLIGENT SYSTEMS (BRACIS), 2014, : 247 - 252
[9] Label Relevance Based Multi-Label Scratch Classification Algorithm
Peng C.
Sun Y.
Qi P.
Beijing Youdian Daxue Xuebao/Journal of Beijing University of Posts and Telecommunications, 2019, 42 (06): : 134 - 141
[10] Multi-label feature selection based on label correlations and feature redundancy
Fan, Yuling
Chen, Baihua
Huang, Weiqin
Liu, Jinghua
Weng, Wei
Lan, Weiyao
KNOWLEDGE-BASED SYSTEMS, 2022, 241

← 1 2 3 4 5 →