An Instance- and Label-Based Feature Selection Method in Classification Tasks

Cited by: 1
Authors
Fan, Qingcheng [1]
Liu, Sicong [1]
Zhao, Chunjiang [1, 2]
Li, Shuqin [1]
Affiliations
[1] Northwest A&F Univ, Coll Informat Engn, 3 Taicheng Rd, Xianyang 712100, Peoples R China
[2] Beijing Acad Agr & Forestry Sci, Res Ctr Informat Technol, Beijing 100097, Peoples R China
Keywords
feature selection; manifold learning; classification;
DOI
10.3390/info14100532
Chinese Library Classification (CLC) number
TP [Automation Technology, Computer Technology];
Discipline classification code
0812;
Abstract
Feature selection is crucial in classification tasks because it helps to extract relevant information while reducing redundancy. This paper presents a novel method that considers both instance and label correlation. Using the least squares method, we calculate the linear relationship between each feature and the target variable, which yields correlation coefficients. Features with high correlation coefficients are selected. Compared to traditional methods, our approach offers two advantages. First, it effectively selects features highly correlated with the target variable from a large feature set, reducing data dimensionality and improving analysis and modeling efficiency. Second, our method considers label correlation between features, enhancing the accuracy of the selected features and subsequent model performance. Experimental results on three datasets demonstrate the effectiveness of our method in selecting features with high correlation coefficients, leading to superior model performance. Notably, our approach achieves a minimum accuracy improvement of 3.2% for the advanced classifier LightGBM, surpassing other feature selection methods. In summary, our proposed method, based on instance and label correlation, presents a suitable solution for classification problems.
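The scoring step described in the abstract (fit each feature to the target by least squares, then keep the features with the highest correlation coefficients) might be sketched roughly as follows. This is only an illustrative approximation and not the authors' implementation: it assumes a univariate linear fit per feature, uses the absolute Pearson coefficient as the score, and ignores the instance- and label-correlation terms of the full method. The function names `least_squares_scores` and `select_top_k` and the synthetic data are hypothetical.

```python
import numpy as np


def least_squares_scores(X, y):
    """Absolute correlation coefficient of a univariate least-squares fit
    y ~ a * x_j + b, computed for every feature column x_j.
    (Assumed scoring rule; the paper's exact formulation may differ.)"""
    X = np.asarray(X, dtype=float)
    y = np.asarray(y, dtype=float)
    Xc = X - X.mean(axis=0)          # centre each feature
    yc = y - y.mean()                # centre the target
    sxx = (Xc ** 2).sum(axis=0)      # per-feature sum of squares
    syy = (yc ** 2).sum()            # target sum of squares
    sxy = Xc.T @ yc                  # per-feature cross products
    denom = np.sqrt(sxx * syy)
    # Constant features (or a constant target) receive a score of 0.
    r = np.divide(sxy, denom, out=np.zeros_like(sxy), where=denom > 0)
    return np.abs(r)


def select_top_k(X, y, k):
    """Return the indices of the k highest-scoring features and all scores."""
    scores = least_squares_scores(X, y)
    order = np.argsort(-scores, kind="stable")[:k]
    return order, scores


# Synthetic example: only feature 2 drives the binary label.
rng = np.random.default_rng(0)
X_demo = rng.normal(size=(200, 5))
y_demo = (X_demo[:, 2] + 0.1 * rng.normal(size=200) > 0).astype(float)
selected, scores = select_top_k(X_demo, y_demo, k=2)
print(selected, np.round(scores, 3))
```

For a single feature, the squared correlation coefficient equals the R² of the least-squares line, so ranking by the absolute coefficient is equivalent to ranking by how well each feature alone explains the target linearly.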
Pages: 14