An Instance- and Label-Based Feature Selection Method in Classification Tasks

被引:1
|
作者
Fan, Qingcheng [1 ]
Liu, Sicong [1 ]
Zhao, Chunjiang [1 ,2 ]
Li, Shuqin [1 ]
机构
[1] Northwest A&F Univ, Coll Informat Engn, 3 Taicheng Rd, Xianyang 712100, Peoples R China
[2] Beijing Acad Agr & Forestry Sci, Res Ctr Informat Technol, Beijing 100097, Peoples R China
关键词
feature selection; manifold learning; classification;
D O I
10.3390/info14100532
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Feature selection is crucial in classification tasks as it helps to extract relevant information while reducing redundancy. This paper presents a novel method that considers both instance and label correlation. By employing the least squares method, we calculate the linear relationship between each feature and the target variable, resulting in correlation coefficients. Features with high correlation coefficients are selected. Compared to traditional methods, our approach offers two advantages. Firstly, it effectively selects features highly correlated with the target variable from a large feature set, reducing data dimensionality and improving analysis and modeling efficiency. Secondly, our method considers label correlation between features, enhancing the accuracy of selected features and subsequent model performance. Experimental results on three datasets demonstrate the effectiveness of our method in selecting features with high correlation coefficients, leading to superior model performance. Notably, our approach achieves a minimum accuracy improvement of 3.2% for the advanced classifier, lightGBM, surpassing other feature selection methods. In summary, our proposed method, based on instance and label correlation, presents a suitable solution for classification problems.
引用
收藏
页数:14
相关论文
共 50 条
  • [31] TOPSIS-ACO based feature selection for multi-label classification
    Verma G.
    Sahu T.P.
    International Journal of Computers and Applications, 2024, 46 (06) : 363 - 380
  • [32] Label-correlation-based Common and Specific Feature Selection for Hierarchical Classification
    Lin Y.-J.
    Bai S.-X.
    Zhao H.
    Li S.-Z.
    Hu Q.-H.
    Ruan Jian Xue Bao/Journal of Software, 2022, 33 (07): : 2667 - 2682
  • [33] Feature selection for multi-label classification based on neighborhood rough sets
    Duan, Jie
    Hu, Qinghua
    Zhang, Lingjun
    Qian, Yuhua
    Li, Deyu
    Jisuanji Yanjiu yu Fazhan/Computer Research and Development, 2015, 52 (01): : 56 - 65
  • [34] Feature Selection for Hierarchical Multi-label Classification
    da Silva, Luan V. M.
    Cerri, Ricardo
    ADVANCES IN INTELLIGENT DATA ANALYSIS XIX, IDA 2021, 2021, 12695 : 196 - 208
  • [35] Feature Selection for Multi-label Classification Problems
    Doquire, Gauthier
    Verleysen, Michel
    ADVANCES IN COMPUTATIONAL INTELLIGENCE, IWANN 2011, PT I, 2011, 6691 : 9 - 16
  • [36] OPTIMUM FEATURE ORDERING FOR DYNAMIC INSTANCE-WISE JOINT FEATURE SELECTION AND CLASSIFICATION
    Liyanage, Yasitha Warahena
    Zois, Daphney-Stavroula
    2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 3370 - 3374
  • [37] Robust Method of Sparse Feature Selection for Multi-Label Classification with Naive Bayes
    Ruta, Dymitr
    FEDERATED CONFERENCE ON COMPUTER SCIENCE AND INFORMATION SYSTEMS, 2014, 2014, 2 : 375 - 380
  • [38] An Ensemble Embedded Feature Selection Method for Multi-Label Clinical Text Classification
    Guo, Yumeng
    Chung, Fulai
    Li, Guozheng
    2016 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE (BIBM), 2016, : 823 - 826
  • [39] Multi-label feature selection method based on dynamic weight
    Zhang, Ping
    Sheng, Jiyao
    Gao, Wanfu
    Hu, Juncheng
    Li, Yonghao
    SOFT COMPUTING, 2022, 26 (06) : 2793 - 2805
  • [40] Multi-label feature selection method based on dynamic weight
    Ping Zhang
    Jiyao Sheng
    Wanfu Gao
    Juncheng Hu
    Yonghao Li
    Soft Computing, 2022, 26 : 2793 - 2805