An Instance- and Label-Based Feature Selection Method in Classification Tasks

被引:1
|
作者
Fan, Qingcheng [1 ]
Liu, Sicong [1 ]
Zhao, Chunjiang [1 ,2 ]
Li, Shuqin [1 ]
机构
[1] Northwest A&F Univ, Coll Informat Engn, 3 Taicheng Rd, Xianyang 712100, Peoples R China
[2] Beijing Acad Agr & Forestry Sci, Res Ctr Informat Technol, Beijing 100097, Peoples R China
关键词
feature selection; manifold learning; classification;
D O I
10.3390/info14100532
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Feature selection is crucial in classification tasks as it helps to extract relevant information while reducing redundancy. This paper presents a novel method that considers both instance and label correlation. By employing the least squares method, we calculate the linear relationship between each feature and the target variable, resulting in correlation coefficients. Features with high correlation coefficients are selected. Compared to traditional methods, our approach offers two advantages. Firstly, it effectively selects features highly correlated with the target variable from a large feature set, reducing data dimensionality and improving analysis and modeling efficiency. Secondly, our method considers label correlation between features, enhancing the accuracy of selected features and subsequent model performance. Experimental results on three datasets demonstrate the effectiveness of our method in selecting features with high correlation coefficients, leading to superior model performance. Notably, our approach achieves a minimum accuracy improvement of 3.2% for the advanced classifier, lightGBM, surpassing other feature selection methods. In summary, our proposed method, based on instance and label correlation, presents a suitable solution for classification problems.
引用
收藏
页数:14
相关论文
共 50 条
  • [21] Combining instance and feature neighbours for extreme multi-label classification
    Feremans, Len
    Cule, Boris
    Vens, Celine
    Goethals, Bart
    INTERNATIONAL JOURNAL OF DATA SCIENCE AND ANALYTICS, 2020, 10 (03) : 215 - 231
  • [22] A Feature Selection Method for Multi-Label Text Based on Feature Importance
    Zhang, Lu
    Duan, Qingling
    APPLIED SCIENCES-BASEL, 2019, 9 (04):
  • [23] A PSO-based multi-objective multi-label feature selection method in classification
    Yong Zhang
    Dun-wei Gong
    Xiao-yan Sun
    Yi-nan Guo
    Scientific Reports, 7
  • [24] Consistency measure based simultaneous feature selection and instance purification for multimedia traffic classification
    Wu, Zheng
    Dong, Yu-ning
    Wei, Hua-Liang
    Tian, Wei
    COMPUTER NETWORKS, 2020, 173 (173)
  • [25] A Classification Method Based on Feature Selection for Imbalanced Data
    Liu, Yi
    Wang, Yanzhen
    Ren, Xiaoguang
    Zhou, Hao
    Diao, Xingchun
    IEEE ACCESS, 2019, 7 : 81794 - 81807
  • [26] A Weighted Classification Method Based on Adaptive Feature Selection
    Ni, Ruizheng
    Qiu, Ruichang
    Luo, Zhiwei
    Chen, Jie
    Jin, Zheming
    Liu, Zhigang
    IEEE ACCESS, 2022, 10 : 58635 - 58646
  • [27] A hybrid feature selection method based on instance learning and cooperative subset search
    Ben Brahim, Afef
    Limam, Mohamed
    PATTERN RECOGNITION LETTERS, 2016, 69 : 28 - 34
  • [28] Efficient Multi-label Classification using Attribute and Instance Selection
    Sane, Shirish S.
    Tidake, Vaishali S.
    BIOSCIENCE BIOTECHNOLOGY RESEARCH COMMUNICATIONS, 2020, 13 (14): : 221 - 226
  • [29] A feature selection-based speaker clustering method for paralinguistic tasks
    Gábor Gosztolya
    László Tóth
    Pattern Analysis and Applications, 2018, 21 : 193 - 204
  • [30] A feature selection-based speaker clustering method for paralinguistic tasks
    Gosztolya, Gabor
    Toth, Laszlo
    PATTERN ANALYSIS AND APPLICATIONS, 2018, 21 (01) : 193 - 204