An Instance- and Label-Based Feature Selection Method in Classification Tasks

被引:1
|
作者
Fan, Qingcheng [1 ]
Liu, Sicong [1 ]
Zhao, Chunjiang [1 ,2 ]
Li, Shuqin [1 ]
机构
[1] Northwest A&F Univ, Coll Informat Engn, 3 Taicheng Rd, Xianyang 712100, Peoples R China
[2] Beijing Acad Agr & Forestry Sci, Res Ctr Informat Technol, Beijing 100097, Peoples R China
关键词
feature selection; manifold learning; classification;
D O I
10.3390/info14100532
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Feature selection is crucial in classification tasks as it helps to extract relevant information while reducing redundancy. This paper presents a novel method that considers both instance and label correlation. By employing the least squares method, we calculate the linear relationship between each feature and the target variable, resulting in correlation coefficients. Features with high correlation coefficients are selected. Compared to traditional methods, our approach offers two advantages. Firstly, it effectively selects features highly correlated with the target variable from a large feature set, reducing data dimensionality and improving analysis and modeling efficiency. Secondly, our method considers label correlation between features, enhancing the accuracy of selected features and subsequent model performance. Experimental results on three datasets demonstrate the effectiveness of our method in selecting features with high correlation coefficients, leading to superior model performance. Notably, our approach achieves a minimum accuracy improvement of 3.2% for the advanced classifier, lightGBM, surpassing other feature selection methods. In summary, our proposed method, based on instance and label correlation, presents a suitable solution for classification problems.
引用
收藏
页数:14
相关论文
共 50 条
  • [1] A Weighted Feature Selection Method for Instance-Based Classification
    Agre, Gennady
    Dzhondzhorov, Anton
    ARTIFICIAL INTELLIGENCE: METHODOLOGY, SYSTEMS, AND APPLICATIONS, AIMSA 2016, 2016, 9883 : 14 - 25
  • [2] Sample Label-based PLS and Feature extraction
    Yang MaoLong
    Mao WeiHao
    Sun QuanSen
    Xia DeShen
    PLS '09: PROCEEDINGS OF THE 6TH INTERNATIONAL CONFERENCE ON PARTIAL LEAST SQUARES AND RELATED METHODS, 2009, : 127 - 131
  • [3] Topic-Based Instance and Feature Selection in Multilabel Classification
    Ma, Jianghong
    Chow, Tommy W. S.
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2022, 33 (01) : 315 - 329
  • [4] Integrated Instance- and Class-based Generative Modeling for Text Classification
    Puurula, Antti
    Myaeng, Sung-Hyon
    PROCEEDINGS OF THE 18TH AUSTRALASIAN DOCUMENT COMPUTING SYMPOSIUM (ADCS 2013), 2013, : 66 - 73
  • [5] Towards Multi-label Feature Selection by Instance and Label Selections
    Mansouri, Dou El Kefel
    Benabdeslem, Khalid
    ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PAKDD 2021, PT II, 2021, 12713 : 233 - 244
  • [6] A lazy feature selection method for multi-label classification
    Pereira, Rafael B.
    Plastino, Alexandre
    Zadrozny, Bianca
    Merschmann, Luiz H. C.
    INTELLIGENT DATA ANALYSIS, 2021, 25 (01) : 21 - 34
  • [7] FISA: Feature-based instance selection for imbalanced text classification
    Sun, Aixin
    Lim, Ee-Peng
    Benatallah, Boualem
    Hassan, Mahbub
    ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PROCEEDINGS, 2006, 3918 : 250 - 254
  • [8] A Feature Selection Approach Based on Information Theory for Classification Tasks
    Jesus, Jhoseph
    Canuto, Anne
    Araujo, Daniel
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING, PT II, 2017, 10614 : 359 - 367
  • [9] A Multi-Label Classification With Hybrid Label-Based Meta-Learning Method in Internet of Things
    Lin, Sung-Chiang
    Chen, Chih-Jou
    Lee, Tsung-Ju
    IEEE ACCESS, 2020, 8 : 42261 - 42269
  • [10] An Unsupervised-based Dynamic Feature Selection for Classification tasks
    Nunes, Romulo de O.
    Dantas, Carine A.
    Canuto, Anne M. P.
    Xavier-Junior, Joao C.
    2016 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2016, : 4213 - 4220