An Effective Feature Selection Method Based on Pair-Wise Feature Proximity for High Dimensional Low Sample Size Data

被引:0
|
作者
Happy, S. L. [1 ]
Mohanty, Ramanarayan [2 ]
Routray, Aurobinda [1 ]
机构
[1] Indian Inst Technol, Dept Elect Engn, Kharagpur, W Bengal, India
[2] Indian Inst Technol, Adv Technol Dev Ctr, Kharagpur, W Bengal, India
关键词
Feature selection; pair-wise feature proximity; high dimensional low sample size data;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Feature selection has been studied widely in the literature. However, the efficacy of the selection criteria for low sample size applications is neglected in most cases. Most of the existing feature selection criteria are based on the sample similarity. However, the distance measures become insignificant for high dimensional low sample size (HDLSS) data. Moreover, the variance of a feature with a few samples is pointless unless it represents the data distribution efficiently. Instead of looking at the samples in groups, we evaluate their efficiency based on pair-wise fashion. In our investigation, we noticed that considering a pair of samples at a time and selecting the features that bring them closer or put them far away is a better choice for feature selection. Experimental results on benchmark data sets demonstrate the effectiveness of the proposed method with low sample size, which outperforms many other state-of-the-art feature selection methods.
引用
收藏
页码:1574 / 1578
页数:5
相关论文
共 50 条
  • [1] Intelligent fault diagnosis method for common rail injectors based on hierarchical weighted permutation entropy and pair-wise feature proximity feature selection
    Ke, Yun
    Yao, Chong
    Song, Enzhe
    Yang, Liping
    Dong, Quan
    [J]. JOURNAL OF VIBRATION AND CONTROL, 2022, 28 (17-18) : 2386 - 2398
  • [2] An effective heuristic for developing hybrid feature selection in high dimensional and low sample size datasets
    Shin, Hyunseok
    Oh, Sejong
    [J]. BMC Bioinformatics, 2024, 25 (01)
  • [3] Sample Size Selection for Pair-Wise Comparisons Using Information Criteria
    Pan, Xuemei
    Dayton, C. Mitchell
    [J]. JOURNAL OF MODERN APPLIED STATISTICAL METHODS, 2005, 4 (02) : 601 - 608
  • [4] DBFS: An effective Density Based Feature Selection scheme for small sample size and high dimensional imbalanced data sets
    Alibeigi, Mina
    Hashemi, Sattar
    Hamzeh, Ali
    [J]. DATA & KNOWLEDGE ENGINEERING, 2012, 81-82 : 67 - 103
  • [5] Feature Selection and Feature Stability Measurement Method for High-Dimensional Small Sample Data Based on Big Data Technology
    Huang, Chengyuan
    [J]. COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE, 2021, 2021
  • [6] Recurrent Neural Network Based Feature Selection for High Dimensional and Low Sample Size Micro-array Data
    Chowdhury, Shanta
    Dong, Xishuang
    Li, Xiangfang
    [J]. 2019 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2019, : 4823 - 4828
  • [7] Graph convolutional network-based feature selection for high-dimensional and low-sample size data
    Chen, Can
    Weiss, Scott T.
    Liu, Yang-Yu
    [J]. BIOINFORMATICS, 2023, 39 (04)
  • [8] Analyzing omics data by pair-wise feature evaluation with horizontal and vertical comparisons
    Huang, Xin
    Lin, Xiaohui
    Zhou, Lina
    Su, Benzhe
    [J]. JOURNAL OF PHARMACEUTICAL AND BIOMEDICAL ANALYSIS, 2018, 157 : 20 - 26
  • [9] Biobjective gradient descent for feature selection on high dimension, low sample size data
    Issa, Tina
    Angel, Eric
    Zehraoui, Farida
    [J]. PLOS ONE, 2024, 19 (07):
  • [10] On feature selection protocols for very low-sample-size data
    Kuncheva, Ludmila I.
    Rodriguez, Juan J.
    [J]. PATTERN RECOGNITION, 2018, 81 : 660 - 673