Feature Selection using Mutual Information for High-dimensional Data Sets

Cited by: 0
Authors
Nagpal, Arpita [1]
Gaur, Deepti [1]
Gaur, Seema [2]
Affiliations
[1] ITM Univ, Dept Comp Sci, Gurgaon, India
[2] Banasthali Univ, Banasthali, Rajasthan, India
Keywords
Correlation; feature selection; minimum spanning tree; data set
DOI
Not available
Chinese Library Classification number
TP301 [Theory, Methods]
Discipline classification code
081202
Abstract
To reduce the dimensionality of a dataset, redundant and irrelevant features need to be separated out of the multidimensional data, which calls for a feature selection technique. Here, a feature selection technique that removes irrelevant features is used. Correlation measures based on the concept of mutual information are adopted to calculate the degree of association between features. In this paper, the authors propose a new algorithm that segregates features from high-dimensional data by visualizing the dataset's relevant features in the form of a graph.
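The abstract does not give the authors' algorithm, so the following is only a minimal sketch of the underlying quantity, the mutual information between two discrete features, estimated from empirical frequencies. The function name `mutual_information` and the toy feature values are assumptions made for illustration; they are not taken from the paper.

```python
from collections import Counter
from math import log2


def mutual_information(xs, ys):
    """Estimate I(X; Y) in bits from two equal-length sequences of discrete values.

    Hypothetical helper for illustration; not the paper's implementation.
    """
    if len(xs) != len(ys):
        raise ValueError("sequences must have the same length")
    n = len(xs)
    if n == 0:
        raise ValueError("sequences must not be empty")

    count_x = Counter(xs)            # marginal counts of X
    count_y = Counter(ys)            # marginal counts of Y
    count_xy = Counter(zip(xs, ys))  # joint counts of (X, Y)

    mi = 0.0
    for (x, y), c in count_xy.items():
        # p(x, y) * log2(p(x, y) / (p(x) * p(y))), written with raw counts
        mi += (c / n) * log2(c * n / (count_x[x] * count_y[y]))
    return mi


# Toy data (assumed): feature_a determines feature_b, feature_c is unrelated.
feature_a = [0, 0, 1, 1]
feature_b = [0, 0, 1, 1]
feature_c = [0, 1, 0, 1]

mi_ab = mutual_information(feature_a, feature_b)
mi_ac = mutual_information(feature_a, feature_c)
```

A pair of features with high mutual information is strongly associated, while a value near zero suggests the two are close to independent. Pairwise scores of this kind could serve as edge weights in a feature graph, which is one plausible reading of the graph and minimum-spanning-tree keywords above.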
Pages: 45-49
Number of pages: 5
Related papers
50 in total
  • [1] Feature selection, mutual information, and the classification of high-dimensional patterns
    Bonev, Boyan
    Escolano, Francisco
    Cazorla, Miguel
    PATTERN ANALYSIS AND APPLICATIONS, 2008, 11 (3-4) : 309 - 319
  • [2] A Feature Subset Selection Method Based On High-Dimensional Mutual Information
    Zheng, Yun
    Kwoh, Chee Keong
    ENTROPY, 2011, 13 (04) : 860 - 901
  • [3] High-dimensional supervised feature selection via optimized kernel mutual information
    Bi, Ning
    Tan, Jun
    Lai, Jian-Huang
    Suen, Ching Y.
    EXPERT SYSTEMS WITH APPLICATIONS, 2018, 108 : 81 - 95
  • [4] Feature selection, mutual information, and the classification of high-dimensional patterns: Applications to image classification and microarray data analysis
    Boyan Bonev
    Francisco Escolano
    Miguel Cazorla
    Pattern Analysis and Applications, 2008, 11 : 309 - 319
  • [5] Band Selection for High-Dimensional Remote Sensing Data by Mutual Information
    Banit'ouagua, Ibtissam
    Kerroum, Mounir Ait
    Hammouch, Ahmed
    Aboutajdine, Driss
    2016 INTERNATIONAL CONFERENCE ON ELECTRICAL AND INFORMATION TECHNOLOGIES (ICEIT), 2016, : 386 - 391
  • [6] Feature selection for high-dimensional data
    Bolón-Canedo V.
    Sánchez-Maroño N.
    Alonso-Betanzos A.
    Progress in Artificial Intelligence, 2016, 5 (2) : 65 - 75
  • [7] Feature selection for high-dimensional data
    Destrero A.
    Mosci S.
    De Mol C.
    Verri A.
    Odone F.
    Computational Management Science, 2009, 6 (1) : 25 - 40
  • [8] A novel feature selection scheme for high-dimensional data sets: four-Staged Feature Selection
    Pehlivanli, Ayca Cakmak
    JOURNAL OF APPLIED STATISTICS, 2016, 43 (06) : 1140 - 1154
  • [9] FEATURE SELECTION FOR HIGH-DIMENSIONAL DATA ANALYSIS
    Verleysen, Michel
    NCTA 2011: PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON NEURAL COMPUTATION THEORY AND APPLICATIONS, 2011, : IS23 - IS25
  • [10] Feature selection for high-dimensional data in astronomy
    Zheng, Hongwen
    Zhang, Yanxia
    ADVANCES IN SPACE RESEARCH, 2008, 41 (12) : 1960 - 1964