Feature Selection using Mutual Information for High-dimensional Data Sets

被引:0
|
作者
Nagpal, Arpita [1 ]
Gaur, Deepti [1 ]
Gaur, Seema [2 ]
机构
[1] ITM Univ, Dept Comp Sci, Gurgaon, India
[2] Banasthali Univ, Banasthali, Rajasthan, India
关键词
Correlation; feature selection; minimum spanning tree; data set;
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
To reduce the dimensionality of dataset, redundant and irrelevant features need to be segregated from multidimensional dataset. To remove these features, one of the feature selection techniques needs to be used. Here, a feature selection technique to remove irrelevant features has been used. Correlation measures based on the concept of mutual information has been adopted to calculate the degree of association between features. In this paper authors are proposing a new algorithm to segregate features from high dimensional data by visualizing relevant features in the form of graph as a dataset.
引用
收藏
页码:45 / 49
页数:5
相关论文
共 50 条
  • [1] Feature selection, mutual information, and the classification of high-dimensional patterns
    Bonev, Boyan
    Escolano, Francisco
    Cazorla, Miguel
    [J]. PATTERN ANALYSIS AND APPLICATIONS, 2008, 11 (3-4) : 309 - 319
  • [2] A Feature Subset Selection Method Based On High-Dimensional Mutual Information
    Zheng, Yun
    Kwoh, Chee Keong
    [J]. ENTROPY, 2011, 13 (04) : 860 - 901
  • [3] High-dimensional supervised feature selection via optimized kernel mutual information
    Bi, Ning
    Tan, Jun
    Lai, Jian-Huang
    Suen, Ching Y.
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2018, 108 : 81 - 95
  • [4] Feature selection, mutual information, and the classification of high-dimensional patternsApplications to image classification and microarray data analysis
    Boyan Bonev
    Francisco Escolano
    Miguel Cazorla
    [J]. Pattern Analysis and Applications, 2008, 11 : 309 - 319
  • [5] Band Selection for High-Dimensional Remote Sensing Data by Mutual Information
    Banit'ouagua, Ibtissam
    Kerroum, Mounir Ait
    Hammouch, Ahmed
    Aboutajdine, Driss
    [J]. 2016 INTERNATIONAL CONFERENCE ON ELECTRICAL AND INFORMATION TECHNOLOGIES (ICEIT), 2016, : 386 - 391
  • [6] Feature selection for high-dimensional data
    Destrero A.
    Mosci S.
    De Mol C.
    Verri A.
    Odone F.
    [J]. Computational Management Science, 2009, 6 (1) : 25 - 40
  • [7] Feature selection for high-dimensional data
    Bolón-Canedo V.
    Sánchez-Maroño N.
    Alonso-Betanzos A.
    [J]. Progress in Artificial Intelligence, 2016, 5 (2) : 65 - 75
  • [8] A novel feature selection scheme for high-dimensional data sets: four-Staged Feature Selection
    Pehlivanli, Ayca Cakmak
    [J]. JOURNAL OF APPLIED STATISTICS, 2016, 43 (06) : 1140 - 1154
  • [9] FEATURE SELECTION FOR HIGH-DIMENSIONAL DATA ANALYSIS
    Verleysen, Michel
    [J]. NCTA 2011: PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON NEURAL COMPUTATION THEORY AND APPLICATIONS, 2011, : IS23 - IS25
  • [10] Feature selection for high-dimensional imbalanced data
    Yin, Liuzhi
    Ge, Yong
    Xiao, Keli
    Wang, Xuehua
    Quan, Xiaojun
    [J]. NEUROCOMPUTING, 2013, 105 : 3 - 11