Classification via Correlation-based Feature Grouping

被引:0
|
作者
Maleki, Mina [1 ]
Rueda, Luis [1 ]
机构
[1] Univ Windsor, Sch Comp Sci, 401 Sunset Ave, Windsor, ON N9B 3P4, Canada
关键词
REGRESSION SHRINKAGE; VARIABLE SELECTION; PREDICTION;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Employing the most relevant and discriminating features is very important to achieve a successful classification with low computational cost. Although, different feature selection methods have been recently developed for this purpose, feature grouping can deal with high dimensional sparse feature vectors more effectively, yielding better interpretation of the data. In this paper, a correlation-based feature grouping (CFG) method is proposed. First, the features are grouped based on the variety of their correlation scores, and then, a new representative feature vector is generated for each group by combining its features. To investigate the strength of CFG method, two filter methods of. 2 and correlation are employed for feature selection, while classification is performed using a support vector machine (SVM) and k-Nearest Neighbor (k-NN). The empirical study on two datasets of protein-protein interactions (PPIs) and breast cancer verifies that the idea of employing feature grouping is more efficient than employing feature selection in identifying a set of features that exhibit high classification accuracy. In addition, a CFG diagram is introduced in this paper, which is used to visualize the groups and their corresponding features found by the proposed method.
引用
收藏
页码:61 / 66
页数:6
相关论文
共 50 条
  • [1] Correlation-based Feature Ranking for Online Classification
    Osman, Hassab Elgawi
    [J]. 2009 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN AND CYBERNETICS (SMC 2009), VOLS 1-9, 2009, : 3077 - 3082
  • [2] Correlation-based feature selection strategy in neural classification
    Michalak, Krzysztof
    Kwasnicka, Halina
    [J]. ISDA 2006: SIXTH INTERNATIONAL CONFERENCE ON INTELLIGENT SYSTEMS DESIGN AND APPLICATIONS, VOL 1, 2006, : 741 - 746
  • [3] Correlation-based feature selection and classification via regression of segmented chromosomes using geometric features
    Tanvi Arora
    Renu Dhir
    [J]. Medical & Biological Engineering & Computing, 2017, 55 : 733 - 745
  • [4] Correlation-based feature selection and classification via regression of segmented chromosomes using geometric features
    Arora, Tanvi
    Dhir, Renu
    [J]. MEDICAL & BIOLOGICAL ENGINEERING & COMPUTING, 2017, 55 (05) : 733 - 745
  • [5] A canonical correlation-based feature extraction method for underwater target classification
    Pezeshki, A
    Azimi-Sadjadi, MR
    Scharf, LL
    Robinson, M
    [J]. OCEANS 2002 MTS/IEEE CONFERENCE & EXHIBITION, VOLS 1-4, CONFERENCE PROCEEDINGS, 2002, : 29 - 37
  • [6] A Correlation-Based Feature Selection and Classification Approach for Autism Spectrum Disorder
    Verma, Manvi
    Kumar, Dinesh
    [J]. INTERNATIONAL JOURNAL OF INFORMATION SYSTEM MODELING AND DESIGN, 2021, 12 (02) : 51 - 66
  • [7] Correlation-Based Feature Selection and Regression
    Cui, Yue
    Lin, Jesse S.
    Zhang, Shiliang
    Luo, Suhuai
    Tian, Qi
    [J]. ADVANCES IN MULTIMEDIA INFORMATION PROCESSING-PCM 2010, PT I, 2010, 6297 : 25 - +
  • [8] Pearson Correlation-Based Feature Selection for Document Classification Using Balanced Training
    Nasir, Inzamam Mashood
    Khan, Muhammad Attique
    Yasmin, Mussarat
    Shah, Jamal Hussain
    Gabryel, Marcin
    Scherer, Rafal
    Damasevicius, Robertas
    [J]. SENSORS, 2020, 20 (23) : 1 - 18
  • [9] Hybrid Classification Model of Correlation-based Feature Selection and Support Vector Machine
    Dubey, Vimal Kumar
    Saxena, Amit Kumar
    [J]. 2016 IEEE INTERNATIONAL CONFERENCE ON CURRENT TRENDS IN ADVANCED COMPUTING (ICCTAC), 2016,
  • [10] Distributed correlation-based feature selection in spark
    Palma-Mendoza, Raul Jose
    de-Marcos, Luis
    Rodriguez, Daniel
    Alonso-Betanzos, Amparo
    [J]. INFORMATION SCIENCES, 2019, 496 : 287 - 299