Feature and Subfeature Selection for Classification Using Correlation Coefficient and Fuzzy Model

被引:14
|
作者
Bhuyan, Hemanta Kumar [1 ]
Chakraborty, Chinmay [2 ]
Pani, Subhendu Kumar [3 ]
Ravi, Vinayakumar [4 ]
机构
[1] Technol & Res Deemed Univ, Vignans Fdn Sci, Dept Informat Technol, Vejendla 522213, Andhra Pradesh, India
[2] Birla Inst Technol Mesra, Elect & Commun Engn, Jharhand 835215, India
[3] Biju Patnaik Univ Technol, Orissa Engn Coll, Dept Comp Sci & Engn, Rourkela 769004, Odisha, India
[4] Prince Mohammad Bin Fahd Univ, Ctr Artificial Intelligence, Khobar 34754, Saudi Arabia
关键词
Feature extraction; Correlation; Redundancy; Databases; Data models; Data mining; Task analysis; Classification; correlation coefficient; data mining; feature selection; fuzzy model; UNSUPERVISED FEATURE-SELECTION; SUB-FEATURE SELECTION;
D O I
10.1109/TEM.2021.3065699
中图分类号
F [经济];
学科分类号
02 ;
摘要
This article presents an analysis of data extraction for classification using correlation coefficient and fuzzy model. Several traditional methods of data extraction are used for classification that could not provide sufficient information for further step of data analysis on class. It needs refinement of features data to distinguish a class that differs from a traditional class. Thus, it proposes the feature tiny data (subfeature data) to find distinguish class from a traditional class using two methods such as correlation coefficient and fuzzy model to select features as well as subfeature for distinguishing class. In the first approach, the correlation coefficient methods with gradient descent technique are used to select features from the dataset and in the second approach, the fuzzy model with supreme of minimum value is considered to get subfeature data. As per the proposed model, some features (i.e., three features from the acoustic dataset, two features from the QCM dataset, and eight features from the audit dataset, etc.) and subfeatures (as per threshold value like 20 for acoustic; 10 for QCM, and 20 for audit, etc.) are selected based on correlation coefficient as well as fuzzy methods, respectively. Further, the probability approach is used to find the association and availability of subfeature data from the dimensional reduced database. The experimental results show the proposed framework identifies and selects both feature and subfeature data with the effectiveness of the new class. The comparison results of several classifiers on several datasets are explained in the experimental section.
引用
收藏
页码:1655 / 1669
页数:15
相关论文
共 50 条
  • [1] Feature Genes Selection Using Supervised Locally Linear Embedding and Correlation Coefficient for Microarray Classification
    Xu, Jiucheng
    Mu, Huiyu
    Wang, Yun
    Huang, Fangzhou
    COMPUTATIONAL AND MATHEMATICAL METHODS IN MEDICINE, 2018, 2018
  • [2] Effective classification using feature selection and fuzzy integration
    Pizzi, Nick J.
    Pedrycz, Witold
    FUZZY SETS AND SYSTEMS, 2008, 159 (21) : 2859 - 2872
  • [3] Simultaneous Feature Selection and Classification Using Fuzzy Rules
    Kumar, D. Sai
    Rao, V. Madhusudan
    PROCEEDINGS OF THE 2018 SECOND INTERNATIONAL CONFERENCE ON INVENTIVE COMMUNICATION AND COMPUTATIONAL TECHNOLOGIES (ICICCT), 2018, : 125 - 130
  • [4] Fuzzy Classification in Ant Feature Selection
    Vieira, S. M.
    Sousa, J. M. C.
    Runkler, T. A.
    2008 IEEE INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS, VOLS 1-5, 2008, : 1764 - +
  • [5] Correlation Coefficient Based Feature Selection Framework Using Graph Construction
    Potharaju, Sai Prasad
    Sreedevi, Marriboyina
    GAZI UNIVERSITY JOURNAL OF SCIENCE, 2018, 31 (03): : 775 - 787
  • [6] Novel feature selection method for accurate breast cancer classification using Correlation coefficient and Modified GWO Algorithm
    Mezaghrani, Ali
    Debakla, Mohammed
    Djemal, Khalifa
    COMPUTER SCIENCE JOURNAL OF MOLDOVA, 2024, 32 (02) : 175 - 198
  • [7] Optimization of sEMG Classification Model Based on Correlation Analysis and Feature Selection
    Li, Zhengzhen
    Li, Ke
    Li, Jinping
    Wei, Na
    2020 5TH INTERNATIONAL CONFERENCE ON ADVANCED ROBOTICS AND MECHATRONICS (ICARM 2020), 2020, : 402 - 407
  • [8] Feature selection for classification of polar regions using a fuzzy expert system
    South Dakota Sch of Mines and, Technology, Rapid City, United States
    Remote Sens Environ, 1 (81-100):
  • [9] Cancer Classification using Fuzzy C-Means with Feature Selection
    Rachman, Arvan Aulia
    Rustam, Zuherman
    2016 12TH INTERNATIONAL CONFERENCE ON MATHEMATICS, STATISTICS, AND THEIR APPLICATIONS (ICMSA), 2016, : 31 - 34
  • [10] Feature selection for classification of polar regions using a fuzzy expert system
    Penaloza, MA
    Welch, RM
    REMOTE SENSING OF ENVIRONMENT, 1996, 58 (01) : 81 - 100