Feature and Subfeature Selection for Classification Using Correlation Coefficient and Fuzzy Model

被引:14
|
作者
Bhuyan, Hemanta Kumar [1 ]
Chakraborty, Chinmay [2 ]
Pani, Subhendu Kumar [3 ]
Ravi, Vinayakumar [4 ]
机构
[1] Technol & Res Deemed Univ, Vignans Fdn Sci, Dept Informat Technol, Vejendla 522213, Andhra Pradesh, India
[2] Birla Inst Technol Mesra, Elect & Commun Engn, Jharhand 835215, India
[3] Biju Patnaik Univ Technol, Orissa Engn Coll, Dept Comp Sci & Engn, Rourkela 769004, Odisha, India
[4] Prince Mohammad Bin Fahd Univ, Ctr Artificial Intelligence, Khobar 34754, Saudi Arabia
关键词
Feature extraction; Correlation; Redundancy; Databases; Data models; Data mining; Task analysis; Classification; correlation coefficient; data mining; feature selection; fuzzy model; UNSUPERVISED FEATURE-SELECTION; SUB-FEATURE SELECTION;
D O I
10.1109/TEM.2021.3065699
中图分类号
F [经济];
学科分类号
02 ;
摘要
This article presents an analysis of data extraction for classification using correlation coefficient and fuzzy model. Several traditional methods of data extraction are used for classification that could not provide sufficient information for further step of data analysis on class. It needs refinement of features data to distinguish a class that differs from a traditional class. Thus, it proposes the feature tiny data (subfeature data) to find distinguish class from a traditional class using two methods such as correlation coefficient and fuzzy model to select features as well as subfeature for distinguishing class. In the first approach, the correlation coefficient methods with gradient descent technique are used to select features from the dataset and in the second approach, the fuzzy model with supreme of minimum value is considered to get subfeature data. As per the proposed model, some features (i.e., three features from the acoustic dataset, two features from the QCM dataset, and eight features from the audit dataset, etc.) and subfeatures (as per threshold value like 20 for acoustic; 10 for QCM, and 20 for audit, etc.) are selected based on correlation coefficient as well as fuzzy methods, respectively. Further, the probability approach is used to find the association and availability of subfeature data from the dimensional reduced database. The experimental results show the proposed framework identifies and selects both feature and subfeature data with the effectiveness of the new class. The comparison results of several classifiers on several datasets are explained in the experimental section.
引用
收藏
页码:1655 / 1669
页数:15
相关论文
共 50 条
  • [21] Grooming Detection using Fuzzy-Rough Feature Selection and Text Classification
    Zuo, Zheming
    Li, Jie
    Anderson, Philip
    Yang, Longzhi
    Naik, Nitin
    2018 IEEE INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS (FUZZ-IEEE), 2018,
  • [22] Classification of biomedical spectra using fuzzy interquartile encoding and stochastic feature selection
    Pizzi, Nick J.
    Alexiuk, Mark D.
    Pedrycz, Witold
    2007 IEEE SYMPOSIUM ON COMPUTATIONAL INTELLIGENCE AND DATA MINING, VOLS 1 AND 2, 2007, : 668 - 673
  • [23] Aggregating multiple classification results using fuzzy integration and stochastic feature selection
    Pizzi, Nick J.
    Pedrycz, Witold
    INTERNATIONAL JOURNAL OF APPROXIMATE REASONING, 2010, 51 (08) : 883 - 894
  • [24] A new filter feature selection algorithm for classification task by ensembling pearson correlation coefficient and mutual information
    Gong, Huanhuan
    Li, Yanying
    Zhang, Jiaoni
    Zhang, Baoshuang
    Wang, Xialin
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2024, 131
  • [25] Feature selection based on correlation between fuzzy features and optimal fuzzy-valued feature subset selection
    Li, Jirong
    2008 FOURTH INTERNATIONAL CONFERENCE ON INTELLIGENT INFORMATION HIDING AND MULTIMEDIA SIGNAL PROCESSING, PROCEEDINGS, 2008, : 775 - 778
  • [26] Model and feature selection in microarray classification
    Peterson, DA
    Thaut, MH
    PROCEEDINGS OF THE 2004 IEEE SYMPOSIUM ON COMPUTATIONAL INTELLIGENCE IN BIOINFORMATICS AND COMPUTATIONAL BIOLOGY, 2004, : 56 - 60
  • [27] Feature selection for classification with Spearman's rank correlation coefficient-based self-information in divergence-based fuzzy rough sets
    Jiang, Jiefang
    Zhang, Xianyong
    Yuan, Zhong
    EXPERT SYSTEMS WITH APPLICATIONS, 2024, 249
  • [28] Ensemble Method Using Correlation Based Feature Selection with Stratified Sampling for Classification
    Meshram, Shweta B.
    Shinde, Sharmila M.
    PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON DATA ENGINEERING AND COMMUNICATION TECHNOLOGY, ICDECT 2016, VOL 1, 2017, 468 : 47 - 55
  • [29] Fuzzy-Rough Feature Selection for Mammogram Classification
    R.Roselin
    K.Thangavel
    C.Velayutham
    Journal of Electronic Science and Technology, 2011, 9 (02) : 124 - 132
  • [30] A fuzzy classification based on feature selection for web pages
    Zhang, MY
    Lu, ZD
    IEEE/WIC/ACM INTERNATIONAL CONFERENCE ON WEB INTELLIGENCE (WI 2004), PROCEEDINGS, 2004, : 469 - 472