A two stage grading approach for feature selection and classification of microarray data using Pareto based feature ranking techniques: A case study

被引:10
|
作者
Dash, Rasmita [1 ]
机构
[1] Siksha O Anusandhan Univ, Dept Comp Sc & Informat Technol, Bhubaneswar 751030, Odisha, India
关键词
Feature ranking technique; Statistical analysis; Pareto front; Multi-objective optimization; Classification technique; Microarray database; MULTIOBJECTIVE OPTIMIZATION; GENE SELECTION; CANCER; PREDICTION; TUMOR; ALGORITHMS; DISCOVERY; EFFICIENT; PATTERNS;
D O I
10.1016/j.jksuci.2017.08.005
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
High dimensional search space in microarray data with large number of genes and few dozen of samples increases the complexity of analysis of such databases. All the genes are not significant and hence informative genes are required to be extracted. So dimension reduction is necessary for this process. It is often found in literature that the ranking approaches are used for feature selection. Different ranking techniques may assign different rank to the same gene and the selection made based on these ranks may not be suitable for different problems. So use of one ranking technique may lead to rejection of some important genes and possibly selection of some insignificant genes. Such selection may degrade the performance of the classifier. To overcome this problem, here a bi-objective ranked based Pareto front technique is proposed. In this technique using two ranked based technique the Pareto optimal solution is generated with a set of features. For the experimental work, 21 models based on 7 feature ranking strategies are considered. Eight different microarray data are taken to find the suitable ranking combination for the work. A grading method is used to rank the models and statistical test is performed to validate the findings. (C) 2017 The Author. Production and hosting by Elsevier B.V.
引用
收藏
页码:232 / 247
页数:16
相关论文
共 50 条
  • [41] Feature selection and classification of urinary mRNA microarray data by iterative random forest to diagnose renal fibrosis: a two-stage study
    Zhou, Le-Ting
    Cao, Yu-Han
    Lv, Lin-Li
    Ma, Kun-Ling
    Chen, Ping-Sheng
    Ni, Hai-Feng
    Lei, Xiang-Dong
    Liu, Bi-Cheng
    SCIENTIFIC REPORTS, 2017, 7
  • [42] Classification of Gene Expression Data Using Feature Selection Based on Type Combination Approach Model With Advanced Feature Selection Technology
    Siddesh, G. M.
    Gururaj, T.
    INTERNATIONAL JOURNAL OF COGNITIVE INFORMATICS AND NATURAL INTELLIGENCE, 2021, 15 (04)
  • [43] Robust microarray data feature selection using a correntropy based distance metric learning approach
    Vahabzadeh, Venus
    Moattar, Mohammad Hossein
    COMPUTERS IN BIOLOGY AND MEDICINE, 2023, 161
  • [44] Two-Stage Feature Selection for Text Classification
    Ozgur, Levent
    Gungor, Tunga
    INFORMATION SCIENCES AND SYSTEMS 2015, 2016, 363 : 329 - 337
  • [45] Data Shrinking Based Feature Ranking for Protein Classification
    Dua, Sumeet
    Saini, Sheetal
    INFORMATION SYSTEMS, TECHNOLOGY AND MANAGEMENT-THIRD INTERNATIONAL CONFERENCE, ICISTM 2009, 2009, 31 : 54 - 63
  • [46] FEATURE SELECTION FOR MICROARRAY DATA USING PROBABILITY DISTANCES
    Korenblat, K.
    Volkovich, Z.
    JP JOURNAL OF BIOSTATISTICS, 2012, 7 (01) : 15 - 34
  • [47] Improve Abstract Data with Feature Selection for Classification Techniques
    Nuipian, Vatinee
    Meesad, Phayung
    Boonrawd, Pudsadee
    FUTURE INFORMATION TECHNOLOGY, 2011, 13 : 213 - 217
  • [48] Genetic algorithm-based feature selection with manifold learning for cancer classification using microarray data
    Wang, Zixuan
    Zhou, Yi
    Takagi, Tatsuya
    Song, Jiangning
    Tian, Yu-Shi
    Shibuya, Tetsuo
    BMC BIOINFORMATICS, 2023, 24 (01)
  • [49] Improve Abstract Data with Feature Selection for Classification Techniques
    Nuipian, Vatinee
    Meesad, Phayung
    Boonrawd, Pudsadee
    MEMS, NANO AND SMART SYSTEMS, PTS 1-6, 2012, 403-408 : 3699 - +
  • [50] Feature Selection and Classification of Microarray Data using MapReduce based ANOVA and K-Nearest Neighbor
    Kumar, Mukesh
    Rath, Nitish Kumar
    Swain, Amitav
    Rath, Santanu Kumar
    ELEVENTH INTERNATIONAL CONFERENCE ON COMMUNICATION NETWORKS, ICCN 2015/INDIA ELEVENTH INTERNATIONAL CONFERENCE ON DATA MINING AND WAREHOUSING, ICDMW 2015/NDIA ELEVENTH INTERNATIONAL CONFERENCE ON IMAGE AND SIGNAL PROCESSING, ICISP 2015, 2015, 54 : 301 - 310