Benchmarking Classification Models for Cancer Prediction from Gene Expression Data: A Novel Approach and New Findings

被引:0
|
作者
Ramani, R. Geetha [1 ]
Jacob, Shomona Gracia [1 ]
机构
[1] Anna Univ, Madras 600025, Tamil Nadu, India
来源
STUDIES IN INFORMATICS AND CONTROL | 2013年 / 22卷 / 02期
关键词
Cancer prediction; Gene Expression; Feature Relevance; Multi-class classification; MICROARRAY DATA; SELECTION;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Gene Selection from gene expression data for Cancer prediction has been an area of intensive research, aiming at identifying the minimal and optimal set of candidate genes that could generate accurate predictive performance. The two major problems encountered in this process are the high dimensionality of data with comparatively few instances and the need to categorize records under multiple classes. In this paper we propose a novel approach called Rank-Weight Feature Selection that utilizes the filtering capacity of more than one feature selection algorithm to detect the minimal set of predictive genes that generate higher predictor performance in categorizing and predicting diverse oncogenic gene expression data. The filtered features (genes) are weighted based on the number of feature relevance algorithms reporting them to be significant. The ranked genes are then used to validate the proposed method by utilizing ten classifiers over five diverse gene expression datasets. The results proved that the proposed approach generated higher predictive performance with fewer features than previously reported results with the most relevant and minimal set of genes and commend classifiers based on their accuracy and reliability in predicting cancer data.
引用
收藏
页码:133 / 142
页数:10
相关论文
共 50 条
  • [31] A Novel Information Theoretic Approach to Gene Selection for Cancer Classification Using Microarray Data
    Naseem, Imran
    Togneri, Roberto
    Bennamoun, Mohammed
    CURRENT BIOINFORMATICS, 2015, 10 (04) : 431 - 440
  • [32] A genetic filter for cancer classification on gene expression data
    Kim, Yong-Hyuk
    Yoon, Yourim
    BIO-MEDICAL MATERIALS AND ENGINEERING, 2015, 26 : S1993 - S2002
  • [33] Feature Selection and Classification in gene expression cancer data
    Pavithra, D.
    Lakshmanan, B.
    2017 INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE IN DATA SCIENCE (ICCIDS), 2017,
  • [34] Classification of Cancer Types based on Gene Expression Data
    He, Yinchao
    Bockmon, Ryan
    Modey, Miracle
    Roscoe, Sarah
    2020 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE, 2020, : 2175 - 2182
  • [35] Event models for tumor classification with SAGE gene expression data
    Jin, Xin
    Xu, Anbang
    Zhao, Guoxing
    Ma, Jixin
    Bie, Rongfang
    COMPUTATIONAL SCIENCE - ICCS 2006, PT 2, PROCEEDINGS, 2006, 3992 : 775 - 782
  • [36] Bioinformatics Prediction and Machine Learning on Gene Expression Data Identifies Novel Gene Candidates in Gastric Cancer
    Kori, Medi
    Gov, Esra
    GENES, 2022, 13 (12)
  • [37] Disjoint PCA models for marker identification and classification of cancer types using gene expression data
    Bicciato, S
    Luchini, A
    Di Bello, C
    MINERVA BIOTECNOLOGICA, 2002, 14 (3-4) : 281 - 290
  • [38] A novel approach to feature extraction from classification models based on information gene pairs
    Li, J.
    Tang, X.
    Liu, J.
    Huang, J.
    Wang, Y.
    PATTERN RECOGNITION, 2008, 41 (06) : 1975 - 1984
  • [39] Gene selection from microarray data for cancer classification - a machine learning approach
    Wang, Y
    Tetko, IV
    Hall, MA
    Frank, E
    Facius, A
    Mayer, KFX
    Mewes, HW
    COMPUTATIONAL BIOLOGY AND CHEMISTRY, 2005, 29 (01) : 37 - 46
  • [40] Gene ranking from microarray data for cancer classification -: A machine learning approach
    Ruiz, Roberto
    Pontes, Beatriz
    Giraldez, Raul
    Aguilar-Ruiz, Jesus S.
    KNOWLEDGE-BASED INTELLIGENT INFORMATION AND ENGINEERING SYSTEMS, PT 2, PROCEEDINGS, 2006, 4252 : 1272 - 1280