Benchmarking Classification Models for Cancer Prediction from Gene Expression Data: A Novel Approach and New Findings

Cited by: 0
Authors
Ramani, R. Geetha [1]
Jacob, Shomona Gracia [1]
Affiliations
[1] Anna Univ, Madras 600025, Tamil Nadu, India
Source
STUDIES IN INFORMATICS AND CONTROL | 2013 / Vol. 22 / Issue 02
Keywords
Cancer prediction; Gene Expression; Feature Relevance; Multi-class classification; MICROARRAY DATA; SELECTION;
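DOI
Not available
Chinese Library Classification number
TP [Automation technology, computer technology];
Discipline classification code
0812;
Abstract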
Gene selection from gene expression data for cancer prediction has been an area of intensive research, aimed at identifying the minimal and optimal set of candidate genes that yields accurate predictive performance. Two major problems arise in this process: the high dimensionality of the data relative to the small number of instances, and the need to categorize records into multiple classes. In this paper we propose a novel approach called Rank-Weight Feature Selection, which combines the filtering capacity of more than one feature selection algorithm to detect the minimal set of predictive genes that yields higher predictor performance in categorizing and predicting diverse oncogenic gene expression data. The filtered features (genes) are weighted by the number of feature relevance algorithms that report them as significant. The ranked genes are then used to validate the proposed method with ten classifiers over five diverse gene expression datasets. The results show that the proposed approach achieves higher predictive performance with fewer features than previously reported results, using the most relevant and minimal set of genes, and they allow classifiers to be recommended based on their accuracy and reliability in predicting cancer data.
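The weighting step described in the abstract (each gene is weighted by the number of feature relevance algorithms that report it as significant) can be illustrated with a small sketch. The abstract does not name the filtering algorithms used, so the two filters below (a variance filter and a two-class mean-gap filter), the function names, the `min_votes` parameter, and the toy data are all assumptions made for illustration, not the authors' method.

```python
from collections import Counter
from statistics import mean, pvariance


def top_k(scores, k):
    """Return the k feature names with the highest scores (ties broken by name)."""
    ranked = sorted(scores, key=lambda name: (-scores[name], name))
    return set(ranked[:k])


def variance_filter(data, k):
    """Hypothetical filter 1: keep the k genes with the largest variance."""
    return top_k({g: pvariance(vals) for g, vals in data.items()}, k)


def mean_gap_filter(data, labels, k):
    """Hypothetical filter 2: keep the k genes whose class means differ most.

    Simplified to two classes; the paper targets multi-class data.
    """
    first, second = sorted(set(labels))
    scores = {}
    for g, vals in data.items():
        a = [v for v, y in zip(vals, labels) if y == first]
        b = [v for v, y in zip(vals, labels) if y == second]
        scores[g] = abs(mean(a) - mean(b))
    return top_k(scores, k)


def rank_weight(selections, min_votes=1):
    """Weight each gene by how many filters selected it, then rank by weight."""
    votes = Counter(g for selected in selections for g in selected)
    kept = [(g, w) for g, w in votes.items() if w >= min_votes]
    return sorted(kept, key=lambda gw: (-gw[1], gw[0]))


# Toy expression matrix: gene -> one value per sample (made-up numbers).
data = {
    "geneA": [1.0, 1.2, 5.0, 5.1],
    "geneB": [2.0, 2.1, 3.0, 3.1],
    "geneC": [0.0, 4.0, 0.1, 4.2],
}
labels = ["normal", "normal", "tumor", "tumor"]

selections = [variance_filter(data, 2), mean_gap_filter(data, labels, 2)]
ranking = rank_weight(selections)
print(ranking)  # [('geneA', 2), ('geneB', 1), ('geneC', 1)]
```

In this toy case only geneA is selected by both filters, so it receives the highest weight; raising `min_votes` to 2 would keep geneA alone. The ranked genes would then be passed to the classifiers for validation, a step not sketched here.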
Pages: 133 - 142
Number of pages: 10