Benchmarking Classification Models for Cancer Prediction from Gene Expression Data: A Novel Approach and New Findings

被引:0
|
作者
Ramani, R. Geetha [1 ]
Jacob, Shomona Gracia [1 ]
机构
[1] Anna Univ, Madras 600025, Tamil Nadu, India
来源
STUDIES IN INFORMATICS AND CONTROL | 2013年 / 22卷 / 02期
关键词
Cancer prediction; Gene Expression; Feature Relevance; Multi-class classification; MICROARRAY DATA; SELECTION;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Gene Selection from gene expression data for Cancer prediction has been an area of intensive research, aiming at identifying the minimal and optimal set of candidate genes that could generate accurate predictive performance. The two major problems encountered in this process are the high dimensionality of data with comparatively few instances and the need to categorize records under multiple classes. In this paper we propose a novel approach called Rank-Weight Feature Selection that utilizes the filtering capacity of more than one feature selection algorithm to detect the minimal set of predictive genes that generate higher predictor performance in categorizing and predicting diverse oncogenic gene expression data. The filtered features (genes) are weighted based on the number of feature relevance algorithms reporting them to be significant. The ranked genes are then used to validate the proposed method by utilizing ten classifiers over five diverse gene expression datasets. The results proved that the proposed approach generated higher predictive performance with fewer features than previously reported results with the most relevant and minimal set of genes and commend classifiers based on their accuracy and reliability in predicting cancer data.
引用
收藏
页码:133 / 142
页数:10
相关论文
共 50 条
  • [21] A topological approach for cancer subtyping from gene expression data
    Rafique, Omar
    Mir, A. H.
    JOURNAL OF BIOMEDICAL INFORMATICS, 2020, 102
  • [22] Lung cancer prediction from microarray data by gene expression programming
    Azzawi, Hasseeb
    Hou, Jingyu
    Xiang, Yong
    Alanni, Russul
    IET SYSTEMS BIOLOGY, 2016, 10 (05) : 168 - 178
  • [23] Prediction of homologous recombination deficiency from cancer gene expression data
    Kang, Jun
    Lee, Jieun
    Lee, Ahwon
    Lee, Youn Soo
    JOURNAL OF INTERNATIONAL MEDICAL RESEARCH, 2022, 50 (11)
  • [24] A novel ensemble approach for cancer data classification
    Zhao, Yaou
    Chen, Yuehui
    Zhang, Xueqin
    ADVANCES IN NEURAL NETWORKS - ISNN 2007, PT 2, PROCEEDINGS, 2007, 4492 : 1211 - +
  • [25] Benchmarking the translational potential of spatial gene expression prediction from histology
    Wang, Chuhan
    Chan, Adam S.
    Fu, Xiaohang
    Ghazanfar, Shila
    Kim, Jinman
    Patrick, Ellis
    Yang, Jean Y. H.
    NATURE COMMUNICATIONS, 2025, 16 (01)
  • [26] An efficient approach for classification of gene expression microarray data
    Sreepada, Rama Syamala
    Vipsita, Swati
    Mohapatra, Puspanjali
    2014 FOURTH INTERNATIONAL CONFERENCE OF EMERGING APPLICATIONS OF INFORMATION TECHNOLOGY (EAIT), 2014, : 344 - 348
  • [27] A novel approach to generate robust predictive developmental toxicity classification models from highly unbalanced dataset and benchmarking with the reported models
    Patel, Nikunjkumar K.
    Gunturi, Sitarama B.
    Narayanan, Ramamurthi
    DRUG METABOLISM REVIEWS, 2011, 43 : 131 - 132
  • [28] Selecting a classification function for class prediction with gene expression data
    Jong, Victor L.
    Novianti, Putri W.
    Roes, Kit C. B.
    Eijkemans, Marinus J. C.
    BIOINFORMATICS, 2016, 32 (12) : 1814 - 1822
  • [29] Investigating a Breast Cancer Gene Expression Data Using a Novel Clustering Approach
    Naeni, L. M.
    Salehipour, A.
    2019 IEEE INTERNATIONAL CONFERENCE ON INDUSTRIAL ENGINEERING AND ENGINEERING MANAGEMENT (IEEM), 2019, : 1038 - 1042
  • [30] Mining gene expression data using a novel approach based on hidden Markov models
    Ji, XL
    Li-Ling, J
    Sun, Z
    FEBS LETTERS, 2003, 542 (1-3) : 125 - 131