Variance-based Feature Selection for Classification of Cancer Subtypes Using Gene Expression Data

被引:0
|
作者
Roberts, Aedan G. K. [1 ]
Catchpoole, Daniel R. [2 ]
Kennedy, Paul J. [1 ]
机构
[1] Univ Technol Sydney, Fac Engn & Informat Technol, Ctr Artificial Intelligence, Broadway, NSW 2007, Australia
[2] Childrens Hosp Westmead, Childrens Canc Res Unit, Tumour Bank, Westmead, NSW 2145, Australia
关键词
ACUTE LYMPHOBLASTIC-LEUKEMIA; VARIABILITY; PERFORMANCE; OMNIBUS; GEO;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Classification in cancer has traditionally relied on feature selection by differential expression as a first step, where genes are selected according to the strength of evidence for a consistent difference in expression level between classes. However, recent work has shown that many genes also differ in the variance of their gene expression between disease states, and in particular between cancers of different types, prognosis, or stages of development. Features selected based on increased variance in cancer or differences in variance between tumours of differing prognosis have been used to successfully predict tumour progression or prognosis within the same cancer type, and to classify cancer subtypes in cases where there is an overall increase in variance in one class over the other. Here, we apply feature selection by differential variance to the more general problem of classification of cancer subtypes. We show that classifiers using features selected by differential variance are able to distinguish between clinically relevant cancer subtypes, that these classifiers perform as well as classifiers based on features selected by differential expression, and that combining the two approaches often gives better classification results than either feature selection method alone.
引用
收藏
页数:8
相关论文
共 50 条
  • [1] Feature Selection and Classification in gene expression cancer data
    Pavithra, D.
    Lakshmanan, B.
    [J]. 2017 INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE IN DATA SCIENCE (ICCIDS), 2017,
  • [2] An Integrated Feature Selection Algorithm for Cancer Classification using Gene Expression Data
    Ahmed, Saeed
    Kabir, Muhammad
    Ali, Zakir
    Arif, Muhammad
    Ali, Farman
    Yu, Dong-Jun
    [J]. COMBINATORIAL CHEMISTRY & HIGH THROUGHPUT SCREENING, 2018, 21 (09) : 631 - 645
  • [3] Feature Selection of Gene Expression Data for Cancer Classification: A Review
    Singh, Rabindra Kumar
    Sivabalakrishnan, M.
    [J]. BIG DATA, CLOUD AND COMPUTING CHALLENGES, 2015, 50 : 52 - 57
  • [4] Gene Expression Data Clustering Using Variance-based Harmony Search Algorithm
    Kumar, Vijay
    Kumar, Dinesh
    [J]. IETE JOURNAL OF RESEARCH, 2019, 65 (05) : 641 - 652
  • [5] Gene expression data classification using genetic algorithm-based feature selection
    Sonmez, Oznur Sinem
    Dagtekin, Mustafa
    Ensari, Tolga
    [J]. TURKISH JOURNAL OF ELECTRICAL ENGINEERING AND COMPUTER SCIENCES, 2021, 29 (07) : 3165 - 3179
  • [6] Feature Selection and Classification for Gene Expression Data using Evolutionary Computation
    Banka, Haider
    Dara, Suresh
    [J]. 2012 23RD INTERNATIONAL WORKSHOP ON DATABASE AND EXPERT SYSTEMS APPLICATIONS (DEXA), 2012, : 185 - 189
  • [7] Feature selection of gene expression data for Cancer classification using double RBF-kernels
    Liu, Shenghui
    Xu, Chunrui
    Zhang, Yusen
    Liu, Jiaguo
    Yu, Bin
    Liu, Xiaoping
    Dehmer, Matthias
    [J]. BMC BIOINFORMATICS, 2018, 19
  • [8] Feature selection of gene expression data for Cancer classification using double RBF-kernels
    Shenghui Liu
    Chunrui Xu
    Yusen Zhang
    Jiaguo Liu
    Bin Yu
    Xiaoping Liu
    Matthias Dehmer
    [J]. BMC Bioinformatics, 19
  • [9] Classification of Gene Expression Data Using Feature Selection Based on Type Combination Approach Model With Advanced Feature Selection Technology
    Siddesh, G. M.
    Gururaj, T.
    [J]. INTERNATIONAL JOURNAL OF COGNITIVE INFORMATICS AND NATURAL INTELLIGENCE, 2021, 15 (04)
  • [10] Semi-supervised SVM-based Feature Selection for Cancer Classification using Microarray Gene Expression Data
    Ang, Jun Chin
    Haron, Habibollah
    Hamed, Haza Nuzly Abdull
    [J]. CURRENT APPROACHES IN APPLIED ARTIFICIAL INTELLIGENCE, 2015, 9101 : 468 - 477