Gene Selection for Cancer Clustering Analysis Based on Expression Data

被引:0
|
作者
Xu, Taosheng [1 ,2 ]
Su, Ning [1 ,2 ]
Wang, Rujing [2 ]
Song, Liangtu [2 ]
机构
[1] Univ Sci & Technol China, Dept Automat, Hefei, Peoples R China
[2] Chinese Acad Sci, Inst Intelligent Machines, Hefei, Peoples R China
关键词
gene selection; clustering analysis; proportional hazard model; data mining; CLASS DISCOVERY; BREAST;
D O I
暂无
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Although the genomics data are accumulated in an exponential growth, the molecular complexity of cancer is still hard to understand. The most remarkable characteristics of the genomic data are severely high-dimensional features with a small number of samples, such as gene expression data. The traditional data mining method has a limited ability to process these asymmetry datasets. In order to select the key genes from the high-dimensional gene expression data for cancer clustering analysis, gene selection based on proportional hazard model is applied in this paper. The proportional hazard model is a statistical approach used for survival risk analysis. The significant genes are selected for clustering analysis based on gene expression dataset. We demonstrate the effectiveness of this method on breast cancer and lung cancer. The experiments show a better cancer clustering result of separating of samples into distinct subclasses.
引用
收藏
页码:516 / 519
页数:4
相关论文
共 50 条
  • [1] Spatial clustering based gene selection for gene expression analysis in microarray data classification
    Dhas, P. Edwin
    Lalitha, S.
    Govindaraj, Annalakshmi
    Jyoshna, B.
    [J]. AUTOMATIKA, 2024, 65 (01) : 152 - 158
  • [2] PSO Based Feature Selection for Clustering Gene Expression Data
    Deepthi, P. S.
    Thampi, Sabu M.
    [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING, INFORMATICS, COMMUNICATION AND ENERGY SYSTEMS (SPICES), 2015,
  • [3] A kernel-based clustering method for gene selection with gene expression data
    Chen, Huihui
    Zhang, Yusen
    Gutman, Ivan
    [J]. JOURNAL OF BIOMEDICAL INFORMATICS, 2016, 62 : 12 - 20
  • [4] Feature selection and gene clustering from gene expression data
    Mitra, P
    Majumder, DD
    [J]. PROCEEDINGS OF THE 17TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOL 2, 2004, : 343 - 346
  • [5] Clustering analysis for gene expression data
    Chen, YD
    Ermolaeva, O
    Bittner, M
    Meltzer, P
    Trent, J
    Dougherty, ER
    Batman, S
    [J]. ADVANCES IN FLUORESCENCE SENSING TECHNOLOGY IV, PROCEEDINGS OF, 1999, 3602 : 422 - 428
  • [6] On the selection of appropriate distances for gene expression data clustering
    Pablo A Jaskowiak
    Ricardo JGB Campello
    Ivan G Costa
    [J]. BMC Bioinformatics, 15
  • [7] Informative Feature Clustering and Selection for Gene Expression Data
    Yang, Yuqi
    Yin, Pengshuai
    Luo, Zhihang
    Gu, Wenwen
    Chen, Renjie
    Wu, Qingyao
    [J]. IEEE ACCESS, 2019, 7 : 169174 - 169184
  • [8] On the selection of appropriate distances for gene expression data clustering
    Jaskowiak, Pablo A.
    Campello, Ricardo J. G. B.
    Costa, Ivan G.
    [J]. BMC BIOINFORMATICS, 2014, 15
  • [9] Initial points selection for clustering gene expression data: A spatial contiguity analysis-based approach
    Yi, Hui
    Bo, Cuimei
    Song, Xiaofeng
    Yuan, Yuhao
    [J]. BIO-MEDICAL MATERIALS AND ENGINEERING, 2014, 24 (06) : 3709 - 3717
  • [10] Machine Learning Clustering for Cancer Analysis Employing Gene Expression Data
    Ospino, Camilo Andres Perez
    Rivera, Jorman Arbey Castro
    Orjuela-Canon, Alvaro D.
    [J]. 2023 IEEE COLOMBIAN CONFERENCE ON APPLICATIONS OF COMPUTATIONAL INTELLIGENCE, COLCACI, 2023,