Initial points selection for clustering gene expression data: A spatial contiguity analysis-based approach

被引:1
|
作者
Yi, Hui [1 ,2 ]
Bo, Cuimei [1 ]
Song, Xiaofeng [2 ]
Yuan, Yuhao [1 ]
机构
[1] Nanjing Univ Technol, Coll Automat & Elect Engn, Nanjing 211816, Jiangsu, Peoples R China
[2] Nanjing Univ Aeronaut & Astronaut, Dept Biomed Engn, Nanjing 210016, Peoples R China
基金
美国国家科学基金会; 国家教育部博士点专项基金资助;
关键词
Gene expression data; k-means; initial points; spatial contiguity analysis; ALGORITHM;
D O I
10.3233/BME-141199
中图分类号
R318 [生物医学工程];
学科分类号
0831 ;
摘要
Clustering is considered one of the most powerful tools for analyzing gene expression data. Although clustering has been extensively studied, a problem remains significant: iterative techniques like k-means clustering are especially sensitive to initial starting conditions. An unreasonable selection of initial points leads to problems including local minima and massive computation. In this paper, a spatial contiguity analysis-based approach is proposed, aiming to solve this problem. It employs principal component analysis (PCA) to identify data points that are likely extracted from different clusters as initial points. This helps to avoid local minima, and accelerates the computation. The effectiveness of the proposed approach was validated on several benchmark datasets.
引用
收藏
页码:3709 / 3717
页数:9
相关论文
共 50 条
  • [1] Reducing the Subjectivity of Gene Expression Data Clustering Based on Spatial Contiguity Analysis
    Yi, Hui
    Song, Xiaofeng
    Jiang, Bin
    Liu, Yufang
    [J]. DATABASE THEORY AND APPLICATION, BIO-SCIENCE AND BIO-TECHNOLOGY, 2011, 258 : 118 - 124
  • [2] Spatial clustering based gene selection for gene expression analysis in microarray data classification
    Dhas, P. Edwin
    Lalitha, S.
    Govindaraj, Annalakshmi
    Jyoshna, B.
    [J]. AUTOMATIKA, 2024, 65 (01) : 152 - 158
  • [3] Gene Selection for Cancer Clustering Analysis Based on Expression Data
    Xu, Taosheng
    Su, Ning
    Wang, Rujing
    Song, Liangtu
    [J]. PROCEEDINGS OF 2015 4TH INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND NETWORK TECHNOLOGY (ICCSNT 2015), 2015, : 516 - 519
  • [4] A data envelopment analysis-based clustering approach under dynamic situations
    Kim, Nam Hyok
    He, Feng
    Zhang, Hongjie
    Hong, Kwon Ryong
    Ri, Kwang-Chol
    [J]. EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 2023, 311 (01) : 251 - 262
  • [5] PSO Based Feature Selection for Clustering Gene Expression Data
    Deepthi, P. S.
    Thampi, Sabu M.
    [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING, INFORMATICS, COMMUNICATION AND ENERGY SYSTEMS (SPICES), 2015,
  • [6] An Agent-Based Clustering Approach for Gene Selection in Gene Expression Microarray
    Ramos, Juan
    Castellanos-Garzon, Jose A.
    Gonzalez-Briones, Alfonso
    de Paz, Juan F.
    Corchado, Juan M.
    [J]. INTERDISCIPLINARY SCIENCES-COMPUTATIONAL LIFE SCIENCES, 2017, 9 (01) : 1 - 13
  • [7] An Agent-Based Clustering Approach for Gene Selection in Gene Expression Microarray
    Juan Ramos
    José A. Castellanos-Garzón
    Alfonso González-Briones
    Juan F. de Paz
    Juan M. Corchado
    [J]. Interdisciplinary Sciences: Computational Life Sciences, 2017, 9 : 1 - 13
  • [8] A kernel-based clustering method for gene selection with gene expression data
    Chen, Huihui
    Zhang, Yusen
    Gutman, Ivan
    [J]. JOURNAL OF BIOMEDICAL INFORMATICS, 2016, 62 : 12 - 20
  • [9] Robust Convex Clustering with Spectral Analysis-based Feature Selection
    Fu, Yitu
    Sun, Xiaodong
    Lan, Qing
    [J]. PROCEEDINGS OF THE 39TH CHINESE CONTROL CONFERENCE, 2020, : 6578 - 6583
  • [10] Min max kurtosis distance based improved initial centroid selection approach of K-means clustering for big data mining on gene expression data
    Kamlesh Kumar Pandey
    Diwakar Shukla
    [J]. Evolving Systems, 2023, 14 : 207 - 244