K-walks: clustering gene-expression data using a K-means clustering algorithm optimised by random walks

被引:3
|
作者
Yao, Min [1 ]
Wu, Qinghua [2 ]
Li, Juan [3 ]
Huang, Tinghua [1 ]
机构
[1] Yangtze Univ, Coll Anim Sci, Jingzhou 434025, Hubei, Peoples R China
[2] Yangtze Univ, Coll Life Sci, Jingzhou 434025, Hubei, Peoples R China
[3] Xiangtan Univ, Coll Chem, Xiangtan 411105, Hunan, Peoples R China
基金
中国国家自然科学基金;
关键词
gene expression; K-means; random walks; DISCOVERY; TISSUES;
D O I
10.1504/IJDMB.2016.080039
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Gene-expression data obtained from the biological experiments always have thousands of dimensions, which can be very confusing and perplexing to biologists when viewed as a whole. Clustering analysis is an explorative data-mining technique for statistical data analysis that is widely used in gene-expression data analysis. Practical approaches employed for solving the clustering problem use iterative procedures such as K-means, which typically converge to one of many local minima. Here, we propose a simulated annealing approximation algorithm that is optimised using random walks to solve the K-means clustering problem. The algorithm is verified with synthetic and real-world data sets and compared with other well-known K-means variants. The new algorithm is less sensitive to initial cluster centres, and the primary strength of our algorithm is its ability to produce high-quality clustering results for thousands of high-dimensional data. However, the algorithm is computationally intensive.
引用
收藏
页码:121 / 140
页数:20
相关论文
共 50 条
  • [1] A genetic K-means clustering algorithm applied to gene expression data
    Wu, FX
    Zhang, WJ
    Kusalik, AJ
    ADVANCES IN ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 2003, 2671 : 520 - 526
  • [2] Soil data clustering by using K-means and fuzzy K-means algorithm
    Hot, Elma
    Popovic-Bugarin, Vesna
    2015 23RD TELECOMMUNICATIONS FORUM TELFOR (TELFOR), 2015, : 890 - 893
  • [3] IMPROVEMENT IN K-MEANS CLUSTERING ALGORITHM FOR DATA CLUSTERING
    Rajeswari, K.
    Acharya, Omkar
    Sharma, Mayur
    Kopnar, Mahesh
    Karandikar, Kiran
    1ST INTERNATIONAL CONFERENCE ON COMPUTING COMMUNICATION CONTROL AND AUTOMATION ICCUBEA 2015, 2015, : 367 - 369
  • [4] Research on k-means Clustering Algorithm An Improved k-means Clustering Algorithm
    Shi Na
    Liu Xumin
    Guan Yong
    2010 THIRD INTERNATIONAL SYMPOSIUM ON INTELLIGENT INFORMATION TECHNOLOGY AND SECURITY INFORMATICS (IITSI 2010), 2010, : 63 - 67
  • [5] Clustering gene expression data using self-organizing maps and k-means clustering
    Yano, N
    Kotani, A
    SICE 2003 ANNUAL CONFERENCE, VOLS 1-3, 2003, : 3211 - 3215
  • [6] Clustering Data in Power Management System Using k-Means Clustering Algorithm
    Aryani, Ressy
    Nasrun, Muhammad
    Setianingsih, Casi
    Murti, Muhammad Ary
    2019 IEEE ASIA PACIFIC CONFERENCE ON WIRELESS AND MOBILE (APWIMOB), 2019, : 164 - 170
  • [7] On K-means Data Clustering Algorithm with Genetic Algorithm
    Kapil, Shruti
    Chawla, Meenu
    Ansari, Mohd Dilshad
    2016 FOURTH INTERNATIONAL CONFERENCE ON PARALLEL, DISTRIBUTED AND GRID COMPUTING (PDGC), 2016, : 202 - 206
  • [8] Clustering of Image Data Using K-Means and Fuzzy K-Means
    Rahmani, Md. Khalid Imam
    Pal, Naina
    Arora, Kamiya
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2014, 5 (07) : 160 - 163
  • [9] Using K-Means Clustering Algorithm for Handling Data Precision
    Suganthi, P.
    Kala, K.
    Balasubramanian, C.
    2016 INTERNATIONAL CONFERENCE ON COMPUTING TECHNOLOGIES AND INTELLIGENT DATA ENGINEERING (ICCTIDE'16), 2016,
  • [10] NEW ALGORITHM FOR CLUSTERING DISTRIBUTED DATA USING K-MEANS
    Khedr, Ahmed M.
    Bhatnagar, Raj K.
    COMPUTING AND INFORMATICS, 2014, 33 (04) : 943 - 964