K-walks: clustering gene-expression data using a K-means clustering algorithm optimised by random walks

被引：3

作者：

Yao, Min ^{[1
]}

Wu, Qinghua ^{[2
]}

Li, Juan ^{[3
]}

Huang, Tinghua ^{[1
]}

机构：

[1] Yangtze Univ, Coll Anim Sci, Jingzhou 434025, Hubei, Peoples R China

[2] Yangtze Univ, Coll Life Sci, Jingzhou 434025, Hubei, Peoples R China

[3] Xiangtan Univ, Coll Chem, Xiangtan 411105, Hunan, Peoples R China

来源：

INTERNATIONAL JOURNAL OF DATA MINING AND BIOINFORMATICS | 2016年 / 16卷 / 02期

基金：

中国国家自然科学基金;

关键词：

gene expression; K-means; random walks; DISCOVERY; TISSUES;

D O I：

10.1504/IJDMB.2016.080039

中图分类号：

Q [生物科学];

学科分类号：

07 ; 0710 ; 09 ;

摘要：

Gene-expression data obtained from the biological experiments always have thousands of dimensions, which can be very confusing and perplexing to biologists when viewed as a whole. Clustering analysis is an explorative data-mining technique for statistical data analysis that is widely used in gene-expression data analysis. Practical approaches employed for solving the clustering problem use iterative procedures such as K-means, which typically converge to one of many local minima. Here, we propose a simulated annealing approximation algorithm that is optimised using random walks to solve the K-means clustering problem. The algorithm is verified with synthetic and real-world data sets and compared with other well-known K-means variants. The new algorithm is less sensitive to initial cluster centres, and the primary strength of our algorithm is its ability to produce high-quality clustering results for thousands of high-dimensional data. However, the algorithm is computationally intensive.

引用

页码：121 / 140

页数：20

共 50 条

[1] A genetic K-means clustering algorithm applied to gene expression data
Wu, FX
Zhang, WJ
Kusalik, AJ
ADVANCES IN ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 2003, 2671 : 520 - 526
[2] Soil data clustering by using K-means and fuzzy K-means algorithm
Hot, Elma
Popovic-Bugarin, Vesna
2015 23RD TELECOMMUNICATIONS FORUM TELFOR (TELFOR), 2015, : 890 - 893
[3] IMPROVEMENT IN K-MEANS CLUSTERING ALGORITHM FOR DATA CLUSTERING
Rajeswari, K.
Acharya, Omkar
Sharma, Mayur
Kopnar, Mahesh
Karandikar, Kiran
1ST INTERNATIONAL CONFERENCE ON COMPUTING COMMUNICATION CONTROL AND AUTOMATION ICCUBEA 2015, 2015, : 367 - 369
[4] Research on k-means Clustering Algorithm An Improved k-means Clustering Algorithm
Shi Na
Liu Xumin
Guan Yong
2010 THIRD INTERNATIONAL SYMPOSIUM ON INTELLIGENT INFORMATION TECHNOLOGY AND SECURITY INFORMATICS (IITSI 2010), 2010, : 63 - 67
[5] Clustering gene expression data using self-organizing maps and k-means clustering
Yano, N
Kotani, A
SICE 2003 ANNUAL CONFERENCE, VOLS 1-3, 2003, : 3211 - 3215
[6] Clustering Data in Power Management System Using k-Means Clustering Algorithm
Aryani, Ressy
Nasrun, Muhammad
Setianingsih, Casi
Murti, Muhammad Ary
2019 IEEE ASIA PACIFIC CONFERENCE ON WIRELESS AND MOBILE (APWIMOB), 2019, : 164 - 170
[7] On K-means Data Clustering Algorithm with Genetic Algorithm
Kapil, Shruti
Chawla, Meenu
Ansari, Mohd Dilshad
2016 FOURTH INTERNATIONAL CONFERENCE ON PARALLEL, DISTRIBUTED AND GRID COMPUTING (PDGC), 2016, : 202 - 206
[8] Clustering of Image Data Using K-Means and Fuzzy K-Means
Rahmani, Md. Khalid Imam
Pal, Naina
Arora, Kamiya
INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2014, 5 (07) : 160 - 163
[9] Using K-Means Clustering Algorithm for Handling Data Precision
Suganthi, P.
Kala, K.
Balasubramanian, C.
2016 INTERNATIONAL CONFERENCE ON COMPUTING TECHNOLOGIES AND INTELLIGENT DATA ENGINEERING (ICCTIDE'16), 2016,
[10] NEW ALGORITHM FOR CLUSTERING DISTRIBUTED DATA USING K-MEANS
Khedr, Ahmed M.
Bhatnagar, Raj K.
COMPUTING AND INFORMATICS, 2014, 33 (04) : 943 - 964

← 1 2 3 4 5 →