Missing value estimation for microarray data based on fuzzy C-means clustering

被引:0
|
作者
Luo, JiaWei [1 ]
Yang, Tao [1 ]
Wang, Yan [1 ]
机构
[1] Hunan Univ, Sch Comp & Commun, Changsha 410082, Peoples R China
来源
Eighth International Conference on High-Performance Computing in Asia-Pacific Region, Proceedings | 2005年
关键词
microarray data; missing value estimation; fuzzy C-means; validity function;
D O I
暂无
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Microarray experiments can generate data sets with multiple missing expression values, normally due to various experimental problems. Unfortunately, many algorithms for gene expression analysis require a complete matrix of gene array values as input. Effective missing value estimation methods are needed, therefore, to minimize the effect of incomplete data sets on analysis, and to increase the range of data sets to which these algorithms can be applied In this paper, a new imputation method (FCMimpute) based on the fuzzy C-means clustering algorithm is proposed to estimate missing values in microarray data, which utilizes information in the cluster structures. This imputes the missing value by the attribute over all cluster centers obtained through fuzzy C-means clustering algorithm applicable to incomplete data. We compare the estimation accuracy of our method with the widely used KNNimpute and another SKNNimpute method on various microarray data sets with different percentage of missing entries. In our experiments, the proposed FCMimpute method shows better performance than other methods in terms of Root Means Square error.
引用
收藏
页码:611 / 616
页数:6
相关论文
共 50 条
  • [31] Online Classifiers Based on Fuzzy C-means Clustering
    Jedrzejowicz, Joanna
    Jedrzejowicz, Piotr
    COMPUTATIONAL COLLECTIVE INTELLIGENCE: TECHNOLOGIES AND APPLICATIONS, 2013, 8083 : 427 - 436
  • [32] Construction of EBRB classifier for imbalanced data based on Fuzzy C-Means clustering
    Fu, Yang-Geng
    Ye, Ji-Feng
    Yin, Ze-Feng
    Chen, Long-Jiang
    Wang, Ying-Ming
    Liu, Geng-Geng
    KNOWLEDGE-BASED SYSTEMS, 2021, 234
  • [33] Granular Fuzzy Possibilistic C-Means Clustering approach to DNA microarray problem
    Hung Quoc Truong
    Long Thanh Ngo
    Pedrycz, Witold
    KNOWLEDGE-BASED SYSTEMS, 2017, 133 : 53 - 65
  • [34] Educational data mining for students' performance based on fuzzy C-means clustering
    Li, Yu
    Gou, Jin
    Fan, Zongwen
    JOURNAL OF ENGINEERING-JOE, 2019, 2019 (11): : 8245 - 8250
  • [35] Agreement-based fuzzy C-means for clustering data with blocks of features
    Izakian, Hesam
    Pedrycz, Witold
    NEUROCOMPUTING, 2014, 127 : 266 - 280
  • [36] Microarray Time-Series Data Clustering Using Rough-Fuzzy C-Means Algorithm
    Maji, Pradipta
    Paul, Sushmita
    2011 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE (BIBM 2011), 2011, : 269 - 272
  • [37] Ant Colony Based Fuzzy C-Means Clustering for Very Large Data
    Mullick, Dhruv
    Garg, Ayush
    Bajaj, Arpit
    Garg, Ayush
    Aggarwal, Swati
    ADVANCES IN FUZZY LOGIC AND TECHNOLOGY 2017, VOL 2, 2018, 642 : 578 - 591
  • [38] DATA CLUSTERING BASED ON FUZZY C-MEANS AND CHAOTIC WHALE OPTIMIZATION ALGORITHMS
    Arslan, Hatice
    Toz, Metin
    SIGMA JOURNAL OF ENGINEERING AND NATURAL SCIENCES-SIGMA MUHENDISLIK VE FEN BILIMLERI DERGISI, 2019, 37 (04): : 1103 - 1124
  • [39] Mixed fuzzy C-means clustering
    Demirhan, Haydar
    INFORMATION SCIENCES, 2025, 690
  • [40] Generalized Ordered Intuitionistic Fuzzy C-Means Clustering Algorithm Based on PROMETHEE and Intuitionistic Fuzzy C-Means
    Bashir, Muhammad Adnan
    Rashid, Tabasam
    Bashir, Muhammad Salman
    INTERNATIONAL JOURNAL OF INTELLIGENT SYSTEMS, 2023, 2023