Verification of Improving a Clustering Algorithm for Microarray Data with Missing Values

Authors
Kim, SuYoung [1]
Affiliation
[1] Acad Korean Studies, Ctr Korean Studies Mat, Seongnam Si, Gyeonggi Do, South Korea
Keywords
Microarray; gene expression; clustering; missing value
DOI
10.5351/KJAS.2011.24.2.315
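Chinese Library Classification (CLC) number
O21 [Probability theory and mathematical statistics]; C8 [Statistics]
Subject classification code
020208; 070103; 0714
Abstract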
Gene expression microarray data often include multiple missing values. Most gene expression analyses (including gene clustering analysis), however, require a complete data matrix as input. In ordinary clustering methods, a single missing value forces one to discard all data for a gene, even if the rest of the data for that gene are intact. The quality of the analysis may decrease seriously as the missing rate increases. On the other hand, imputing missing values may introduce artifacts that reduce the reliability of the analysis. To clarify this contradiction in microarray clustering analysis, this paper compared the accuracy of clustering with and without imputation over several microarray data sets with different missing rates. This paper also tested the clustering efficiency of several imputation methods, including our proposed algorithm. The results showed that, for imperfect microarray data, it is worthwhile to check the clustering result in this alternative way, without any imputed data.
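The three options contrasted in the abstract (discarding incomplete genes, imputing missing values, and clustering without imputation) can be illustrated with a minimal Python sketch. This is not the paper's proposed algorithm, which the abstract does not describe. The function names, the row-mean imputation, and the rescaled partial distance are all assumptions chosen for illustration.

```python
import numpy as np

def complete_cases(X):
    """Keep only genes (rows) that have no missing value."""
    return X[~np.isnan(X).any(axis=1)]

def row_mean_impute(X):
    """Fill each missing entry with the mean of the observed values in its row.
    (An illustrative imputation choice, not one named in the abstract.)"""
    X = X.copy()
    row_means = np.nanmean(X, axis=1)
    rows, cols = np.where(np.isnan(X))
    X[rows, cols] = row_means[rows]
    return X

def partial_distance(a, b):
    """Euclidean distance over entries observed in both vectors,
    rescaled to the full vector length (assumed imputation-free distance)."""
    mask = ~np.isnan(a) & ~np.isnan(b)
    if not mask.any():
        return float("nan")  # no shared observations to compare
    d2 = np.sum((a[mask] - b[mask]) ** 2)
    return float(np.sqrt(d2 * a.size / mask.sum()))

# Toy expression matrix: 3 genes x 4 arrays, one missing value.
X = np.array([
    [1.0, 2.0, 3.0, 4.0],
    [1.0, np.nan, 3.0, 4.0],
    [4.0, 3.0, 2.0, 1.0],
])

print(complete_cases(X).shape)         # gene 2 is dropped entirely
print(row_mean_impute(X)[1])           # gene 2 with the gap filled
print(partial_distance(X[0], X[1]))    # gene 2 compared without imputation
```

A distance of this kind can be passed to any clustering method that accepts a precomputed distance matrix, which keeps incomplete genes in the analysis without inventing values for them.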
Pages: 315-321
Number of pages: 7