Dealing with Missing Values in Microarray Data

被引:0
|
作者
Mohammadi, Azadeh [1 ]
Saraee, Mohammad Hossein [1 ]
机构
[1] Isfahan Univ Technol, Dept Elect & Comp Engn, Esfahan, Iran
关键词
gene expression; microarray; missing values; fuzzy clustering; gene ontoloy;
D O I
10.1109/ICET.2008.4777511
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Gene expression profiling plays an important role in a broad range of areas in biology. The raw gene expression data, may contain missing values. It is an important preprocessing step to accurately estimate missing values in microarray data, because complete datasets are required in numerous expression profile analysis. Numerous methods have been developed to deal with missing values. In this paper, a new and robust method based on fuzzy clustering and gene ontology is proposed to estimate missing values in microarray data. In the proposed method, missing values are imputed with values generated from cluster centers. To determine the similar genes in clustering process, we have utilized the biological knowledge obtained from gene ontology as well as gene expression values. We have applied the proposed method on yeast cell cycle data and yeast environmental stress data, with different percentage of missing entries. We compared the estimation accuracy of our method with some other methods. The experimental results indicate that the proposed method outperforms other methods in terms of accuracy.
引用
收藏
页码:258 / 263
页数:6
相关论文
共 50 条
  • [21] DEALING WITH MISSING VALUES IN NUCLEAR SAFEGUARDS EVALUATION
    Kabak, Ozgur
    Ruan, Da
    [J]. INTELLIGENT DECISION MAKING SYSTEMS, VOL. 2, 2010, : 145 - +
  • [22] LSimpute: accurate estimation of missing values in microarray data with least squares methods
    Bo, TH
    Dysvik, J
    Jonassen, I
    [J]. NUCLEIC ACIDS RESEARCH, 2004, 32 (03) : e34
  • [23] Don't Do Imputation: Dealing with Informative Missing Values in EHR Data Analysis
    Li, Jia
    Wang, Mengdie
    Steinbach, Michael S.
    Kumar, Vipin
    Simon, Gyorgy J.
    [J]. 2018 9TH IEEE INTERNATIONAL CONFERENCE ON BIG KNOWLEDGE (ICBK), 2018, : 415 - 422
  • [24] A Novel Interpolation Based Missing Value Estimation Method to Predict Missing Values in Microarray Gene Expression Data
    Bose, Shilpi
    Das, Chandra
    Dutta, Sourav
    Chattopadhyay, Samiran
    [J]. PROCEEDINGS OF THE 2012 INTERNATIONAL CONFERENCE ON COMMUNICATIONS, DEVICES AND INTELLIGENT SYSTEMS (CODLS), 2012, : 318 - 321
  • [25] Dealing with missing software project data
    Cartwright, MH
    Shepperd, MJ
    Song, Q
    [J]. NINTH INTERNATIONAL SOFTWARE METRICS SYMPOSIUM, PROCEEDINGS, 2003, : 154 - 165
  • [26] Dealing with missing data: Part II
    Walczak, B
    Massart, DL
    [J]. CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 2001, 58 (01) : 29 - 42
  • [27] Dealing with missing values and outliers in principal component analysis
    Stanimirova, I.
    Daszykowski, M.
    Walczak, B.
    [J]. TALANTA, 2007, 72 (01) : 172 - 178
  • [28] Dealing with Missing Values for Effective Prediction of NPC Recurrence
    Kumdee, Orrawan
    Ritthipravat, Panrasee
    Bhongmakapat, Thongchai
    Cheewaruangroj, Wichit
    [J]. 2008 PROCEEDINGS OF SICE ANNUAL CONFERENCE, VOLS 1-7, 2008, : 1231 - +
  • [29] Multiple imputation: dealing with missing data
    de Goeij, Moniek C. M.
    van Diepen, Merel
    Jager, Kitty J.
    Tripepi, Giovanni
    Zoccali, Carmine
    Dekker, Friedo W.
    [J]. NEPHROLOGY DIALYSIS TRANSPLANTATION, 2013, 28 (10) : 2415 - 2420
  • [30] Dealing with missing data: Part I
    Walczak, B.
    Massart, D.L.
    [J]. Chemometrics and Intelligent Laboratory Systems, 2001, 58 (01) : 15 - 27