Dealing with Missing Values in Microarray Data

被引:0
|
作者
Mohammadi, Azadeh [1 ]
Saraee, Mohammad Hossein [1 ]
机构
[1] Isfahan Univ Technol, Dept Elect & Comp Engn, Esfahan, Iran
关键词
gene expression; microarray; missing values; fuzzy clustering; gene ontoloy;
D O I
10.1109/ICET.2008.4777511
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Gene expression profiling plays an important role in a broad range of areas in biology. The raw gene expression data, may contain missing values. It is an important preprocessing step to accurately estimate missing values in microarray data, because complete datasets are required in numerous expression profile analysis. Numerous methods have been developed to deal with missing values. In this paper, a new and robust method based on fuzzy clustering and gene ontology is proposed to estimate missing values in microarray data. In the proposed method, missing values are imputed with values generated from cluster centers. To determine the similar genes in clustering process, we have utilized the biological knowledge obtained from gene ontology as well as gene expression values. We have applied the proposed method on yeast cell cycle data and yeast environmental stress data, with different percentage of missing entries. We compared the estimation accuracy of our method with some other methods. The experimental results indicate that the proposed method outperforms other methods in terms of accuracy.
引用
收藏
页码:258 / 263
页数:6
相关论文
共 50 条
  • [41] Introduction to multiple imputation for dealing with missing data
    Lee, Katherine J.
    Simpson, Julie A.
    [J]. RESPIROLOGY, 2014, 19 (02) : 162 - 167
  • [42] Dealing with missing values in a probabilistic decision tree during classification
    Hawarah, Lamis
    Simonet, Ana
    Simonet, Michel
    [J]. ICDM 2006: SIXTH IEEE INTERNATIONAL CONFERENCE ON DATA MINING, WORKSHOPS, 2006, : 325 - +
  • [43] Bayesian methods for dealing with missing data problems
    Zhihua Ma
    Guanghui Chen
    [J]. Journal of the Korean Statistical Society, 2018, 47 : 297 - 313
  • [44] Dealing With the Missing Data Challenge in Clinical Trials
    Permutt, Thomas
    Pinheiro, Jose
    [J]. DRUG INFORMATION JOURNAL, 2009, 43 (04): : 403 - 408
  • [45] Bayesian methods for dealing with missing data problems
    Ma, Zhihua
    Chen, Guanghui
    [J]. JOURNAL OF THE KOREAN STATISTICAL SOCIETY, 2018, 47 (03) : 297 - 313
  • [46] A GMM Approach for Dealing with Missing Data on Regressors
    Abrevaya, Jason
    Donald, Stephen G.
    [J]. REVIEW OF ECONOMICS AND STATISTICS, 2017, 99 (04) : 657 - 662
  • [47] MISSING VALUES IN MULTIVARIATE DATA
    KUZMA, JW
    [J]. BIOMETRICS, 1965, 21 (01) : 254 - &
  • [48] Dealing with missing phase and missing data in phylogeny-based analysis
    Claire Bardel
    Pascal Croiseau
    Emmanuelle Génin
    [J]. BMC Proceedings, 1 (Suppl 1)
  • [49] Microarray Missing Values Imputation Methods: Critical Analysis Review
    Hourani, Mou'ath
    El Emary, Ibrahiem M. M.
    [J]. COMPUTER SCIENCE AND INFORMATION SYSTEMS, 2009, 6 (02) : 165 - 190
  • [50] Effectiveness of Different Partition Based Clustering Algorithms for Estimation of Missing Values in Microarray Gene Expression Data
    Bose, Shilpi
    Das, Chandra
    Chakraborty, Abirlal
    Chattopadhyay, Samiran
    [J]. ADVANCES IN COMPUTING AND INFORMATION TECHNOLOGY, VOL 2, 2013, 177 : 37 - +