A cluster-directed framework for neighbour based imputation of missing value in microarray data

被引:17
|
作者
Keerin, Phimmarin [1 ]
Kurutach, Werasak [1 ]
Boongoen, Tossapon [2 ]
机构
[1] Mahanakorn Univ Technol, Fac Informat Sci & Technol, Bangkok, Thailand
[2] Navaminda Kasatriyadhiraj Royal Air Force Acad, Dept Math & Comp Sci, Bangkok, Thailand
关键词
missing value; imputation; gene expression data; clustering; regression; nearest neighbour; GENE-EXPRESSION DATA; CANCER;
D O I
10.1504/IJDMB.2016.076535
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
DNA microarray has been the most widely used functional genomics approach in bioinformatics. However, microarray data suffer from frequent missing values due to various experimental and data handling reasons. Leaving this unsolved may degrade the reliability of any consequent downstream analysis. As such, missing value imputation has been recognised as an important pre-processing step, which can yield the quality of data and its interpretation. Several techniques found in the literature have successfully exploited the characteristics and relations among a set of genes closest to the one under examination. However, the selection of so-called nearest neighbours is based simply on proximity between gene pairs, without taking the structural or grouping information into account. In response, this paper proposes a novel cluster-directed framework (CFNI: Cluster-directed Framework for Neighbour-based Imputation), in which data clustering is uniquely used to guide the identification of nearest neighbours. This allows a more accurate imputed value to be derived. Not only it performs better than several benchmark methods on published microarray data sets; it is also generalised such that any neighbour-based imputation technique can be coupled with the proposed model. This has been successfully demonstrated with both single pass and iterative models.
引用
收藏
页码:165 / 193
页数:29
相关论文
共 50 条
  • [31] Incorporating Nonlinear Relationships in Microarray Missing Value Imputation
    Yu, Tianwei
    Peng, Hesen
    Sun, Wei
    [J]. IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2011, 8 (03) : 723 - 731
  • [32] The influence of missing value imputation on detection of differentially expressed genes from microarray data
    Scheel, I
    Aldrin, M
    Glad, IK
    Sorum, R
    Lyng, H
    Frigessi, A
    [J]. BIOINFORMATICS, 2005, 21 (23) : 4272 - 4279
  • [33] Missing value imputation for microarray gene expression data using histone acetylation information
    Xiang, Qian
    Dai, Xianhua
    Deng, Yangyang
    He, Caisheng
    Wang, Jiang
    Feng, Jihua
    Dai, Zhiming
    [J]. BMC BIOINFORMATICS, 2008, 9 (1)
  • [34] Missing value imputation for microarray gene expression data using histone acetylation information
    Qian Xiang
    Xianhua Dai
    Yangyang Deng
    Caisheng He
    Jiang Wang
    Jihua Feng
    Zhiming Dai
    [J]. BMC Bioinformatics, 9
  • [35] An accurate and robust missing value estimation for Microarray data: least absolute deviation imputation
    Cao, Yi
    Poh, Kim Leng
    [J]. ICMLA 2006: 5TH INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS, PROCEEDINGS, 2006, : 157 - +
  • [36] Robust imputation method for missing values in microarray data
    Yoon, Dankyu
    Lee, Eun-Kyung
    Park, Taesung
    [J]. BMC BIOINFORMATICS, 2007, 8 (Suppl 2)
  • [37] Robust imputation method for missing values in microarray data
    Dankyu Yoon
    Eun-Kyung Lee
    Taesung Park
    [J]. BMC Bioinformatics, 8
  • [38] Imputation Algorithm Based on Copula for Missing Value in Timeseries Data
    Afrianti, Y. S.
    Indratno, S. W.
    Pasaribu, U. S.
    [J]. 2014 2ND INTERNATIONAL CONFERENCE ON TECHNOLOGY, INFORMATICS, MANAGEMENT, ENGINEERING, AND ENVIRONMENT (TIME-E 2014), 2014, : 252 - 257
  • [39] A Novel Approach for Missing Value Imputation and Classification of Microarray Dataset
    Senapti, Rajashree
    Shaw, Kailash
    Mishra, Sashikala
    Mishra, Debahuti
    [J]. INTERNATIONAL CONFERENCE ON MODELLING OPTIMIZATION AND COMPUTING, 2012, 38 : 1067 - 1071
  • [40] A Quasi-linear Approach for Microarray Missing Value Imputation
    Cheng, Yu
    Wang, Lan
    Hu, Jinglu
    [J]. NEURAL INFORMATION PROCESSING, PT I, 2011, 7062 : 233 - +