KNN-DTW Based Missing Value Imputation for Microarray Time Series Data

被引:26
|
作者
Hsu, Hui-Huang [1 ]
Yang, Andy C. [2 ]
Lu, Ming-Da [2 ]
机构
[1] Tamkang Univ, Dept Comp Sci & Informat Engn, Comp Sci & Informat Engn, Taipei, Taiwan
[2] Tamkang Univ, Dept Comp Sci & Informat Engn, Taipei, Taiwan
关键词
microarray time series data; missing value imputation; dynamic time warping; k-nearest neighbor;
D O I
10.4304/jcp.6.3.418-425
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Microarray technology provides an opportunity for scientists to analyze thousands of gene expression profiles simultaneously. However, microarray gene expression data often contain multiple missing expression values due to many reasons. Effective methods for missing value imputation in gene expression data are needed since many algorithms for gene analysis require a complete matrix of gene array values. Several algorithms are proposed to handle this problem, but they have various limitations. In this paper, we develop a novel method to impute missing values in microarray time-series data combining k-nearest neighbor (KNN) and dynamic time warping (DTW). We also analyze and implement several variants of DTW to further improve the efficiency and accuracy of our method. Experimental results show that our method is more accurate compared with existing missing value imputation methods on real microarray time series datasets.
引用
收藏
页码:418 / 425
页数:8
相关论文
共 50 条
  • [41] Incorporating Nonlinear Relationships in Microarray Missing Value Imputation
    Yu, Tianwei
    Peng, Hesen
    Sun, Wei
    [J]. IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2011, 8 (03) : 723 - 731
  • [42] A hybrid imputation approach for microarray missing value estimation
    Li, Huihui
    Zhao, Changbo
    Shao, Fengfeng
    Li, Guo-Zheng
    Wang, Xiao
    [J]. BMC GENOMICS, 2015, 16
  • [43] Missing Value Imputation for Traffic-Related Time Series Data Based on a Multi-View Learning Method
    Li, Linchao
    Zhang, Jian
    Wang, Yonggang
    Ran, Bin
    [J]. IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2019, 20 (08) : 2933 - 2943
  • [44] MTSSP: Missing Value Imputation in Multivariate Time Series for Survival Prediction
    Li, Bo
    Shi, Yuliang
    Cheng, Lin
    Yan, Zhongmin
    Wang, Xinjun
    Li, Hui
    [J]. 2022 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2022,
  • [45] Imputation Methods in Time Series with a Trend and a Consecutive Missing Value Pattern
    Wongoutong, Chantha
    [J]. THAILAND STATISTICIAN, 2021, 19 (04): : 866 - 879
  • [46] Missing value imputation in time series using Singular Spectrum Analysis
    Mahmoudvand, Rahim
    Rodrigues, Paulo Canas
    [J]. INTERNATIONAL JOURNAL OF ENERGY AND STATISTICS, 2016, 4 (01)
  • [47] On Combining Websensors and DTW Distance for kNN Time Series Forecasting
    Marcacini, Ricardo M.
    Carnevali, Julio C.
    Domingos, Joao
    [J]. 2016 23RD INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2016, : 2521 - 2525
  • [48] The influence of missing value imputation on detection of differentially expressed genes from microarray data
    Scheel, I
    Aldrin, M
    Glad, IK
    Sorum, R
    Lyng, H
    Frigessi, A
    [J]. BIOINFORMATICS, 2005, 21 (23) : 4272 - 4279
  • [49] Missing value imputation for microarray gene expression data using histone acetylation information
    Xiang, Qian
    Dai, Xianhua
    Deng, Yangyang
    He, Caisheng
    Wang, Jiang
    Feng, Jihua
    Dai, Zhiming
    [J]. BMC BIOINFORMATICS, 2008, 9 (1)
  • [50] Improved KNN Imputation for Missing Values in Gene Expression Data
    Keerin, Phimmarin
    Boongoen, Tossapon
    [J]. CMC-COMPUTERS MATERIALS & CONTINUA, 2022, 70 (02): : 4009 - 4025