Shrinkage regression-based methods for microarray missing value imputation

被引:10
|
作者
Wang, Hsiuying [1 ]
Chiu, Chia-Chun [2 ]
Wu, Yi-Ching [1 ]
Wu, Wei-Sheng [2 ]
机构
[1] Natl Chiao Tung Univ, Inst Stat, Hsinchu 300, Taiwan
[2] Natl Cheng Kung Univ, Dept Elect Engn, Tainan 701, Taiwan
来源
BMC SYSTEMS BIOLOGY | 2013年 / 7卷
关键词
CELL-CYCLE TRANSCRIPTION; GENE-EXPRESSION PATTERNS; REGULATORY MODULES; IDENTIFICATION; LYMPHOMA;
D O I
10.1186/1752-0509-7-S6-S11
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Background: Missing values commonly occur in the microarray data, which usually contain more than 5% missing values with up to 90% of genes affected. Inaccurate missing value estimation results in reducing the power of downstream microarray data analyses. Many types of methods have been developed to estimate missing values. Among them, the regression-based methods are very popular and have been shown to perform better than the other types of methods in many testing microarray datasets. Results: To further improve the performances of the regression-based methods, we propose shrinkage regression-based methods. Our methods take the advantage of the correlation structure in the microarray data and select similar genes for the target gene by Pearson correlation coefficients. Besides, our methods incorporate the least squares principle, utilize a shrinkage estimation approach to adjust the coefficients of the regression model, and then use the new coefficients to estimate missing values. Simulation results show that the proposed methods provide more accurate missing value estimation in six testing microarray datasets than the existing regression-based methods do. Conclusions: Imputation of missing values is a very important aspect of microarray data analyses because most of the downstream analyses require a complete dataset. Therefore, exploring accurate and efficient methods for estimating missing values has become an essential issue. Since our proposed shrinkage regression-based methods can provide accurate missing value estimation, they are competitive alternatives to the existing regression-based methods.
引用
收藏
页数:12
相关论文
共 50 条
  • [1] Framework for regression-based missing data imputation methods in on-line MSPC
    Arteaga, F
    Ferrer, A
    [J]. JOURNAL OF CHEMOMETRICS, 2005, 19 (08) : 439 - 447
  • [2] Regression-based imputation of explanatory discrete missing data
    Hernandez-Herrera, Gilma
    Navarro, Albert
    Morina, David
    [J]. COMMUNICATIONS IN STATISTICS-SIMULATION AND COMPUTATION, 2022,
  • [3] MICROARRAY MISSING DATA IMPUTATION USING REGRESSION
    Bayrak, Tuncay
    Ogul, Hasan
    [J]. 2017 13TH IASTED INTERNATIONAL CONFERENCE ON BIOMEDICAL ENGINEERING (BIOMED), 2017, : 68 - 73
  • [4] Triple Imputation for Microarray Missing Value Estimation
    He, Chong
    Li, Hui-Hui
    Zhao, Changbo
    Li, Guo-Zheng
    Zhang, Wei
    [J]. PROCEEDINGS 2015 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE, 2015, : 208 - 213
  • [5] An Optimization Algorithm for Missing Value Imputation in Microarray Based on Integrated Information
    Liu, Feng
    Zhang, Yiding
    Liu, Zeming
    Gao, Meng
    [J]. FUZZY SYSTEMS AND DATA MINING V (FSDM 2019), 2019, 320 : 55 - 64
  • [6] Task reduction using regression-based missing data imputation in sparse mobile crowdsensing
    Marchang, Ningrinla
    Meitei, Goldie M.
    Thakur, Tejendra
    [J]. JOURNAL OF SUPERCOMPUTING, 2022, 78 (14): : 15995 - 16028
  • [7] Task reduction using regression-based missing data imputation in sparse mobile crowdsensing
    Ningrinla Marchang
    Goldie M. Meitei
    Tejendra Thakur
    [J]. The Journal of Supercomputing, 2022, 78 : 15995 - 16028
  • [8] A hybrid imputation approach for microarray missing value estimation
    Huihui Li
    Changbo Zhao
    Fengfeng Shao
    Guo-Zheng Li
    Xiao Wang
    [J]. BMC Genomics, 16
  • [9] A hybrid imputation approach for microarray missing value estimation
    Li, Huihui
    Zhao, Changbo
    Shao, Fengfeng
    Li, Guo-Zheng
    Wang, Xiao
    [J]. BMC GENOMICS, 2015, 16
  • [10] Incorporating Nonlinear Relationships in Microarray Missing Value Imputation
    Yu, Tianwei
    Peng, Hesen
    Sun, Wei
    [J]. IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2011, 8 (03) : 723 - 731