Imputing missing values for genetic interaction data

Cited by: 5
Authors
Wang, Yishu [1]
Wang, Lin [1]
Yang, Dejie [2]
Deng, Minghua [1, 3, 4]
Affiliations
[1] Peking Univ, Ctr Quantitat Biol, Beijing 100871, Peoples R China
[2] Chinese Acad Sci, Inst Comp Technol, Beijing 100190, Peoples R China
[3] Peking Univ, Sch Math Sci, Beijing 100871, Peoples R China
[4] Peking Univ, Ctr Stat Sci, Beijing 100871, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Soft-SVD; Imputation; EMAP; Genetic interaction
DOI
10.1016/j.ymeth.2014.03.032
Chinese Library Classification number
Q5 [Biochemistry]
Discipline classification codes
071010; 081704
Abstract
Background: Epistatic Miniarray Profiles (EMAP) enable the research of genetic interaction as an important method to construct large-scale genetic interaction networks. However, a high proportion of missing values frequently poses problems in EMAP data analysis since such missing values hinder downstream analysis. While some imputation approaches have been available to EMAP data, we adopted an improved SVD modeling procedure to impute the missing values in EMAP data which has resulted in a higher accuracy rate compared with existing methods.

Results: The improved SVD imputation method adopts an effective soft-threshold to the SVD approach which has been shown to be the best model to impute genetic interaction data when compared with a number of advanced imputation methods. Imputation methods also improve the clustering results of EMAP datasets. Thus, after applying our imputation method on the EMAP dataset, more meaningful modules, known pathways and protein complexes could be detected.

Conclusion: While the phenomenon of missing data unavoidably complicates EMAP data, our results showed that we could complete the original dataset by the Soft-SVD approach to accurately recover genetic interactions.

© 2014 Elsevier Inc. All rights reserved.
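
The abstract names a soft-thresholded SVD as the core of the imputation but does not spell out the procedure. As a rough illustration only, the sketch below shows the generic idea of soft-thresholded SVD matrix completion: fill the gaps, shrink the singular values, and refill the gaps from the low-rank estimate until the fill stabilizes. It is an assumption-laden example and not the authors' algorithm; the function name `soft_svd_impute`, the parameters `lam`, `max_iter` and `tol`, the zero initial fill, and the toy matrix are all hypothetical.

```python
import numpy as np


def soft_svd_impute(X, lam=1.0, max_iter=200, tol=1e-6):
    """Fill NaN entries of X by iterating a soft-thresholded SVD (illustrative only)."""
    X = np.asarray(X, dtype=float)
    missing = np.isnan(X)
    # Start from a zero fill of the missing entries (an assumption of this sketch).
    filled = np.where(missing, 0.0, X)
    for _ in range(max_iter):
        U, s, Vt = np.linalg.svd(filled, full_matrices=False)
        # Soft-threshold: shrink every singular value by lam, clipping at zero.
        s_shrunk = np.maximum(s - lam, 0.0)
        low_rank = (U * s_shrunk) @ Vt
        # Keep observed scores; use the low-rank estimate only where data is missing.
        updated = np.where(missing, low_rank, X)
        change = np.linalg.norm(updated - filled) / max(np.linalg.norm(filled), 1e-12)
        filled = updated
        if change < tol:
            break
    return filled


# Toy symmetric score matrix with two missing pairs (values are made up).
scores = np.array([
    [0.0, 2.0, 3.0, np.nan],
    [2.0, 0.0, np.nan, 8.0],
    [3.0, np.nan, 0.0, 12.0],
    [np.nan, 8.0, 12.0, 0.0],
])
completed = soft_svd_impute(scores, lam=0.5)
```

In this sketch, `lam` controls how aggressively small singular values are suppressed: a larger value gives a lower-rank, more heavily shrunk estimate. In practice it would presumably be chosen by masking some observed scores and checking how well they are recovered.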
Pages: 269-277
Number of pages: 9