A New Approach of Outlier-robust Missing Value Imputation for Metabolomics Data Analysis

被引:12
|
作者
Kumar, Nishith [1 ,2 ]
Hoque, Md Aminul [1 ]
Shahjaman, Md [1 ,3 ]
Islam, S. M. Shahinul [4 ]
Mollah, Md Nurul Haque [1 ]
机构
[1] Rajshahi Univ, Dept Stat, Rajshahi 6205, Bangladesh
[2] Bangabandhu Sheikh Mujibur Rahman Sci & Technol U, Dept Stat, Gopalganj, Bangladesh
[3] Begum Rokeya Univ, Dept Stat, Rangpur, Bangladesh
[4] Rajshahi Univ, Inst Biol Sci, Rajshahi 6205, Bangladesh
关键词
Metabolomics; missing data; missing value imputation; singular value decomposition (SVD); receiver operating characteristic (ROC) curve; support vector machine; PCA;
D O I
10.2174/1574893612666171121154655
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: Metabolomics data generation and quantification are different from other types of molecular "omics" data in bioinformatics. Mass spectrometry (MS) based (gas chromatography mass spectrometry (GC-MS), liquid chromatography mass spectrometry (LC-MS), etc.) metabolomics data frequently contain missing values that make some quantitative analysis complex. Typically metabolomics datasets contain 10% to 20% missing values that originate from several reasons, like analytical, computational as well as biological hazard. Imputation of missing values is a very important and interesting issue for further metabolomics data analysis. Objective: This paper introduces a new algorithm for missing value imputation in the presence of outliers for metabolomics data analysis. Method: Currently, the most well known missing value imputation techniques in metabolomics data are knearest neighbours (kNN), random forest (RF) and zero imputation. However, these techniques are sensitive to outliers. In this paper, we have proposed an outlier robust missing imputation technique by minimizing two-way empirical mean absolute error (MAE) loss function for imputing missing values in metabolomics data. Results: We have investigated the performance of the proposed missing value imputation technique in a comparison of the other traditional imputation techniques using both simulated and real data analysis in the absence and presence of outliers. Conclusion: Results of both simulated and real data analyses show that the proposed outlier robust missing imputation technique is better performer than the traditional missing imputation methods in both absence and presence of outliers.
引用
收藏
页码:43 / 52
页数:10
相关论文
共 50 条
  • [1] rMisbeta: A robust missing value imputation approach in transcriptomics and metabolomics data
    Shahjaman, Md
    Rahman, Md Rezanur
    Islam, Tania
    Auwul, Md Rabiul
    Moni, Mohammad Ali
    Mollah, Md Nurul Haque
    [J]. COMPUTERS IN BIOLOGY AND MEDICINE, 2021, 138
  • [2] Missing value imputation strategies for metabolomics data
    Grace Armitage, Emily
    Godzien, Joanna
    Alonso-Herranz, Vanesa
    Lopez-Gonzalvez, Angeles
    Barbas, Coral
    [J]. ELECTROPHORESIS, 2015, 36 (24) : 3050 - 3060
  • [3] A functional data approach to missing value imputation and outlier detection for traffic flow data
    Chiou, Jeng-Min
    Zhang, Yi-Chen
    Chen, Wan-Hui
    Chang, Chiung-Wen
    [J]. TRANSPORTMETRICA B-TRANSPORT DYNAMICS, 2014, 2 (02) : 106 - 129
  • [4] Missing Value Imputation Approach for Mass Spectrometry-based Metabolomics Data
    Wei, Runmin
    Wang, Jingye
    Su, Mingming
    Jia, Erik
    Chen, Shaoqiu
    Chen, Tianlu
    Ni, Yan
    [J]. SCIENTIFIC REPORTS, 2018, 8
  • [5] Missing Value Imputation Approach for Mass Spectrometry-based Metabolomics Data
    Runmin Wei
    Jingye Wang
    Mingming Su
    Erik Jia
    Shaoqiu Chen
    Tianlu Chen
    Yan Ni
    [J]. Scientific Reports, 8
  • [6] Collateral missing value imputation: a new robust missing value estimation algorithm for microarray data
    Sehgal, MSB
    Gondal, I
    Dooley, LS
    [J]. BIOINFORMATICS, 2005, 21 (10) : 2417 - 2423
  • [7] A robust missing value imputation method for noisy data
    Zhu, Bing
    He, Changzheng
    Liatsis, Panos
    [J]. APPLIED INTELLIGENCE, 2012, 36 (01) : 61 - 74
  • [8] A robust missing value imputation method for noisy data
    Bing Zhu
    Changzheng He
    Panos Liatsis
    [J]. Applied Intelligence, 2012, 36 : 61 - 74
  • [9] Outlier-Robust Gromov-Wasserstein for Graph Data
    Kong, Lemin
    Li, Jiajin
    Tang, Jianheng
    So, Anthony Man-Cho
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
  • [10] Missing value imputation and outlier detection for functional data: an application for PM10 data
    Melendez, Rafael
    Bolivar, Stevenson
    Rojano, Roberto
    [J]. UIS INGENIERIAS, 2020, 19 (02): : 1 - 10