Robust regression with compositional covariates

被引:8
|
作者
Mishra, Aditya [1 ]
Muller, Christian L. [1 ]
机构
[1] Flatiron Inst, Ctr Computat Math, New York, NY 10010 USA
关键词
Compositional data; Robust; Mean shift; Sparsity; Microbiome; VARIABLE SELECTION; MICROBIOME; REGULARIZATION; ALGORITHM; STATISTICS; LASSO;
D O I
10.1016/j.csda.2021.107315
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Many biological high-throughput datasets, such as targeted amplicon-based and metagenomic sequencing data, are compositional. A common exploratory data analysis task is to infer robust statistical associations between high-dimensional microbial compositions and habitat- or host-related covariates. To address this, a general robust statistical regression framework RobRegCC (Robust Regression with Compositional Covariates) is proposed, which extends the linear log-contrast model by a mean shift formulation for capturing outliers. RobRegCC includes sparsity-promoting convex and non-convex penalties for parsimonious model estimation, a data-driven robust initialization procedure, and a novel robust cross-validation model selection scheme. The procedure is implemented in the R package robregcc. Extensive simulation studies show the RobRegCC's ability to perform simultaneous sparse log-contrast regression and outlier detection over a wide range of settings. To demonstrate the seamless applicability of the workflow to real data, the gut microbiome dataset from HIV patients are analyzed and robust associations between a sparse set of microbial species and host immune response from soluble CD14 measurements are inferred. (C) 2021 The Author(s). Published by Elsevier B.V.
引用
收藏
页数:16
相关论文
共 50 条
  • [1] Robust regression with compositional covariates including cellwise outliers
    Stefelova, Nikola
    Alfons, Andreas
    Palarea-Albaladejo, Javier
    Filzmoser, Peter
    Hron, Karel
    [J]. ADVANCES IN DATA ANALYSIS AND CLASSIFICATION, 2021, 15 (04) : 869 - 909
  • [2] Robust regression with compositional covariates including cellwise outliers
    Nikola Štefelová
    Andreas Alfons
    Javier Palarea-Albaladejo
    Peter Filzmoser
    Karel Hron
    [J]. Advances in Data Analysis and Classification, 2021, 15 : 869 - 909
  • [3] Quantile regression for compositional covariates
    Ma, Xuejun
    Zhang, Ping
    [J]. COMMUNICATIONS IN STATISTICS-SIMULATION AND COMPUTATION, 2023, 52 (03) : 658 - 668
  • [4] Variable selection in regression with compositional covariates
    Lin, Wei
    Shi, Pixu
    Feng, Rui
    Li, Hongzhe
    [J]. BIOMETRIKA, 2014, 101 (04) : 785 - 797
  • [5] Multiple linear regression with compositional response and covariates
    Chen, Jiajia
    Zhang, Xiaoqin
    Li, Shengjia
    [J]. JOURNAL OF APPLIED STATISTICS, 2017, 44 (12) : 2270 - 2285
  • [6] Heteroskedastic linear regression model with compositional response and covariates
    Chen, Jiajia
    Zhang, Xiaoqin
    Li, Shengjia
    [J]. JOURNAL OF APPLIED STATISTICS, 2018, 45 (12) : 2164 - 2181
  • [7] Partial least squares regression with compositional response variables and covariates
    Chen, Jiajia
    Zhang, Xiaoqin
    Hron, Karel
    [J]. JOURNAL OF APPLIED STATISTICS, 2021, 48 (16) : 3130 - 3149
  • [8] GLOBALLY ADAPTIVE LONGITUDINAL QUANTILE REGRESSION WITH HIGH DIMENSIONAL COMPOSITIONAL COVARIATES
    Ma, Huijuan
    Zheng, Qi
    Zhang, Zhumin
    Lai, Huichuan
    Peng, Limin
    [J]. STATISTICA SINICA, 2023, 33 : 1295 - 1318
  • [9] Classical and Robust Regression Analysis with Compositional Data
    K. G. van den Boogaart
    P. Filzmoser
    K. Hron
    M. Templ
    R. Tolosana-Delgado
    [J]. Mathematical Geosciences, 2021, 53 : 823 - 858
  • [10] Robust Regression with Compositional Response: Application to Geosciences
    Hron, Karel
    Filzmoser, Peter
    Templ, Matthias
    van den Boogaart, Karl Gerald
    Tolosana-Delgado, Raimon
    [J]. MATHEMATICS OF PLANET EARTH, 2014, : 87 - 90