RRPP: An R package for fitting linear models to high-dimensional data using residual randomization

被引:426
|
作者
Collyer, Michael L. [1 ]
Adams, Dean C. [2 ,3 ]
机构
[1] Chatham Univ, Dept Sci Biol, Pittsburgh, PA 15232 USA
[2] Iowa State Univ, Dept Ecol Evolut & Organismal Biol, Ames, IA USA
[3] Iowa State Univ, Dept Stat, Ames, IA USA
来源
METHODS IN ECOLOGY AND EVOLUTION | 2018年 / 9卷 / 07期
基金
美国国家科学基金会;
关键词
dissimilarity; generalized least-squares; high-dimensional data; multivariate; PERMUTATION TESTS; MULTIVARIATE-ANALYSIS; VARIANCE; REGRESSION; MATRICES; SHAPE;
D O I
10.1111/2041-210X.13029
中图分类号
Q14 [生态学(生物生态学)];
学科分类号
071012 ; 0713 ;
摘要
1. Residual randomization in permutation procedures (RRPP) is an appropriate means of generating empirical sampling distributions for ANOVA statistics and linear model coefficients, using ordinary or generalized least-squares estimation. This is an especially useful approach for high-dimensional (multivariate) data. 2. Here, we present an r package that provides a comprehensive suite of tools for applying RRPP to linear models. Important available features include choices for OLS or GLS coefficient estimation, data or dissimilarity matrix analysis capability, choice among types I, II, or III sums of squares and cross-products, various effect size estimation methods, and an ability to perform mixed-model ANOVA. 3. The lm.rrpp function is similar to the lm function in many regards, but provides coefficient and ANOVA statistics estimates over many random permutations. The S3 generic functions commonly used with lm also work with lm.rrpp. Additionally, a pairwise function provides statistical tests for comparisons of least-squares means or slopes, among designated groups. Users have many options for varying random permutations. Compared to similar available packages and functions, RRPP is extremely fast and yields comprehensive results for downstream analyses and graphics, following model fits with lm.rrpp. 4. The RRPP package facilitates analysis of both univariate and multivariate response data, even when the number of variables exceeds the number of observations.
引用
收藏
页码:1772 / 1779
页数:8
相关论文
共 50 条
  • [1] Robust Inference for High-Dimensional Linear Models via Residual Randomization
    Wang, Y. Samuel
    Lee, Si Kai
    Toulis, Panos
    Kolar, Mladen
    [J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 139, 2021, 139 : 7818 - 7828
  • [2] ordinalbayes: Fitting Ordinal Bayesian Regression Models to High-Dimensional Data Using R
    Archer, Kellie J.
    Seffernick, Anna Eames
    Sun, Shuai
    Zhang, Yiran
    [J]. STATS, 2022, 5 (02): : 371 - 384
  • [3] Using Evolutionary Algorithms for Fitting High-Dimensional Models to Neuronal Data
    Carl-Magnus Svensson
    Stephen Coombes
    Jonathan Westley Peirce
    [J]. Neuroinformatics, 2012, 10 : 199 - 218
  • [4] Using Evolutionary Algorithms for Fitting High-Dimensional Models to Neuronal Data
    Svensson, Carl-Magnus
    Coombes, Stephen
    Peirce, Jonathan Westley
    [J]. NEUROINFORMATICS, 2012, 10 (02) : 199 - 218
  • [5] PREDICTION WHEN FITTING SIMPLE MODELS TO HIGH-DIMENSIONAL DATA
    Steinberger, Lukas
    Leeb, Hannes
    [J]. ANNALS OF STATISTICS, 2019, 47 (03): : 1408 - 1442
  • [6] Variable Clustering in High-Dimensional Linear Regression: The R Package clere
    Yengo, Loic
    Jacques, Julien
    Biernacki, Christophe
    Canouil, Mickael
    [J]. R JOURNAL, 2016, 8 (01): : 92 - 106
  • [7] ecpc: an R-package for generic co-data models for high-dimensional prediction
    van Nee, Mirrelijn M.
    Wessels, Lodewyk F. A.
    van de Wiel, Mark A.
    [J]. BMC BIOINFORMATICS, 2023, 24 (01)
  • [8] ecpc: an R-package for generic co-data models for high-dimensional prediction
    Mirrelijn M. van Nee
    Lodewyk F. A. Wessels
    Mark A. van de Wiel
    [J]. BMC Bioinformatics, 24
  • [9] g.ridge: An R Package for Generalized Ridge Regression for Sparse and High-Dimensional Linear Models
    Emura, Takeshi
    Matsumoto, Koutarou
    Uozumi, Ryuji
    Michimae, Hirofumi
    [J]. SYMMETRY-BASEL, 2024, 16 (02):
  • [10] High-dimensional shape fitting in linear time
    Har-Peled, S
    Varadarajan, KR
    [J]. DISCRETE & COMPUTATIONAL GEOMETRY, 2004, 32 (02) : 269 - 288