Software Tools for Robust Analysis of High-Dimensional Data

被引:2
|
作者
Todorov, Valentin [1 ]
Filzmoser, Peter [2 ]
机构
[1] UNIDO, Vienna, Austria
[2] Vienna Univ Technol, Dept Stat & Probabil Theory, Vienna, Austria
关键词
high dimensions; robustness; classification; PLS; PCA; outliers;
D O I
10.17713/ajs.v43i4.44
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
The present work discusses robust multivariate methods specifically designed for high dimensions. Their implementation in R is presented and their application is illustrated on examples. The first group are algorithms for outlier detection, already introduced elsewhere and implemented in other packages. The value added of the new package is that all methods follow the same design pattern and thus can use the same graphical and diagnostic tools. The next topic covered is sparse principal components including an object oriented interface to the standard method proposed by Zou, Hastie, and Tibshirani (2006) and the robust one proposed by Croux, Filzmoser, and Fritz (2013). Robust partial least squares (see Hubert and Vanden Branden 2003) as well as partial least squares for discriminant analysis conclude the scope of the new package.
引用
收藏
页码:255 / 266
页数:12
相关论文
共 50 条
  • [1] Robust analysis of cancer heterogeneity for high-dimensional data
    Cheng, Chao
    Feng, Xingdong
    Li, Xiaoguang
    Wu, Mengyun
    [J]. STATISTICS IN MEDICINE, 2022, 41 (27) : 5448 - 5462
  • [2] Robust regularized cluster analysis for high-dimensional data
    Kalina, Jan
    Vlckova, Katarina
    [J]. MATHEMATICAL METHODS IN ECONOMICS (MME 2014), 2014, : 378 - 383
  • [3] Robust PCA for high-dimensional data
    Hubert, M
    Rousseeuw, PJ
    Verboven, S
    [J]. DEVELOPMENTS IN ROBUST STATISTICS, 2003, : 169 - 179
  • [4] Robust structured heterogeneity analysis approach for high-dimensional data
    Sun, Yifan
    Luo, Ziye
    Fan, Xinyan
    [J]. STATISTICS IN MEDICINE, 2022, 41 (17) : 3229 - 3259
  • [5] Fast Robust Correlation for High-Dimensional Data
    Raymaekers, Jakob
    Rousseeuw, Peter J.
    [J]. TECHNOMETRICS, 2021, 63 (02) : 184 - 198
  • [6] Robust Ridge Regression for High-Dimensional Data
    Maronna, Ricardo A.
    [J]. TECHNOMETRICS, 2011, 53 (01) : 44 - 53
  • [7] Algorithmic Tools for Mining High-Dimensional Cytometry Data
    Chester, Cariad
    Maecker, Holden T.
    [J]. JOURNAL OF IMMUNOLOGY, 2015, 195 (03): : 773 - 779
  • [8] Tools for the analysis of high-dimensional single-cell RNA sequencing data
    Wu, Yan
    Zhang, Kun
    [J]. NATURE REVIEWS NEPHROLOGY, 2020, 16 (07) : 408 - 421
  • [9] Tools for the analysis of high-dimensional single-cell RNA sequencing data
    Yan Wu
    Kun Zhang
    [J]. Nature Reviews Nephrology, 2020, 16 : 408 - 421
  • [10] High-Dimensional Software Engineering Data and Feature Selection
    Wang, Huanjing
    Khoshgoftaar, Taghi M.
    Gao, Kehan
    Seliya, Naeem
    [J]. ICTAI: 2009 21ST INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE, 2009, : 83 - +