High dimensional data analysis using multivariate generalized spatial quantiles

被引:9
|
作者
Mukhopadhyay, Nitai D. [1 ]
Chatterjee, Snigdhansu [2 ]
机构
[1] Virginia Commonwealth Univ, Dept Biostat, Richmond, VA 23298 USA
[2] Univ Minnesota, Sch Stat, Minneapolis, MN 55455 USA
基金
美国国家科学基金会;
关键词
Multivariate quantile; Spatial quantile; Projection quantile; Generalized spatial quantile; Multidimensional coverage sets; Multivariate order statistics; Brain imaging; High dimensional data visualization; DATA DEPTH; TRANSFORMATION-RETRANSFORMATION; PROJECTION DEPTH; RANK-TESTS; CELL-CYCLE; ONE-SAMPLE; ESTIMATORS; EXPRESSION; SIGN; IDENTIFICATION;
D O I
10.1016/j.jmva.2010.12.002
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
High dimensional data routinely arises in image analysis, genetic experiments, network analysis, and various other research areas. Many such datasets do not correspond to well-studied probability distributions, and in several applications the data-cloud prominently displays non-symmetric and non-convex shape features. We propose using spatial quantiles and their generalizations, in particular, the projection quantile, for describing, analyzing and conducting inference with multivariate data. Minimal assumptions are made about the nature and shape characteristics of the underlying probability distribution, and we do not require the sample size to be as high as the data-dimension. We present theoretical properties of the generalized spatial quantiles, and an algorithm to compute them quickly. Our quantiles may be used to obtain multidimensional confidence or credible regions that are not required to conform to a pre-determined shape. We also propose a new notion of multidimensional order statistics, which may be used to obtain multidimensional outliers. Many of the features revealed using a generalized spatial quantile-based analysis would be missed if the data was shoehorned into a well-known probabilistic configuration. (C) 2011 Elsevier Inc. All rights reserved.
引用
收藏
页码:768 / 780
页数:13
相关论文
共 50 条
  • [1] QUANTILE TOMOGRAPHY: USING QUANTILES WITH MULTIVARIATE DATA
    Kong, Linglong
    Mizera, Ivan
    [J]. STATISTICA SINICA, 2012, 22 (04) : 1589 - 1610
  • [2] Analysis of Multivariate and High Dimensional Data
    Shalabh
    [J]. JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES A-STATISTICS IN SOCIETY, 2015, 178 (03) : 783 - 784
  • [3] Multivariate ρ-quantiles: A spatial approach
    Konen, Dimitri
    Paindaveine, Davy
    [J]. BERNOULLI, 2022, 28 (03) : 1912 - 1934
  • [4] Functional data analysis of generalized regression quantiles
    Mengmeng Guo
    Lan Zhou
    Jianhua Z. Huang
    Wolfgang Karl Härdle
    [J]. Statistics and Computing, 2015, 25 : 189 - 202
  • [5] Functional data analysis of generalized regression quantiles
    Guo, Mengmeng
    Zhou, Lan
    Huang, Jianhua Z.
    Haerdle, Wolfgang Karl
    [J]. STATISTICS AND COMPUTING, 2015, 25 (02) : 189 - 202
  • [6] Goodness-of-fit analysis for multivariate normality based on generalized quantiles
    Beirlant, J
    Mason, DM
    Vynckier, C
    [J]. COMPUTATIONAL STATISTICS & DATA ANALYSIS, 1999, 30 (02) : 119 - 142
  • [7] Generalized multivariate rank type test statistics via spatial U-quantiles
    Zhou, Weihua
    Serfling, Robert
    [J]. STATISTICS & PROBABILITY LETTERS, 2008, 78 (04) : 376 - 383
  • [8] On a geometric notion of quantiles for multivariate data
    Chaudhuri, P
    [J]. JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1996, 91 (434) : 862 - 872
  • [9] Nonparametric multivariate descriptive measures based on spatial quantiles
    Serfling, R
    [J]. JOURNAL OF STATISTICAL PLANNING AND INFERENCE, 2004, 123 (02) : 259 - 278
  • [10] Multivariate quantiles in hydrological frequency analysis
    Chebana, F.
    Ouarda, T. B. M. J.
    [J]. ENVIRONMETRICS, 2011, 22 (01) : 63 - 78