On the generation of random multivariate data

被引:9
|
作者
Camacho, Jose [1 ]
机构
[1] Univ Granada, Dept Teoria Senal Telemat & Comunicac, E-18071 Granada, Spain
关键词
Multivariate data; Simulation; ADICOV; MEDA toolbox; Montecarlo; CROSS-VALIDATION; SPARSE METHODS; MODELS; CLASSIFICATION; COMPONENTS; NUMBER; PLS;
D O I
10.1016/j.chemolab.2016.11.013
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The simulation of multivariate data is often necessary for assessing the performance of multivariate analysis techniques. The random generation of multivariate data when the covariance matrix is completely or partly specified is solved by different methods, from the Cholesky decomposition to some recent alternatives. However, many times the covariance matrix has to be generated also at random, so that the data simulation spans different situations from highly correlated to uncorrelated data. This is the case when assessing a new multivariate analysis techniqfie in Montecarlo experiments. In this paper, we introduce a new algorithm for the generation of random data from covariance matrices of random structure, where the user only decides the data dimension and the level of correlation. We will illustrate the application of this algorithm in several relevant problems in multivariate analysis, namely the selection of the number of Principal Components in Principal Component Analysis, the evaluation of the performance of sparse Partial Least Squares and the calibration of Multivariate Statistical Process Control systems. The algorithm is available as part of the MEDA Toolbox v1.1.(1)
引用
收藏
页码:40 / 51
页数:12
相关论文
共 50 条
  • [1] Generation of multivariate Weibull random variates
    Matolak, D. W.
    Sen, I.
    Xiong, W.
    IET COMMUNICATIONS, 2008, 2 (04) : 523 - 527
  • [2] Data depth, random simplices and multivariate dispersion
    Romanazzi, Mario
    STATISTICS & PROBABILITY LETTERS, 2009, 79 (12) : 1473 - 1479
  • [3] Diagnosing missing always at random in multivariate data
    Bojinov, Iavor I.
    Pillai, Natesh S.
    Rubin, Donald B.
    BIOMETRIKA, 2020, 107 (01) : 246 - 253
  • [4] A random effects model for multivariate life data
    Kimber, AC
    LIFETIME DATA: MODELS IN RELIABILITY AND SURVIVAL ANALYSIS, 1996, : 167 - 174
  • [5] A regression model for multivariate random length data
    Barnhart, HX
    Kosinski, AS
    Sampson, AR
    STATISTICS IN MEDICINE, 1999, 18 (02) : 199 - 211
  • [6] Multiplierless Algorithm for Multivariate Gaussian Random Number Generation in FPGAs
    Thomas, David B.
    Luk, Wayne
    IEEE TRANSACTIONS ON VERY LARGE SCALE INTEGRATION (VLSI) SYSTEMS, 2013, 21 (12) : 2193 - 2205
  • [7] Generation of multivariate cross-correlated geotechnical random fields
    Zhu, H.
    Zhang, L. M.
    Xiao, T.
    Li, X. Y.
    COMPUTERS AND GEOTECHNICS, 2017, 86 : 95 - 107
  • [8] GENERATION OF A RANDOM SAMPLE FROM A MULTIVARIATE NORMAL-DISTRIBUTION
    DERDE, MP
    MASSART, DL
    TRAC-TRENDS IN ANALYTICAL CHEMISTRY, 1984, 3 (09) : 218 - 220
  • [9] Class library ranlip for multivariate nonuniform random variate generation
    Beliakov, G
    COMPUTER PHYSICS COMMUNICATIONS, 2005, 170 (01) : 93 - 108
  • [10] 2 RANDOM EFFECTS MODELS FOR MULTIVARIATE BINARY DATA
    MCDONALD, BW
    BIOMETRICS, 1994, 50 (01) : 164 - 172