Tracy-Widom statistic for the largest eigenvalue of autoscaled real matrices

被引:15
|
作者
Saccenti, Edoardo [1 ,3 ]
Smilde, Age K. [1 ,3 ]
Westerhuis, Johan A. [1 ,3 ]
Hendriks, Margriet M. W. B. [2 ,3 ]
机构
[1] Univ Amsterdam, Swammerdam Inst Life Sci, Biosyst Data Anal Grp, NL-1098 XH Amsterdam, Netherlands
[2] Univ Med Ctr Utrecht, Dept Metab Dis, Utrecht, Netherlands
[3] Netherlands Metabol Ctr, Leiden, Netherlands
关键词
largest eigenvalue; covariance matrix; Tracy-Widom distribution; eigenanalysis; autoscaling; LEVEL-SPACING DISTRIBUTIONS; F-TEST; NUMBER; UNIVERSALITY; ASSOCIATION; COMPONENTS;
D O I
10.1002/cem.1411
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Eigenanalysis is common practice in biostatistics, and the largest eigenvalue of a data set contains valuable information about the data. However, to make inferences about the size of the largest eigenvalue, its distribution must be known. Johnstone's theorem states that the largest eigenvalues l1 of real random covariance matrices are distributed according to the TracyWidom distribution of order 1 when properly normalized to L1=l1-?np?np, where ?np and ?np are functions of the data matrix dimensions n and p. Very often, data are expressed in terms of correlations (autoscaling) for which case Johnstone's theorem does not work because the normalizing parameters ?np and ?np are not theoretically known. In this paper we propose a semi-empirical method based on test-equating theory to numerically approximate the normalization parameters in the case of autoscaled matrices. This opens the way of making inferences regarding the largest eigenvalue of an autoscaled data set. The method is illustrated by means of application to two real-life data sets. Copyright (C) 2011 John Wiley & Sons, Ltd.
引用
收藏
页码:644 / 652
页数:9
相关论文
共 50 条