Principal component analysis

被引:0
|
作者
Michael Greenacre
Patrick J. F. Groenen
Trevor Hastie
Alfonso Iodice D’Enza
Angelos Markos
Elena Tuzhilina
机构
[1] Universitat Pompeu Fabra and Barcelona School of Management,Department of Economics and Business
[2] Erasmus University Rotterdam,Econometric Institute, Erasmus School of Economics
[3] Stanford University,Departments of Statistics and Biomedical Science
[4] University of Naples Federico II,Department of Political Sciences
[5] Democritus University of Thrace,Department of Primary Education
[6] Stanford University,Department of Statistics
关键词
D O I
暂无
中图分类号
学科分类号
摘要
Principal component analysis is a versatile statistical method for reducing a cases-by-variables data table to its essential features, called principal components. Principal components are a few linear combinations of the original variables that maximally explain the variance of all the variables. In the process, the method provides an approximation of the original data table using only these few major components. This Primer presents a comprehensive review of the method’s definition and geometry, as well as the interpretation of its numerical and graphical results. The main graphical result is often in the form of a biplot, using the major components to map the cases and adding the original variables to support the distance interpretation of the cases’ positions. Variants of the method are also treated, such as the analysis of grouped data, as well as the analysis of categorical data, known as correspondence analysis. Also described and illustrated are the latest innovative applications of principal component analysis: for estimating missing values in huge data matrices, sparse component estimation, and the analysis of images, shapes and functions. Supplementary material includes video animations and computer scripts in the R environment.
引用
收藏
相关论文
共 50 条
  • [1] Principal Component Projection Without Principal Component Analysis
    Frostig, Roy
    Musco, Cameron
    Musco, Christopher
    Sidford, Aaron
    [J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 48, 2016, 48
  • [2] Principal component analysis
    Bro, Rasmus
    Smilde, Age K.
    [J]. ANALYTICAL METHODS, 2014, 6 (09) : 2812 - 2831
  • [3] Principal component analysis
    Wallen, Hayley
    [J]. NATURE REVIEWS METHODS PRIMERS, 2022, 2 (01):
  • [4] Principal component analysis
    Jake Lever
    Martin Krzywinski
    Naomi Altman
    [J]. Nature Methods, 2017, 14 : 641 - 642
  • [5] PRINCIPAL COMPONENT ANALYSIS
    WOLD, S
    ESBENSEN, K
    GELADI, P
    [J]. CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 1987, 2 (1-3) : 37 - 52
  • [6] Principal component analysis
    School of Behavioral and Brain Sciences, University of Texas at Dallas, MS: GR4.1, Richardson, TX 75080-3021, United States
    不详
    [J]. Wiley Interdiscip. Rev. Comput. Stat., 4 (433-459):
  • [7] Principal component analysis
    Abdi, Herve
    Williams, Lynne J.
    [J]. WILEY INTERDISCIPLINARY REVIEWS-COMPUTATIONAL STATISTICS, 2010, 2 (04): : 433 - 459
  • [8] Principal component analysis
    Hess, Aaron S.
    Hess, John R.
    [J]. TRANSFUSION, 2018, 58 (07) : 1580 - 1582
  • [9] PRINCIPAL COMPONENT ANALYSIS
    ARIES, RE
    LIDIARD, DP
    SPRAGG, RA
    [J]. CHEMISTRY IN BRITAIN, 1991, 27 (09) : 821 - 824
  • [10] Segmented principal component transform-principal component analysis
    Barros, AS
    Rutledge, DN
    [J]. CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 2005, 78 (1-2) : 125 - 137