MWPCR: Multiscale Weighted Principal Component Regression for High-Dimensional Prediction

被引:8
|
作者
Zhu, Hongtu [1 ,2 ]
Shen, Dan [3 ,4 ]
Peng, Xuewei [5 ]
Liu, Leo Yufeng [1 ,6 ]
机构
[1] Univ Texas MD Anderson Canc Ctr, Dept Biostat, Houston, TX 77030 USA
[2] Univ N Carolina, Dept Biostat, Chapel Hill, NC USA
[3] Univ S Florida, Interdisciplinary Data Sci Consortium, Tampa, FL USA
[4] Univ S Florida, Dept Math & Stat, Tampa, FL USA
[5] Texas A&M Univ, College Stn, TX USA
[6] Univ N Carolina, Dept Stat & Operat Res, Chapel Hill, NC USA
关键词
Alzheimer; Feature; Principal component analysis; Regression; Spatial; Supervised; MODELS; CLASSIFICATION; VARIABLES; TUTORIAL;
D O I
10.1080/01621459.2016.1261710
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
We propose a multiscale weighted principal component regression (MWPCR) framework for the use of high-dimensional features with strong spatial features (e.g., smoothness and correlation) to predict an outcome variable, such as disease status. This development is motivated by identifying imaging biomarkers that could potentially aid detection, diagnosis, assessment of prognosis, prediction of response to treatment, and monitoring of disease status, among many others. The MWPCR can be regarded as a novel integration of principal components analysis (PCA), kernel methods, and regression models. In MWPCR, we introduce various weight matrices to prewhitten high-dimensional feature vectors, perform matrix decomposition for both dimension reduction and feature extraction, and build a prediction model by using the extracted features. Examples of such weight matrices include an importance score weight matrix for the selection of individual features at each location and a spatial weight matrix for the incorporation of the spatial pattern of feature vectors. We integrate the importance of score weights with the spatial weights to recover the low-dimensional structure of high-dimensional features. We demonstrate the utility of our methods through extensive simulations and real data analyses of the Alzheimer's disease neuroimaging initiative (ADNI) dataset. Supplementary materials for this article are available online.
引用
收藏
页码:1009 / 1021
页数:13
相关论文
共 50 条
  • [41] Local prediction models by principal component regression
    Xie, YL
    Kalivas, JH
    ANALYTICA CHIMICA ACTA, 1997, 348 (1-3) : 29 - 38
  • [42] Cauchy robust principal component analysis with applications to high-dimensional data sets
    Fayomi, Aisha
    Pantazis, Yannis
    Tsagris, Michail
    Wood, Andrew T. A.
    STATISTICS AND COMPUTING, 2024, 34 (01)
  • [43] Exploring high-dimensional biological data with sparse contrastive principal component analysis
    Boileau, Philippe
    Hejazi, Nima S.
    Dudoit, Sandrine
    BIOINFORMATICS, 2020, 36 (11) : 3422 - 3430
  • [44] Dynamic principal component CAW models for high-dimensional realized covariance matrices
    Gribisch, Bastian
    Stollenwerk, Michael
    QUANTITATIVE FINANCE, 2020, 20 (05) : 799 - 821
  • [45] Adaptive local Principal Component Analysis improves the clustering of high-dimensional data
    Migenda, Nico
    Moeller, Ralf
    Schenck, Wolfram
    PATTERN RECOGNITION, 2024, 146
  • [46] Cauchy robust principal component analysis with applications to high-dimensional data sets
    Aisha Fayomi
    Yannis Pantazis
    Michail Tsagris
    Andrew T. A. Wood
    Statistics and Computing, 2024, 34
  • [47] Objective-sensitive principal component analysis for high-dimensional inverse problems
    Elizarev, Maksim
    Mukhin, Andrei
    Khlyupin, Aleksey
    COMPUTATIONAL GEOSCIENCES, 2021, 25 (06) : 2019 - 2031
  • [48] Asymptotic Distribution of Studentized Contribution Ratio in High-Dimensional Principal Component Analysis
    Hyodo, Masashi
    Yamada, Takayuki
    COMMUNICATIONS IN STATISTICS-SIMULATION AND COMPUTATION, 2009, 38 (04) : 905 - 917
  • [49] Objective-sensitive principal component analysis for high-dimensional inverse problems
    Maksim Elizarev
    Andrei Mukhin
    Aleksey Khlyupin
    Computational Geosciences, 2021, 25 : 2019 - 2031
  • [50] High-dimensional inference with the generalized Hopfield model: Principal component analysis and corrections
    Cocco, S.
    Monasson, R.
    Sessak, V.
    PHYSICAL REVIEW E, 2011, 83 (05):