Folded concave penalized learning of high-dimensional MRI data in Parkinson's disease

被引:1
|
作者
Li, Changcheng [1 ]
Wang, Xue [2 ]
Du, Guangwei [3 ,5 ]
Chen, Hairong [3 ]
Brown, Gregory [3 ]
Lewis, Mechelle M. [3 ,4 ]
Yao, Tao [2 ]
Li, Runze [1 ]
Huang, Xuemei [3 ,4 ,5 ,6 ,7 ]
机构
[1] Penn State Univ, Dept Stat, University Pk, PA 16802 USA
[2] Alibaba DAMO Acad, Seattle, WA USA
[3] Penn State Hershey Med Ctr, Dept Neurol, Hershey, PA 17033 USA
[4] Penn State Hershey Med Ctr, Dept Pharmacol, Hershey, PA 17033 USA
[5] Penn State Hershey Med Ctr, Dept Radiol, Hershey, PA 17033 USA
[6] Penn State Hershey Med Ctr, Dept Neurosurg, Hershey, PA 17033 USA
[7] Penn State Hershey Med Ctr, Dept Kinesiol, Hershey, PA 17033 USA
基金
美国国家科学基金会;
关键词
Magnetic resonance imaging; Diffusion tensor imaging; Folded concave penalized learning; Support vector machine; Logistic regression; LOGISTIC-REGRESSION; VARIABLE SELECTION; MODEL SELECTION; LASSO; SPARSITY; LIKELIHOOD; RECOVERY; CLASSIFICATION; REGISTRATION; THICKNESS;
D O I
10.1016/j.jneumeth.2021.109157
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: Brain MRI is a promising technique for Parkinson's disease (PD) biomarker development. Its analysis, however, is hindered by the high-dimensional nature of the data, particularly when the sample size is relatively small. New Method: This study introduces a folded concave penalized machine learning scheme with spatial coupling fused penalty (fused FCP) to build biomarkers for PD directly from whole-brain voxel-wise MRI data. The penalized maximum likelihood estimation problem of the model is solved by local linear approximation. Results: The proposed approach is evaluated on synthetic and Parkinson's Progression Marker Initiative (PPMI) data. It achieves good AUC scores, accuracy in classification, and biomarker identification with a relatively small sample size, and the results are robust for different tuning parameter choices. On the PPMI data, the proposed method discovers over 80 % of large regions of interest (ROIs) identified by the voxel-wise method, as well as potential new ROIs. Comparison with Existing Methods: The fused FCP approach is compared with L1, fused-L1, and FCP method using three popular machine learning algorithms, logistic regression, support vector machine, and linear discriminant analysis, as well as the voxel-wise method, on both synthetic and PPMI datasets. The fused FCP method demonstrated better accuracy in separating PD from controls than L1 and fused-L1 methods, and similar performance when compared with FCP method. In addition, the fused FCP method showed better ROI identification. Conclusions: The fused FCP method can be an effective approach for MRI biomarker discovery in PD and other studies using high dimensionality data/low sample sizes.
引用
收藏
页数:14
相关论文
共 50 条
  • [21] Learning high-dimensional multimedia data
    Zhu, Xiaofeng
    Jin, Zhi
    Ji, Rongrong
    MULTIMEDIA SYSTEMS, 2017, 23 (03) : 281 - 283
  • [22] Rank penalized estimators for high-dimensional matrices
    Klopp, Olga
    ELECTRONIC JOURNAL OF STATISTICS, 2011, 5 : 1161 - 1183
  • [23] Targeted Inference Involving High-Dimensional Data Using Nuisance Penalized Regression
    Sun, Qiang
    Zhang, Heping
    JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2021, 116 (535) : 1472 - 1486
  • [24] Improving Penalized Logistic Regression Model with Missing Values in High-Dimensional Data
    Alharthi, Aiedh Mrisi
    Lee, Muhammad Hisyam
    Algamal, Zakariya Yahya
    INTERNATIONAL JOURNAL OF ONLINE AND BIOMEDICAL ENGINEERING, 2022, 18 (02) : 40 - 54
  • [25] Penalized mixtures of factor analyzers with application to clustering high-dimensional microarray data
    Xie, Benhuai
    Pan, Wei
    Shen, Xiaotong
    BIOINFORMATICS, 2010, 26 (04) : 501 - 508
  • [26] Penalized empirical likelihood for high-dimensional generalized linear models with longitudinal data
    Chen, Xia
    Tan, Xiaoyan
    Yan, Li
    JOURNAL OF STATISTICAL COMPUTATION AND SIMULATION, 2023, 93 (10) : 1515 - 1531
  • [27] Bayesian penalized cumulative logit model for high-dimensional data with an ordinal response
    Zhang, Yiran
    Archer, Kellie J.
    STATISTICS IN MEDICINE, 2021, 40 (06) : 1453 - 1481
  • [28] Coordinate ascent for penalized semiparametric regression on high-dimensional panel count data
    Wu, Tong Tong
    He, Xin
    COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2012, 56 (01) : 25 - 33
  • [29] Efficient Learning on High-dimensional Operational Data
    Samani, Forough Shahab
    Zhang, Hongyi
    Stadler, Rolf
    2019 15TH INTERNATIONAL CONFERENCE ON NETWORK AND SERVICE MANAGEMENT (CNSM), 2019,
  • [30] PCA learning for sparse high-dimensional data
    Hoyle, DC
    Rattray, M
    EUROPHYSICS LETTERS, 2003, 62 (01): : 117 - 123