Folded concave penalized learning of high-dimensional MRI data in Parkinson's disease

被引：1

作者：

Li, Changcheng ^{[1
]}

Wang, Xue ^{[2
]}

Du, Guangwei ^{[3
,5
]}

Chen, Hairong ^{[3
]}

Brown, Gregory ^{[3
]}

Lewis, Mechelle M. ^{[3
,4
]}

Yao, Tao ^{[2
]}

Li, Runze ^{[1
]}

Huang, Xuemei ^{[3
,4
,5
,6
,7
]}

机构：

[1] Penn State Univ, Dept Stat, University Pk, PA 16802 USA

[2] Alibaba DAMO Acad, Seattle, WA USA

[3] Penn State Hershey Med Ctr, Dept Neurol, Hershey, PA 17033 USA

[4] Penn State Hershey Med Ctr, Dept Pharmacol, Hershey, PA 17033 USA

[5] Penn State Hershey Med Ctr, Dept Radiol, Hershey, PA 17033 USA

[6] Penn State Hershey Med Ctr, Dept Neurosurg, Hershey, PA 17033 USA

[7] Penn State Hershey Med Ctr, Dept Kinesiol, Hershey, PA 17033 USA

来源：

JOURNAL OF NEUROSCIENCE METHODS | 2021年 / 357卷

基金：

美国国家科学基金会;

关键词：

Magnetic resonance imaging; Diffusion tensor imaging; Folded concave penalized learning; Support vector machine; Logistic regression; LOGISTIC-REGRESSION; VARIABLE SELECTION; MODEL SELECTION; LASSO; SPARSITY; LIKELIHOOD; RECOVERY; CLASSIFICATION; REGISTRATION; THICKNESS;

D O I：

10.1016/j.jneumeth.2021.109157

中图分类号：

Q5 [生物化学];

学科分类号：

071010 ; 081704 ;

摘要：

Background: Brain MRI is a promising technique for Parkinson's disease (PD) biomarker development. Its analysis, however, is hindered by the high-dimensional nature of the data, particularly when the sample size is relatively small. New Method: This study introduces a folded concave penalized machine learning scheme with spatial coupling fused penalty (fused FCP) to build biomarkers for PD directly from whole-brain voxel-wise MRI data. The penalized maximum likelihood estimation problem of the model is solved by local linear approximation. Results: The proposed approach is evaluated on synthetic and Parkinson's Progression Marker Initiative (PPMI) data. It achieves good AUC scores, accuracy in classification, and biomarker identification with a relatively small sample size, and the results are robust for different tuning parameter choices. On the PPMI data, the proposed method discovers over 80 % of large regions of interest (ROIs) identified by the voxel-wise method, as well as potential new ROIs. Comparison with Existing Methods: The fused FCP approach is compared with L1, fused-L1, and FCP method using three popular machine learning algorithms, logistic regression, support vector machine, and linear discriminant analysis, as well as the voxel-wise method, on both synthetic and PPMI datasets. The fused FCP method demonstrated better accuracy in separating PD from controls than L1 and fused-L1 methods, and similar performance when compared with FCP method. In addition, the fused FCP method showed better ROI identification. Conclusions: The fused FCP method can be an effective approach for MRI biomarker discovery in PD and other studies using high dimensionality data/low sample sizes.

引用

页数：14

共 50 条

[21] Learning high-dimensional multimedia data
Zhu, Xiaofeng
Jin, Zhi
Ji, Rongrong
MULTIMEDIA SYSTEMS, 2017, 23 (03) : 281 - 283
[22] Rank penalized estimators for high-dimensional matrices
Klopp, Olga
ELECTRONIC JOURNAL OF STATISTICS, 2011, 5 : 1161 - 1183
[23] Targeted Inference Involving High-Dimensional Data Using Nuisance Penalized Regression
Sun, Qiang
Zhang, Heping
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2021, 116 (535) : 1472 - 1486
[24] Improving Penalized Logistic Regression Model with Missing Values in High-Dimensional Data
Alharthi, Aiedh Mrisi
Lee, Muhammad Hisyam
Algamal, Zakariya Yahya
INTERNATIONAL JOURNAL OF ONLINE AND BIOMEDICAL ENGINEERING, 2022, 18 (02) : 40 - 54
[25] Penalized mixtures of factor analyzers with application to clustering high-dimensional microarray data
Xie, Benhuai
Pan, Wei
Shen, Xiaotong
BIOINFORMATICS, 2010, 26 (04) : 501 - 508
[26] Penalized empirical likelihood for high-dimensional generalized linear models with longitudinal data
Chen, Xia
Tan, Xiaoyan
Yan, Li
JOURNAL OF STATISTICAL COMPUTATION AND SIMULATION, 2023, 93 (10) : 1515 - 1531
[27] Bayesian penalized cumulative logit model for high-dimensional data with an ordinal response
Zhang, Yiran
Archer, Kellie J.
STATISTICS IN MEDICINE, 2021, 40 (06) : 1453 - 1481
[28] Coordinate ascent for penalized semiparametric regression on high-dimensional panel count data
Wu, Tong Tong
He, Xin
COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2012, 56 (01) : 25 - 33
[29] Efficient Learning on High-dimensional Operational Data
Samani, Forough Shahab
Zhang, Hongyi
Stadler, Rolf
2019 15TH INTERNATIONAL CONFERENCE ON NETWORK AND SERVICE MANAGEMENT (CNSM), 2019,
[30] PCA learning for sparse high-dimensional data
Hoyle, DC
Rattray, M
EUROPHYSICS LETTERS, 2003, 62 (01): : 117 - 123

← 1 2 3 4 5 →