High-dimensional QSAR modelling using penalized linear regression model with L1/2-norm

被引：21

作者：

Algamal, Z. Y. ^{[1
]}

Lee, M. H. ^{[1
]}

Al-Fakih, A. M. ^{[2
]}

Aziz, M. ^{[2
]}

机构：

[1] Univ Teknol Malaysia, Dept Math Sci, Johor Baharu, Malaysia

[2] Univ Teknol Malaysia, Dept Chem, Johor Baharu, Malaysia

来源：

SAR AND QSAR IN ENVIRONMENTAL RESEARCH | 2016年 / 27卷 / 09期

关键词：

QSAR; bridge penalty; L1; 2-norm; penalized method; imidazo[4; 5-b]pyridine derivatives; procollagen C-proteinase; ADAPTIVE ELASTIC-NET; LOGISTIC-REGRESSION; GENE SELECTION; CORROSION INHIBITION; VARIABLE SELECTION; ANTICANCER POTENCY; DIVERGING NUMBER; BRIDGE; LASSO; DERIVATIVES;

D O I：

10.1080/1062936X.2016.1228696

中图分类号：

O6 [化学];

学科分类号：

0703 ;

摘要：

In high-dimensional quantitative structure-activity relationship (QSAR) modelling, penalization methods have been a popular choice to simultaneously address molecular descriptor selection and QSAR model estimation. In this study, a penalized linear regression model with L-1/2-norm is proposed. Furthermore, the local linear approximation algorithm is utilized to avoid the non-convexity of the proposed method. The potential applicability of the proposed method is tested on several benchmark data sets. Compared with other commonly used penalized methods, the proposed method can not only obtain the best predictive ability, but also provide an easily interpretable QSAR model. In addition, it is noteworthy that the results obtained in terms of applicability domain and Y-randomization test provide an efficient and a robust QSAR model. It is evident from the results that the proposed method may possibly be a promising penalized method in the field of computational chemistry research, especially when the number of molecular descriptors exceeds the number of compounds.

引用

页码：703 / 719

页数：17

共 50 条

[31] Linear Classifiers with the L1 Margin from a Small Number of High-Dimensional Vectors
Bobrowski, Leon
Lukaszuk, Tomasz
[J]. INTELLIGENT INFORMATION AND DATABASE SYSTEMS (ACIIDS 2020), PT II, 2020, 12034 : 79 - 89
[32] Penalized least-squares estimation for regression coefficients in high-dimensional partially linear models
Ni, Huey-Fan
[J]. JOURNAL OF STATISTICAL PLANNING AND INFERENCE, 2012, 142 (02) : 379 - 389
[33] Weighted l1-Penalized Corrected Quantile Regression for High-Dimensional Temporally Dependent Measurement Errors
Bhattacharjee, Monika
Chakraborty, Nilanjan
Koul, Hira L.
[J]. JOURNAL OF TIME SERIES ANALYSIS, 2023, 44 (5-6) : 442 - 473
[34] Automatic selection by penalized asymmetric Lq-norm in a high-dimensional model with grouped variables
Alcaraz, Angelo
Ciuperca, Gabriela
[J]. STATISTICS, 2023, 57 (05) : 1202 - 1238
[35] Manifold optimization-based analysis dictionary learning with an l1/2-norm regularizer
Li, Zhenni
Ding, Shuxue
Li, Yujie
Yang, Zuyuan
Xie, Shengli
Chen, Wuhui
[J]. NEURAL NETWORKS, 2018, 98 : 212 - 222
[36] A penalized likelihood-based quality monitoring via L2-norm regularization for high-dimensional processes
Kim, Sangahn
Jeong, Myong K.
Elsayed, Elsayed A.
[J]. JOURNAL OF QUALITY TECHNOLOGY, 2020, 52 (03) : 265 - 280
[37] L1/2-norm Regularization for Detecting Aero-engine Fan Acoustic Mode
Li, Zhendong
Qiao, Baijie
Wen, Bi
Li, Zepeng
Chen, Xuefeng
[J]. 2022 IEEE INTERNATIONAL INSTRUMENTATION AND MEASUREMENT TECHNOLOGY CONFERENCE (I2MTC 2022), 2022,
[38] MODEL SELECTION FOR HIGH-DIMENSIONAL LINEAR REGRESSION WITH DEPENDENT OBSERVATIONS
Ing, Ching-Kang
[J]. ANNALS OF STATISTICS, 2020, 48 (04): : 1959 - 1980
[39] The likelihood ratio test for high-dimensional linear regression model
Xie, Junshan
Xiao, Nannan
[J]. COMMUNICATIONS IN STATISTICS-THEORY AND METHODS, 2017, 46 (17) : 8479 - 8492
[40] Backfitting algorithms for total-variation and empirical-norm penalized additive modelling with high-dimensional data
Yang, Ting
Tan, Zhiqiang
[J]. STAT, 2018, 7 (01):

← 1 2 3 4 5 →