High-dimensional QSAR modelling using penalized linear regression model with L1/2-norm

被引:21
|
作者
Algamal, Z. Y. [1 ]
Lee, M. H. [1 ]
Al-Fakih, A. M. [2 ]
Aziz, M. [2 ]
机构
[1] Univ Teknol Malaysia, Dept Math Sci, Johor Baharu, Malaysia
[2] Univ Teknol Malaysia, Dept Chem, Johor Baharu, Malaysia
关键词
QSAR; bridge penalty; L1; 2-norm; penalized method; imidazo[4; 5-b]pyridine derivatives; procollagen C-proteinase; ADAPTIVE ELASTIC-NET; LOGISTIC-REGRESSION; GENE SELECTION; CORROSION INHIBITION; VARIABLE SELECTION; ANTICANCER POTENCY; DIVERGING NUMBER; BRIDGE; LASSO; DERIVATIVES;
D O I
10.1080/1062936X.2016.1228696
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
In high-dimensional quantitative structure-activity relationship (QSAR) modelling, penalization methods have been a popular choice to simultaneously address molecular descriptor selection and QSAR model estimation. In this study, a penalized linear regression model with L-1/2-norm is proposed. Furthermore, the local linear approximation algorithm is utilized to avoid the non-convexity of the proposed method. The potential applicability of the proposed method is tested on several benchmark data sets. Compared with other commonly used penalized methods, the proposed method can not only obtain the best predictive ability, but also provide an easily interpretable QSAR model. In addition, it is noteworthy that the results obtained in terms of applicability domain and Y-randomization test provide an efficient and a robust QSAR model. It is evident from the results that the proposed method may possibly be a promising penalized method in the field of computational chemistry research, especially when the number of molecular descriptors exceeds the number of compounds.
引用
收藏
页码:703 / 719
页数:17
相关论文
共 50 条
  • [31] Linear Classifiers with the L1 Margin from a Small Number of High-Dimensional Vectors
    Bobrowski, Leon
    Lukaszuk, Tomasz
    [J]. INTELLIGENT INFORMATION AND DATABASE SYSTEMS (ACIIDS 2020), PT II, 2020, 12034 : 79 - 89
  • [32] Penalized least-squares estimation for regression coefficients in high-dimensional partially linear models
    Ni, Huey-Fan
    [J]. JOURNAL OF STATISTICAL PLANNING AND INFERENCE, 2012, 142 (02) : 379 - 389
  • [33] Weighted l1-Penalized Corrected Quantile Regression for High-Dimensional Temporally Dependent Measurement Errors
    Bhattacharjee, Monika
    Chakraborty, Nilanjan
    Koul, Hira L.
    [J]. JOURNAL OF TIME SERIES ANALYSIS, 2023, 44 (5-6) : 442 - 473
  • [34] Automatic selection by penalized asymmetric Lq-norm in a high-dimensional model with grouped variables
    Alcaraz, Angelo
    Ciuperca, Gabriela
    [J]. STATISTICS, 2023, 57 (05) : 1202 - 1238
  • [35] Manifold optimization-based analysis dictionary learning with an l1/2-norm regularizer
    Li, Zhenni
    Ding, Shuxue
    Li, Yujie
    Yang, Zuyuan
    Xie, Shengli
    Chen, Wuhui
    [J]. NEURAL NETWORKS, 2018, 98 : 212 - 222
  • [36] A penalized likelihood-based quality monitoring via L2-norm regularization for high-dimensional processes
    Kim, Sangahn
    Jeong, Myong K.
    Elsayed, Elsayed A.
    [J]. JOURNAL OF QUALITY TECHNOLOGY, 2020, 52 (03) : 265 - 280
  • [37] L1/2-norm Regularization for Detecting Aero-engine Fan Acoustic Mode
    Li, Zhendong
    Qiao, Baijie
    Wen, Bi
    Li, Zepeng
    Chen, Xuefeng
    [J]. 2022 IEEE INTERNATIONAL INSTRUMENTATION AND MEASUREMENT TECHNOLOGY CONFERENCE (I2MTC 2022), 2022,
  • [38] MODEL SELECTION FOR HIGH-DIMENSIONAL LINEAR REGRESSION WITH DEPENDENT OBSERVATIONS
    Ing, Ching-Kang
    [J]. ANNALS OF STATISTICS, 2020, 48 (04): : 1959 - 1980
  • [39] The likelihood ratio test for high-dimensional linear regression model
    Xie, Junshan
    Xiao, Nannan
    [J]. COMMUNICATIONS IN STATISTICS-THEORY AND METHODS, 2017, 46 (17) : 8479 - 8492
  • [40] Backfitting algorithms for total-variation and empirical-norm penalized additive modelling with high-dimensional data
    Yang, Ting
    Tan, Zhiqiang
    [J]. STAT, 2018, 7 (01):