High-dimensional QSAR modelling using penalized linear regression model with L1/2-norm

被引:21
|
作者
Algamal, Z. Y. [1 ]
Lee, M. H. [1 ]
Al-Fakih, A. M. [2 ]
Aziz, M. [2 ]
机构
[1] Univ Teknol Malaysia, Dept Math Sci, Johor Baharu, Malaysia
[2] Univ Teknol Malaysia, Dept Chem, Johor Baharu, Malaysia
关键词
QSAR; bridge penalty; L1; 2-norm; penalized method; imidazo[4; 5-b]pyridine derivatives; procollagen C-proteinase; ADAPTIVE ELASTIC-NET; LOGISTIC-REGRESSION; GENE SELECTION; CORROSION INHIBITION; VARIABLE SELECTION; ANTICANCER POTENCY; DIVERGING NUMBER; BRIDGE; LASSO; DERIVATIVES;
D O I
10.1080/1062936X.2016.1228696
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
In high-dimensional quantitative structure-activity relationship (QSAR) modelling, penalization methods have been a popular choice to simultaneously address molecular descriptor selection and QSAR model estimation. In this study, a penalized linear regression model with L-1/2-norm is proposed. Furthermore, the local linear approximation algorithm is utilized to avoid the non-convexity of the proposed method. The potential applicability of the proposed method is tested on several benchmark data sets. Compared with other commonly used penalized methods, the proposed method can not only obtain the best predictive ability, but also provide an easily interpretable QSAR model. In addition, it is noteworthy that the results obtained in terms of applicability domain and Y-randomization test provide an efficient and a robust QSAR model. It is evident from the results that the proposed method may possibly be a promising penalized method in the field of computational chemistry research, especially when the number of molecular descriptors exceeds the number of compounds.
引用
收藏
页码:703 / 719
页数:17
相关论文
共 50 条
  • [1] The L1 penalized LAD estimator for high dimensional linear regression
    Wang, Lie
    [J]. JOURNAL OF MULTIVARIATE ANALYSIS, 2013, 120 : 135 - 151
  • [2] High-dimensional regression coefficient estimation by nuclear norm plus l1 norm penalization
    Farne, Matteo
    Montanari, Angela
    [J]. STAT, 2023, 12 (01):
  • [3] Penalized logistic regression based on L1/2 penalty for high-dimensional DNA methylation data
    Jiang, Hong-Kun
    Liang, Yong
    [J]. TECHNOLOGY AND HEALTH CARE, 2020, 28 : S161 - S171
  • [4] Penalized high-dimensional M-quantile regression: From L1 to Lp optimization
    Hu, Jie
    Chen, Yu
    Zhang, Weiping
    Guo, Xiao
    [J]. CANADIAN JOURNAL OF STATISTICS-REVUE CANADIENNE DE STATISTIQUE, 2021, 49 (03): : 875 - 905
  • [5] PENALIZED LINEAR REGRESSION WITH HIGH-DIMENSIONAL PAIRWISE SCREENING
    Gong, Siliang
    Zhang, Kai
    Liu, Yufeng
    [J]. STATISTICA SINICA, 2021, 31 (01) : 391 - 420
  • [6] Feature Grouping using Weighted l1 Norm for High-Dimensional Data
    Vinzamuri, Bhanukiran
    Padthe, Karthik K.
    Reddy, Chandan K.
    [J]. 2016 IEEE 16TH INTERNATIONAL CONFERENCE ON DATA MINING (ICDM), 2016, : 1233 - 1238
  • [7] l1-Penalized Pairwise Difference Estimation for a High-Dimensional Censored Regression Model
    Pan, Zhewen
    Xie, Jianhui
    [J]. JOURNAL OF BUSINESS & ECONOMIC STATISTICS, 2023, 41 (02) : 283 - 297
  • [8] High-dimensional mean estimation via l1 penalized normal likelihood
    Katayama, Shota
    [J]. JOURNAL OF MULTIVARIATE ANALYSIS, 2014, 130 : 90 - 106
  • [9] Penalized regression combining the L1 norm and a correlation based penalty
    Anbari M.E.
    Mkhadri A.
    [J]. Sankhya B, 2014, 76 (1) : 82 - 102
  • [10] l1-PENALIZED QUANTILE REGRESSION IN HIGH-DIMENSIONAL SPARSE MODELS
    Belloni, Alexandre
    Chernozhukov, Victor
    [J]. ANNALS OF STATISTICS, 2011, 39 (01): : 82 - 130