Exploring Alternative Strategies for the Identification of Potent Compounds Using Support Vector Machine and Regression Modeling

被引:6
|
作者
Miyao, Tomoyuki [1 ,2 ]
Funatsu, Kimito [1 ,2 ,3 ]
Bajorath, Juergen [4 ]
机构
[1] Nara Inst Sci & Technol, Data Sci Ctr, 8916-5 Takayama Cho, Nara 6300192, Japan
[2] Nara Inst Sci & Technol, Grad Sch Sci & Technol, 8916-5 Takayama Cho, Nara 6300192, Japan
[3] Univ Tokyo, Sch Engn, Dept Chem Syst Engn, Bunkyo Ku, 7-3-1 Hongo, Tokyo 1138656, Japan
[4] Rhein Friedrich Wilhelms Univ, LIMES Program Unit Chem Biol & Med Chem, B IT, Dept Life Sci Informat, Endenicher Allee 19c, D-53115 Bonn, Germany
关键词
ACTIVITY CLIFFS; CURRENT TRENDS;
D O I
10.1021/acs.jcim.8b00584
中图分类号
R914 [药物化学];
学科分类号
100701 ;
摘要
Support vector regression (SVR) is a premier approach for the prediction of compound potency. Given the conceptual link between support vector machine (SVM) and SVR modeling, SVR is capable of accounting for continuous and discontinuous structure activity relationships (SARs) in potency prediction, which further extends the classical quantitative SAR (QSAR) paradigm. In the context of virtual compound screening, compound potency prediction can be applied to identify the most potent compounds that are available or enrich database selection sets with potent compounds. To these ends, we have evaluated new potency prediction strategies. Conventional (direct) potency prediction using SVR was compared to two-stage SVM-SVR modeling and potency prediction using SVR models trained in the presence of active and inactive compounds, a previously unconsidered approach. The latter models were found to maximize the recall of potent compounds but were least accurate in predicting high potency values. For this purpose, direct SVR predictions were preferred. However, the best balance between accurate potency predictions and enrichment of potent compounds in database selection sets was achieved by combined SVM-SVR modeling. Taken together, our findings further extend current approaches for compound potency prediction in virtual compound screening.
引用
收藏
页码:983 / 992
页数:10
相关论文
共 50 条
  • [41] Interval Support Vector Machine in Regression Analysis
    Arjmandzadeh, Ameneh
    Effati, Sohrab
    Zamirian, Mohammad
    [J]. JOURNAL OF MATHEMATICS AND COMPUTER SCIENCE-JMCS, 2011, 2 (03): : 565 - 571
  • [42] A rough ν-twin support vector regression machine
    Xue, Zhenxia
    Zhang, Roxin
    Qin, Chuandong
    Zeng, Xiaoqing
    [J]. APPLIED INTELLIGENCE, 2018, 48 (11) : 4023 - 4046
  • [43] A rough ν-twin support vector regression machine
    Zhenxia Xue
    Roxin Zhang
    Chuandong Qin
    Xiaoqing Zeng
    [J]. Applied Intelligence, 2018, 48 : 4023 - 4046
  • [44] On-line support vector machine regression
    Martin, M
    [J]. MACHINE LEARNING: ECML 2002, 2002, 2430 : 282 - 294
  • [45] Fuzzy support vector machine for regression estimation
    Sun, ZH
    Sun, YX
    [J]. 2003 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN AND CYBERNETICS, VOLS 1-5, CONFERENCE PROCEEDINGS, 2003, : 3336 - 3341
  • [46] A Support Vector Machine for Regression in Complex Field
    Lang, Rongling
    Zhao, Fei
    Shi, Yongtang
    [J]. INFORMATICA, 2017, 28 (04) : 651 - 664
  • [47] Twin Support Vector Machine Based Regression
    Khemchandani, Reshma
    Goyal, Keshav
    Chandra, Suresh
    [J]. 2015 EIGHTH INTERNATIONAL CONFERENCE ON ADVANCES IN PATTERN RECOGNITION (ICAPR), 2015, : 18 - +
  • [48] Possibilistic Regression Analysis by Support Vector Machine
    Hao, Pei-Yi
    [J]. IEEE INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS (FUZZ 2011), 2011, : 889 - 894
  • [49] A fuzzy model of support vector machine regression
    Hao, PY
    Chiang, JH
    [J]. PROCEEDINGS OF THE 12TH IEEE INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS, VOLS 1 AND 2, 2003, : 738 - 742
  • [50] Prediction intervals for support vector machine regression
    Seok, K
    Hwang, C
    Cho, D
    [J]. COMMUNICATIONS IN STATISTICS-THEORY AND METHODS, 2002, 31 (10) : 1887 - 1898