Active learning with support vector machines in the drug discovery process

被引:254
|
作者
Warmuth, MK [1 ]
Liao, J
Rätsch, G
Mathieson, M
Putta, S
Lemmen, C
机构
[1] Univ Calif Santa Cruz, Dept Comp Sci, Santa Cruz, CA 95064 USA
[2] Australian Natl Univ, RSISE, Canberra, ACT 0200, Australia
[3] Rational Discovery LLC, Palo Alto, CA 94301 USA
[4] BioSolveIT GMBH, D-53757 St Augustin, Germany
关键词
D O I
10.1021/ci025620t
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
We investigate the following data mining problem from computer-aided drug design: From a large collection of compounds, find those that bind to a target molecule in as few iterations of biochemical testing as possible. In each iteration a comparatively small batch of compounds is screened for binding activity toward this target. We employed the so-called "active learning paradigm" from Machine Learning for selecting the successive batches. Our main selection strategy is based on the maximum margin hyperplane-generated by "Support Vector Machines". This hyperplane separates the current set of active from the inactive compounds and has the largest possible distance from any labeled compound. We perform a thorough comparative study of various other selection strategies on data sets provided by DuPont Pharmaceuticals and show that the strategies based on the maximum margin hyperplane clearly outperform the simpler ones.
引用
收藏
页码:667 / 673
页数:7
相关论文
共 50 条
  • [31] Active Learning Support Vector Machines to Classify Imbalanced Reservoir Simulation Data
    Yu, Tina
    2010 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS IJCNN 2010, 2010,
  • [32] Active-Learning Approaches for Landslide Mapping Using Support Vector Machines
    Wang, Zhihao
    Brenning, Alexander
    REMOTE SENSING, 2021, 13 (13)
  • [33] An Uncertainty sampling-based Active Learning Approach For Support Vector Machines
    Xu, Hailong
    Wang, Xiaodan
    Liao, Yong
    Zheng, Chunying
    2009 INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND COMPUTATIONAL INTELLIGENCE, VOL III, PROCEEDINGS, 2009, : 208 - 213
  • [34] An Improved Active Learning Sparse Least Squares Support Vector Machines for Regression
    Si Gangquan
    Shi Jianquan
    Guo Zhang
    Gao Hong
    2015 27TH CHINESE CONTROL AND DECISION CONFERENCE (CCDC), 2015, : 4558 - 4562
  • [35] Drug design by machine learning: support vector machines for pharmaceutical data analysis
    Burbidge, R
    Trotter, M
    Buxton, B
    Holden, S
    COMPUTERS & CHEMISTRY, 2001, 26 (01): : 5 - 14
  • [36] Learning students' learning patterns with support vector machines
    Liu, Chao-Lin
    FOUNDATIONS OF INTELLIGENT SYSTEMS, PROCEEDINGS, 2006, 4203 : 601 - 611
  • [37] Support Vector Machines Applied to a Combustion Process
    Torres, Claudia I.
    Hernandez, Fernando
    Trejo, Antonio
    Ronquillo, Guillermo
    2012 IEEE NINTH ELECTRONICS, ROBOTICS AND AUTOMOTIVE MECHANICS CONFERENCE (CERMA 2012), 2012, : 176 - 181
  • [38] A comparative analysis of support vector machines and extreme learning machines
    Liu, Xueyi
    Gao, Chuanhou
    Li, Ping
    NEURAL NETWORKS, 2012, 33 : 58 - 66
  • [39] Relevance regression learning with support vector machines
    Apolloni, Bruno
    Malchiodi, Dario
    Valerio, Lorenzo
    NONLINEAR ANALYSIS-THEORY METHODS & APPLICATIONS, 2010, 73 (09) : 2855 - 2867
  • [40] Visualization of support vector machines with unsupervised learning
    Hamel, Lutz
    PROCEEDINGS OF THE 2006 IEEE SYMPOSIUM ON COMPUTATIONAL INTELLIGENCE IN BIOINFORMATICS AND COMPUTATIONAL BIOLOGY, 2006, : 148 - 155