Active learning with support vector machines in the drug discovery process

被引:254
|
作者
Warmuth, MK [1 ]
Liao, J
Rätsch, G
Mathieson, M
Putta, S
Lemmen, C
机构
[1] Univ Calif Santa Cruz, Dept Comp Sci, Santa Cruz, CA 95064 USA
[2] Australian Natl Univ, RSISE, Canberra, ACT 0200, Australia
[3] Rational Discovery LLC, Palo Alto, CA 94301 USA
[4] BioSolveIT GMBH, D-53757 St Augustin, Germany
关键词
D O I
10.1021/ci025620t
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
We investigate the following data mining problem from computer-aided drug design: From a large collection of compounds, find those that bind to a target molecule in as few iterations of biochemical testing as possible. In each iteration a comparatively small batch of compounds is screened for binding activity toward this target. We employed the so-called "active learning paradigm" from Machine Learning for selecting the successive batches. Our main selection strategy is based on the maximum margin hyperplane-generated by "Support Vector Machines". This hyperplane separates the current set of active from the inactive compounds and has the largest possible distance from any labeled compound. We perform a thorough comparative study of various other selection strategies on data sets provided by DuPont Pharmaceuticals and show that the strategies based on the maximum margin hyperplane clearly outperform the simpler ones.
引用
收藏
页码:667 / 673
页数:7
相关论文
共 50 条
  • [1] Support vector machines for drug discovery
    Heikamp, Kathrin
    Bajorath, Juergen
    EXPERT OPINION ON DRUG DISCOVERY, 2014, 9 (01) : 93 - 104
  • [2] Active learning with support vector machines
    Kremer, Jan
    Pedersen, Kim Steenstrup
    Igel, Christian
    WILEY INTERDISCIPLINARY REVIEWS-DATA MINING AND KNOWLEDGE DISCOVERY, 2014, 4 (04) : 313 - 326
  • [3] Advances with support vector machines for novel drug discovery
    Maltarollo, Vinicius Goncalves
    Kronenberger, Thales
    Espinoza, Gabriel Zarzana
    Oliveira, Patricia Rufino
    Honorio, Kathia Maria
    EXPERT OPINION ON DRUG DISCOVERY, 2019, 14 (01) : 23 - 33
  • [4] On multiclass active learning with support vector machines
    Brinker, K
    ECAI 2004: 16TH EUROPEAN CONFERENCE ON ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 2004, 110 : 969 - 970
  • [5] Active Learning Based on Support Vector Machines
    Wang, Ran
    Kwong, Sam
    He, Qiang
    IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN AND CYBERNETICS (SMC 2010), 2010,
  • [6] Active learning in the drug discovery process
    Warmuth, MK
    Rätsch, G
    Mathieson, M
    Liao, J
    Lemmen, C
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 14, VOLS 1 AND 2, 2002, 14 : 1449 - 1456
  • [7] Active Learning of Actions Based on Support Vector Machines
    Ruiz, Francisco
    Sama, Albert
    Agell, Nuria
    ARTIFICIAL INTELLIGENCE RESEARCH AND DEVELOPMENT, 2012, 248 : 37 - +
  • [8] Active learning with support vector machines for tornado prediction
    Trafalis, Theodore B.
    Adrianto, Indra
    Richman, Michael B.
    COMPUTATIONAL SCIENCE - ICCS 2007, PT 1, PROCEEDINGS, 2007, 4487 : 1130 - +
  • [9] Support vector machines for knowledge discovery
    Sugaya, S
    Suzuki, E
    Tsumoto, S
    PRINCIPLES OF DATA MINING AND KNOWLEDGE DISCOVERY, 1999, 1704 : 561 - 567
  • [10] Active learning of environmental data using Support Vector machines
    Kanevski, M
    Pozdnoukhov, A
    Maignan, M
    GIS and Spatial Analysis, Vol 1and 2, 2005, : 1198 - 1203