Active learning with support vector machines in the drug discovery process

被引:254
|
作者
Warmuth, MK [1 ]
Liao, J
Rätsch, G
Mathieson, M
Putta, S
Lemmen, C
机构
[1] Univ Calif Santa Cruz, Dept Comp Sci, Santa Cruz, CA 95064 USA
[2] Australian Natl Univ, RSISE, Canberra, ACT 0200, Australia
[3] Rational Discovery LLC, Palo Alto, CA 94301 USA
[4] BioSolveIT GMBH, D-53757 St Augustin, Germany
关键词
D O I
10.1021/ci025620t
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
We investigate the following data mining problem from computer-aided drug design: From a large collection of compounds, find those that bind to a target molecule in as few iterations of biochemical testing as possible. In each iteration a comparatively small batch of compounds is screened for binding activity toward this target. We employed the so-called "active learning paradigm" from Machine Learning for selecting the successive batches. Our main selection strategy is based on the maximum margin hyperplane-generated by "Support Vector Machines". This hyperplane separates the current set of active from the inactive compounds and has the largest possible distance from any labeled compound. We perform a thorough comparative study of various other selection strategies on data sets provided by DuPont Pharmaceuticals and show that the strategies based on the maximum margin hyperplane clearly outperform the simpler ones.
引用
收藏
页码:667 / 673
页数:7
相关论文
共 50 条
  • [21] Learning curves of support vector machines
    Ikeda, K
    PROCEEDINGS OF THE 2005 INTERNATIONAL CONFERENCE ON NEURAL NETWORKS AND BRAIN, VOLS 1-3, 2005, : 1708 - 1713
  • [22] Incremental learning with Support Vector Machines
    Rüping, S
    2001 IEEE INTERNATIONAL CONFERENCE ON DATA MINING, PROCEEDINGS, 2001, : 641 - 642
  • [23] Learning with rigorous support vector machines
    Bi, J
    Vapnik, VN
    LEARNING THEORY AND KERNEL MACHINES, 2003, 2777 : 243 - 257
  • [24] Combining active learning and transductive support vector machines for sea ice detection
    Han, Yanling
    Li, Peng
    Zhang, Yun
    Hong, Zhonghua
    Liu, Kaichen
    Wang, Jing
    JOURNAL OF APPLIED REMOTE SENSING, 2018, 12 (02)
  • [25] Sampling Active Learning Based on Non-parallel Support Vector Machines
    Xijiong Xie
    Neural Processing Letters, 2021, 53 : 2081 - 2094
  • [26] Circuit Performance Classification With Active Learning Guided Sampling for Support Vector Machines
    Lin, Honghuang
    Li, Peng
    IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS, 2015, 34 (09) : 1467 - 1480
  • [27] Support vector machines based active learning for the relevance feedback document retrieval
    Onoda, Takashi
    Murata, Hiroshi
    Yamada, Seiji
    2006 IEEE/WIC/ACM INTERNATIONAL CONFERENCE ON WEB INTELLIGENCE AND INTELLIGENT AGENT TECHNOLOGY, WORKSHOPS PROCEEDINGS, 2006, : 389 - +
  • [29] An empirical study of active learning with support vector machines for Japanese word segmentation
    Sassano, M
    40TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, PROCEEDINGS OF THE CONFERENCE, 2002, : 505 - 512
  • [30] Sampling Active Learning Based on Non-parallel Support Vector Machines
    Xie, Xijiong
    NEURAL PROCESSING LETTERS, 2021, 53 (03) : 2081 - 2094