The discovery of indicator variables for QSAR using inductive logic programming

被引:19
|
作者
King, RD [1 ]
Srinivasan, A
机构
[1] Univ Wales, Dept Comp Sci, Aberystwyth SY23 3DB, Ceredigion, Wales
[2] Univ Oxford, Comp Lab, Oxford OX1 3QD, England
基金
英国工程与自然科学研究理事会;
关键词
artificial intelligence; machine learning; regression;
D O I
10.1023/A:1007967728701
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
A central problem in forming accurate regression equations in QSAR studies is the selection of appropriate descriptors for the compounds under study. We describe a novel procedure for using inductive logic programming (ILP) to discover new indicator variables (attributes) for QSAR problems, and show that these improve the accuracy of the derived regression equations. ILP techniques have previously been shown to work well on drug design problems where there is a large structural component or where clear comprehensible rules are required. However, ILP techniques have had the disadvantage of only being able to make qualitative predictions e.g. active, inactive) and not to predict real numbers (regression). We unify ILP and linear regression techniques to give a QSAR method that has the strength of ILP at describing steric structure, with the familiarity and power of linear regression. We evaluated the utility of this new QSAR technique by examining the prediction of biological activity with and without the addition of new structural indicator variables formed by ILP. In three out of five datasets examined the addition of ILP variables produced statistically better results (P<0.01) over the original description. The new ILP variables did not increase the overall complexity of the derived QSAR equations and added insight into possible mechanisms of action. We conclude that ILP can aid in the process of drug design.
引用
收藏
页码:571 / 580
页数:10
相关论文
共 50 条
  • [1] The discovery of indicator variables for QSAR using inductive logic programming
    Ross D. King
    Ashwin Srinivasan
    [J]. Journal of Computer-Aided Molecular Design, 1997, 11 : 571 - 580
  • [2] Scientific knowledge discovery using inductive logic programming
    Muggleton, S
    [J]. COMMUNICATIONS OF THE ACM, 1999, 42 (11) : 42 - 46
  • [3] Biochemical knowledge discovery using inductive logic programming
    Muggleton, S
    Srinivasan, A
    King, RD
    Sternberg, MJE
    [J]. DISCOVERY SCIENCE, 1998, 1532 : 326 - 341
  • [4] Pharmacophore discovery using the Inductive Logic Programming system PROGOL
    Finn, P
    Muggleton, S
    Page, D
    Srinivasan, A
    [J]. MACHINE LEARNING, 1998, 30 (2-3) : 241 - 270
  • [5] Scaffold hopping in drug discovery using inductive logic programming
    Tsunoyama, Kazuhisa
    Amini, Ata
    Sternberg, Michael J. E.
    Muggleton, Stephen H.
    [J]. JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2008, 48 (05) : 949 - 957
  • [6] Knowledge discovery for pancreatic cancer using inductive logic programming
    Qiu, Yushan
    Shimada, Kazuaki
    Hiraoka, Nobuyoshi
    Maeshiro, Kensei
    Ching, Wai-Ki
    Aoki-Kinoshita, Kiyoko F.
    Furuta, Koh
    [J]. IET SYSTEMS BIOLOGY, 2014, 8 (04) : 162 - 168
  • [7] Knowledge Discovery in Variant Databases Using Inductive Logic programming
    Hoan Nguyen
    Tien-Dao Luu
    Poch, Olivier
    Thompson, Julie D.
    [J]. BIOINFORMATICS AND BIOLOGY INSIGHTS, 2013, 7 : 119 - 131
  • [8] Pharmacophore Discovery Using the Inductive Logic Programming System PROGOL
    Paul Finn
    Stephen Muggleton
    David Page
    Ashwin Srinivasan
    [J]. Machine Learning, 1998, 30 : 241 - 270
  • [9] Inductive logic programming for relational knowledge discovery
    Lavrac, N
    [J]. LOGIC PROGRAMMING - PROCEEDINGS OF THE 1998 JOINT INTERNATIONAL CONFERENCE AND SYMPOSIUM ON LOGIC PROGRAMMING, 1998, : 7 - 22
  • [10] Inductive logic programming for relational knowledge discovery
    Nada Lavrač
    Saŝo Džeroski
    Masayuki Numao
    [J]. New Generation Computing, 1999, 17 : 3 - 23