Implementation of multiple-instance learning in drug activity prediction

被引:22
|
作者
Fu, Gang [2 ]
Nan, Xiaofei [1 ]
Liu, Haining [2 ]
Patel, Ronak Y. [2 ,4 ]
Daga, Pankaj R. [2 ]
Chen, Yixin [1 ]
Wilkins, Dawn E. [1 ]
Doerksen, Robert J. [2 ,3 ]
机构
[1] Univ Mississippi, Sch Engn, Dept Informat & Comp Sci, University, MS 38677 USA
[2] Univ Mississippi, Sch Pharm, Dept Med Chem, University, MS 38677 USA
[3] Univ Mississippi, Sch Pharm, Pharmaceut Sci Res Inst, University, MS 38677 USA
[4] Washington Univ, Sch Med, Dept Genet, St Louis, MO 63108 USA
来源
BMC BIOINFORMATICS | 2012年 / 13卷
基金
美国国家卫生研究院; 美国国家科学基金会;
关键词
PHARMACOPHORE; SIMILARITY; QSAR;
D O I
10.1186/1471-2105-13-S15-S3
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: In the context of drug discovery and development, much effort has been exerted to determine which conformers of a given molecule are responsible for the observed biological activity. In this work we aimed to predict bioactive conformers using a variant of supervised learning, named multiple-instance learning. A single molecule, treated as a bag of conformers, is biologically active if and only if at least one of its conformers, treated as an instance, is responsible for the observed bioactivity; and a molecule is inactive if none of its conformers is responsible for the observed bioactivity. The implementation requires instance-based embedding, and joint feature selection and classification. The goal of the present project is to implement multiple-instance learning in drug activity prediction, and subsequently to identify the bioactive conformers for each molecule. Methods: We encoded the 3-dimensional structures using pharmacophore fingerprints which are binary strings, and accomplished instance-based embedding using calculated dissimilarity distances. Four dissimilarity measures were employed and their performances were compared. 1-norm SVM was used for joint feature selection and classification. The approach was applied to four data sets, and the best proposed model for each data set was determined by using the dissimilarity measure yielding the smallest number of selected features. Results: The predictive abilities of the proposed approach were compared with three classical predictive models without instance-based embedding. The proposed approach produced the best predictive models for one data set and second best predictive models for the rest of the data sets, based on the external validations. To validate the ability of the proposed approach to find bioactive conformers, 12 small molecules with co-crystallized structures were seeded in one data set. 10 out of 12 co-crystallized structures were indeed identified as significant conformers using the proposed approach. Conclusions: The proposed approach was proven not to suffer from overfitting and to be highly competitive with classical predictive models, so it is very powerful for drug activity prediction. The approach was also validated as a useful method for pursuit of bioactive conformers.
引用
收藏
页数:12
相关论文
共 50 条
  • [1] Implementation of multiple-instance learning in drug activity prediction
    Gang Fu
    Xiaofei Nan
    Haining Liu
    Ronak Y Patel
    Pankaj R Daga
    Yixin Chen
    Dawn E Wilkins
    Robert J Doerksen
    [J]. BMC Bioinformatics, 13
  • [2] Drug activity prediction using multiple-instance learning via joint instance and feature selection
    Zhao, Zhendong
    Fu, Gang
    Liu, Sheng
    Elokely, Khaled M.
    Doerksen, Robert J.
    Chen, Yixin
    Wilkins, Dawn E.
    [J]. BMC BIOINFORMATICS, 2013, 14
  • [3] Drug activity prediction using multiple-instance learning via joint instance and feature selection
    Zhendong Zhao
    Gang Fu
    Sheng Liu
    Khaled M Elokely
    Robert J Doerksen
    Yixin Chen
    Dawn E Wilkins
    [J]. BMC Bioinformatics, 14
  • [4] Compact Multiple-Instance Learning
    Chai, Jing
    Liu, Weiwei
    Tsang, Ivor W.
    Shen, Xiaobo
    [J]. CIKM'17: PROCEEDINGS OF THE 2017 ACM CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, 2017, : 2007 - 2010
  • [5] ON GENERALIZED MULTIPLE-INSTANCE LEARNING
    Scott, Stephen
    Zhang, Jun
    Brown, Joshua
    [J]. INTERNATIONAL JOURNAL OF COMPUTATIONAL INTELLIGENCE AND APPLICATIONS, 2005, 5 (01) : 21 - 35
  • [6] A framework for multiple-instance learning
    Maron, O
    Lozano-Perez, T
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 10, 1998, 10 : 570 - 576
  • [7] On multiple-instance learning of halfspaces
    Diochnos, D. I.
    Sloan, R. H.
    Turan, Gy
    [J]. INFORMATION PROCESSING LETTERS, 2012, 112 (23) : 933 - 936
  • [8] UNSUPERVISED MULTIPLE-INSTANCE LEARNING FOR INSTANCE SEARCH
    Wang, Zhenzhen
    Yuan, Junsong
    [J]. 2018 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), 2018,
  • [9] MULTIPLE-INSTANCE LEARNING WITH PAIRWISE INSTANCE SIMILARITY
    Yuan, Liming
    Liu, Jiafeng
    Tang, Xianglong
    [J]. INTERNATIONAL JOURNAL OF APPLIED MATHEMATICS AND COMPUTER SCIENCE, 2014, 24 (03) : 567 - 577
  • [10] Salient Instance Selection for Multiple-Instance Learning
    Yuan, Liming
    Liu, Songbo
    Huang, Qingcheng
    Liu, Jiafeng
    Tang, Xianglong
    [J]. NEURAL INFORMATION PROCESSING, ICONIP 2012, PT III, 2012, 7665 : 58 - 67