Drug activity prediction using multiple-instance learning via joint instance and feature selection

Cited by: 15
Authors
Zhao, Zhendong [1]
Fu, Gang [2]
Liu, Sheng [1]
Elokely, Khaled M. [2]
Doerksen, Robert J. [2, 3]
Chen, Yixin [1]
Wilkins, Dawn E. [1]
Affiliations
[1] Univ Mississippi, Sch Engn, Dept Comp & Informat Sci, University, MS 38677 USA
[2] Univ Mississippi, Sch Pharm, Dept Med Chem, University, MS 38677 USA
[3] Univ Mississippi, Sch Pharm, Res Inst Pharmaceut Sci, University, MS 38677 USA
Source
BMC Bioinformatics | 2013 / Vol. 14
Funding
National Science Foundation (USA);
Keywords
PHASE;
DOI
10.1186/1471-2105-14-S14-S16
Chinese Library Classification number
Q5 [Biochemistry];
Subject classification codes
071010 ; 081704 ;
Abstract
Background: In drug discovery and development, it is crucial to determine which conformers (instances) of a given molecule are responsible for its observed biological activity and at the same time to recognize the most representative subset of features (molecular descriptors). Due to experimental difficulty in obtaining the bioactive conformers, computational approaches such as machine learning techniques are much needed. Multiple Instance Learning (MIL) is a machine learning method capable of tackling this type of problem. In the MIL framework, each instance is represented as a feature vector, which usually resides in a high-dimensional feature space. The high dimensionality may provide significant information for learning tasks, but at the same time it may also include a large number of irrelevant or redundant features that might negatively affect learning performance. Reducing the dimensionality of data will hence facilitate the classification task and improve the interpretability of the model.
Results: In this work we propose a novel approach, named multiple instance learning via joint instance and feature selection. The iterative joint instance and feature selection is achieved using an instance-based feature mapping and 1-norm regularized optimization. The proposed approach was tested on four biological activity datasets.
Conclusions: The empirical results demonstrate that the selected instances (prototype conformers) and features (pharmacophore fingerprints) have competitive discriminative power and the convergence of the selection process is also fast.
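The two ingredients named in the abstract, an instance-based feature mapping and 1-norm regularized optimization, can be illustrated with a minimal Python sketch. This is not the authors' implementation: the Gaussian similarity, the max-over-instances embedding, the logistic loss, the proximal-gradient solver, the toy 2-D "conformers", and every function name below are assumptions made for illustration. The sketch covers only the instance (prototype) selection side; the iterative joint feature selection described in the abstract is not shown.

```python
import math

def similarity(x, p, sigma=1.0):
    """Gaussian similarity between instance x and prototype p (assumed choice)."""
    d2 = sum((a - b) ** 2 for a, b in zip(x, p))
    return math.exp(-d2 / sigma ** 2)

def embed_bag(bag, prototypes, sigma=1.0):
    """Instance-based mapping: one feature per candidate prototype, equal to
    the similarity of the bag's closest instance to that prototype."""
    return [max(similarity(x, p, sigma) for x in bag) for p in prototypes]

def soft_threshold(v, t):
    """Proximal operator of t * |v|: shrink v toward zero by t."""
    if v > t:
        return v - t
    if v < -t:
        return v + t
    return 0.0

def fit_l1_logistic(X, y, lam=0.01, lr=0.4, iters=10000):
    """1-norm regularized logistic regression via proximal gradient descent.
    Labels y are in {-1, +1}; the bias term is not regularized."""
    n, d = len(X), len(X[0])
    w, b = [0.0] * d, 0.0
    for _ in range(iters):
        gw, gb = [0.0] * d, 0.0
        for xi, yi in zip(X, y):
            margin = yi * (sum(wj * xj for wj, xj in zip(w, xi)) + b)
            g = -yi / (1.0 + math.exp(margin))  # d/dscore of log(1 + exp(-margin))
            for j in range(d):
                gw[j] += g * xi[j] / n
            gb += g / n
        # Gradient step on the smooth loss, then shrinkage for the 1-norm term.
        w = [soft_threshold(wj - lr * gj, lr * lam) for wj, gj in zip(w, gw)]
        b -= lr * gb
    return w, b

# Toy data (hypothetical): each bag is a molecule, each instance a 2-D conformer.
# Positive bags contain an instance near (1, 1); negative bags do not.
bags = [
    [[1.0, 1.0], [0.0, 0.0]],
    [[1.1, 0.9], [3.0, 0.0]],
    [[0.0, 0.1], [3.0, 0.1]],
    [[0.1, 0.0], [2.9, 0.0]],
]
labels = [1, 1, -1, -1]

# Every training instance is a candidate prototype.
prototypes = [x for bag in bags for x in bag]
X = [embed_bag(bag, prototypes) for bag in bags]
w, b = fit_l1_logistic(X, labels)

preds = [1 if sum(wj * xj for wj, xj in zip(w, xi)) + b > 0 else -1 for xi in X]
selected = [(p, round(wj, 3)) for p, wj in zip(prototypes, w) if wj != 0.0]
print(preds)
print(selected)
```

In this sketch, the prototypes left with nonzero weight play the role of the selected instances; a larger `lam` would typically shrink more weights to zero at some cost in fit.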
Pages: 12
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 10, 1998, 10 : 570 - 576