Drug activity prediction using multiple-instance learning via joint instance and feature selection

被引:15
|
作者
Zhao, Zhendong [1 ]
Fu, Gang [2 ]
Liu, Sheng [1 ]
Elokely, Khaled M. [2 ]
Doerksen, Robert J. [2 ,3 ]
Chen, Yixin [1 ]
Wilkins, Dawn E. [1 ]
机构
[1] Univ Mississippi, Sch Engn, Dept Comp & Informat Sci, University, MS 38677 USA
[2] Univ Mississippi, Sch Pharm, Dept Med Chem, University, MS 38677 USA
[3] Univ Mississippi, Sch Pharm, Res Inst Pharmaceut Sci, University, MS 38677 USA
来源
BMC BIOINFORMATICS | 2013年 / 14卷
基金
美国国家科学基金会;
关键词
PHASE;
D O I
10.1186/1471-2105-14-S14-S16
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: In drug discovery and development, it is crucial to determine which conformers (instances) of a given molecule are responsible for its observed biological activity and at the same time to recognize the most representative subset of features (molecular descriptors). Due to experimental difficulty in obtaining the bioactive conformers, computational approaches such as machine learning techniques are much needed. Multiple Instance Learning (MIL) is a machine learning method capable of tackling this type of problem. In the MIL framework, each instance is represented as a feature vector, which usually resides in a high-dimensional feature space. The high dimensionality may provide significant information for learning tasks, but at the same time it may also include a large number of irrelevant or redundant features that might negatively affect learning performance. Reducing the dimensionality of data will hence facilitate the classification task and improve the interpretability of the model. Results: In this work we propose a novel approach, named multiple instance learning via joint instance and feature selection. The iterative joint instance and feature selection is achieved using an instance-based feature mapping and 1-norm regularized optimization. The proposed approach was tested on four biological activity datasets. Conclusions: The empirical results demonstrate that the selected instances (prototype conformers) and features (pharmacophore fingerprints) have competitive discriminative power and the convergence of the selection process is also fast.
引用
收藏
页数:12
相关论文
共 50 条
  • [1] Drug activity prediction using multiple-instance learning via joint instance and feature selection
    Zhendong Zhao
    Gang Fu
    Sheng Liu
    Khaled M Elokely
    Robert J Doerksen
    Yixin Chen
    Dawn E Wilkins
    [J]. BMC Bioinformatics, 14
  • [2] Implementation of multiple-instance learning in drug activity prediction
    Fu, Gang
    Nan, Xiaofei
    Liu, Haining
    Patel, Ronak Y.
    Daga, Pankaj R.
    Chen, Yixin
    Wilkins, Dawn E.
    Doerksen, Robert J.
    [J]. BMC BIOINFORMATICS, 2012, 13
  • [3] Implementation of multiple-instance learning in drug activity prediction
    Gang Fu
    Xiaofei Nan
    Haining Liu
    Ronak Y Patel
    Pankaj R Daga
    Yixin Chen
    Dawn E Wilkins
    Robert J Doerksen
    [J]. BMC Bioinformatics, 13
  • [4] Multiple-Instance Learning with Instance Selection via Dominant Sets
    Erdem, Aykut
    Erdem, Erkut
    [J]. SIMILARITY-BASED PATTERN RECOGNITION: FIRST INTERNATIONAL WORKSHOP, SIMBAD 2011, 2011, 7005 : 177 - 191
  • [5] Multiple-Instance Learning with Instance Selection via Dominant Sets
    Erdem, Aykut
    Erdem, Erkut
    [J]. SIMILARITY-BASED PATTERN RECOGNITION, 2011, 7005 : 177 - 191
  • [6] MILES: Multiple-Instance Learning via Embedded instance Selection
    Chen, Yixin
    Bi, Jinbo
    Wang, James Z.
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2006, 28 (12) : 1931 - 1947
  • [7] Revisiting Multiple-Instance Learning Via Embedded Instance Selection
    Foulds, James
    Frank, Eibe
    [J]. AI 2008: ADVANCES IN ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 2008, 5360 : 300 - 310
  • [8] Salient Instance Selection for Multiple-Instance Learning
    Yuan, Liming
    Liu, Songbo
    Huang, Qingcheng
    Liu, Jiafeng
    Tang, Xianglong
    [J]. NEURAL INFORMATION PROCESSING, ICONIP 2012, PT III, 2012, 7665 : 58 - 67
  • [9] Multiple-Instance Learning with Instance Selection via Constructive Covering Algorithm
    Zhang, Yanping
    Zhang, Heng
    Wei, Huazhen
    Tang, Jie
    Zhao, Shu
    [J]. TSINGHUA SCIENCE AND TECHNOLOGY, 2014, 19 (03) : 285 - 292
  • [10] Multiple-Instance Learning with Instance Selection via Constructive Covering Algorithm
    Yanping Zhang
    Heng Zhang
    Huazhen Wei
    Jie Tang
    Shu Zhao
    [J]. Tsinghua Science and Technology, 2014, (03) : 285 - 292