Drug activity prediction using multiple-instance learning via joint instance and feature selection

被引:15
|
作者
Zhao, Zhendong [1 ]
Fu, Gang [2 ]
Liu, Sheng [1 ]
Elokely, Khaled M. [2 ]
Doerksen, Robert J. [2 ,3 ]
Chen, Yixin [1 ]
Wilkins, Dawn E. [1 ]
机构
[1] Univ Mississippi, Sch Engn, Dept Comp & Informat Sci, University, MS 38677 USA
[2] Univ Mississippi, Sch Pharm, Dept Med Chem, University, MS 38677 USA
[3] Univ Mississippi, Sch Pharm, Res Inst Pharmaceut Sci, University, MS 38677 USA
来源
BMC BIOINFORMATICS | 2013年 / 14卷
基金
美国国家科学基金会;
关键词
PHASE;
D O I
10.1186/1471-2105-14-S14-S16
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: In drug discovery and development, it is crucial to determine which conformers (instances) of a given molecule are responsible for its observed biological activity and at the same time to recognize the most representative subset of features (molecular descriptors). Due to experimental difficulty in obtaining the bioactive conformers, computational approaches such as machine learning techniques are much needed. Multiple Instance Learning (MIL) is a machine learning method capable of tackling this type of problem. In the MIL framework, each instance is represented as a feature vector, which usually resides in a high-dimensional feature space. The high dimensionality may provide significant information for learning tasks, but at the same time it may also include a large number of irrelevant or redundant features that might negatively affect learning performance. Reducing the dimensionality of data will hence facilitate the classification task and improve the interpretability of the model. Results: In this work we propose a novel approach, named multiple instance learning via joint instance and feature selection. The iterative joint instance and feature selection is achieved using an instance-based feature mapping and 1-norm regularized optimization. The proposed approach was tested on four biological activity datasets. Conclusions: The empirical results demonstrate that the selected instances (prototype conformers) and features (pharmacophore fingerprints) have competitive discriminative power and the convergence of the selection process is also fast.
引用
收藏
页数:12
相关论文
共 50 条
  • [31] Detecting Multiple Myeloma via Generalized Multiple-Instance Learning
    Hering, Jan
    Kybic, Jan
    Lambert, Lukas
    [J]. MEDICAL IMAGING 2018: IMAGE PROCESSING, 2018, 10574
  • [32] Simultaneous instance pooling and bag representation selection approach for multiple-instance learning (MIL) using vision transformer
    Muhammad Waqas
    Muhammad Atif Tahir
    Muhammad Danish Author
    Sumaya Al-Maadeed
    Ahmed Bouridane
    Jia Wu
    [J]. Neural Computing and Applications, 2024, 36 : 6659 - 6680
  • [33] Simultaneous instance pooling and bag representation selection approach for multiple-instance learning (MIL) using vision transformer
    Waqas, Muhammad
    Tahir, Muhammad Atif
    Author, Muhammad Danish
    Al-Maadeed, Sumaya
    Bouridane, Ahmed
    Wu, Jia
    [J]. NEURAL COMPUTING & APPLICATIONS, 2024, 36 (12): : 6659 - 6680
  • [34] Local feature selection for multiple instance learning
    Aliasghar Shahrjooihaghighi
    Hichem Frigui
    [J]. Journal of Intelligent Information Systems, 2022, 59 : 45 - 69
  • [35] A Multiple-Instance Learning Approach to Sentence Selection for Question Ranking
    Romeo, Salvatore
    Martino, Giovanni Da San
    Barron-Cedeno, Alberto
    Moschitti, Alessandro
    [J]. ADVANCES IN INFORMATION RETRIEVAL, ECIR 2017, 2017, 10193 : 437 - 449
  • [36] Kernels for Generalized Multiple-Instance Learning
    Tao, Qingping
    Scott, Stephen D.
    Vinodchandran, N. V.
    Osugi, Thomas Takeo
    Mueller, Brandon
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2008, 30 (12) : 2084 - 2097
  • [37] Multiple-Instance Learning from Distributions
    Doran, Gary
    Ray, Soumya
    [J]. JOURNAL OF MACHINE LEARNING RESEARCH, 2016, 17
  • [38] Local feature selection for multiple instance learning
    Shahrjooihaghighi, Aliasghar
    Frigui, Hichem
    [J]. JOURNAL OF INTELLIGENT INFORMATION SYSTEMS, 2022, 59 (01) : 45 - 69
  • [39] An improved multiple-instance learning algorithm
    Han, Fengqing
    Wang, Dacheng
    Liao, Xiaofeng
    [J]. ADVANCES IN NEURAL NETWORKS - ISNN 2007, PT 1, PROCEEDINGS, 2007, 4491 : 1104 - +
  • [40] Saliency Detection by Multiple-Instance Learning
    Wang, Qi
    Yuan, Yuan
    Yan, Pingkun
    Li, Xuelong
    [J]. IEEE TRANSACTIONS ON CYBERNETICS, 2013, 43 (02) : 660 - 672