MultiCon: A Semi-Supervised Approach for Predicting Drug Function from Chemical Structure Analysis

被引:17
|
作者
Sahoo, Pracheta [1 ]
Roy, Indranil [2 ]
Wang, Zhuoyi [1 ]
Mi, Feng [1 ]
Yu, Lin [1 ]
Balasubramani, Pradeep [1 ]
Khan, Latifur [1 ]
Stoddart, J. Fraser [2 ,3 ,4 ]
机构
[1] Univ Texas Dallas, Dept Comp Sci, Richardson, TX 75080 USA
[2] Northwestern Univ, Dept Chem, Evanston, IL 60208 USA
[3] Tianjin Univ, Inst Mol Design & Synth, Tianjin 300072, Peoples R China
[4] Univ New South Wales, Sch Chem, Sydney, NSW 2052, Australia
关键词
DISCOVERY;
D O I
10.1021/acs.jcim.0c00801
中图分类号
R914 [药物化学];
学科分类号
100701 ;
摘要
Semi-supervised learning has proved its efficacy in utilizing extensive unlabeled data to alleviate the use of a large amount of supervised data and improve model performance. Despite its tremendous potential, semi-supervised learning has yet to be implemented in the field of drug discovery. Empirical testing of drugs and their classification is costly and time-consuming. In contrast, predicting therapeutic applications of drugs from their structural formulas using semi-supervised learning would reduce costs and time significantly. Herein, we employ a new multicontrastive-based semi-supervised learning algorithm-MultiCon-for classifying drugs into 12 categories, according to therapeutic applications, on the basis of image analyses of their structural formulas. By rational use of data balancing, online augmentations of the drug image data during training, and the combined use of multicontrastive loss with consistency regularization, MultiCon achieves better class prediction accuracies when compared with the state-of-the-art machine learning methods across a variety of existing semi-supervised learning benchmarks. In particular, it performs exceptionally well with a limited number of labeled examples. For instance, with just 5000 labeled drugs in a PubChem (D-3) data set, MultiCon achieved a class prediction accuracy of 97.74%.
引用
收藏
页码:5995 / 6006
页数:12
相关论文
共 50 条
  • [1] Predicting Drug-Target Interactions Based on an Improved Semi-Supervised Learning Approach
    Yu, Weiming
    Cheng, Xuan
    Li, Zhibin
    Jiang, Zhenran
    DRUG DEVELOPMENT RESEARCH, 2011, 72 (02) : 219 - 224
  • [2] A semi-supervised approach to fault diagnosis for chemical processes
    Monroy, Isaac
    Benitez, Raul
    Escudero, Gerard
    Graells, Moises
    COMPUTERS & CHEMICAL ENGINEERING, 2010, 34 (05) : 631 - 642
  • [3] NLLSS: Predicting Synergistic Drug Combinations Based on Semi-supervised Learning
    Chen, Xing
    Ren, Biao
    Chen, Ming
    Wang, Quanxin
    Zhang, Lixin
    Yan, Guiying
    PLOS COMPUTATIONAL BIOLOGY, 2016, 12 (07)
  • [4] Predicting Drug-Drug Interactions Based on Integrated Similarity and Semi-Supervised Learning
    Yan, Cheng
    Duan, Guihua
    Zhang, Yayan
    Wu, Fang-Xiang
    Pan, Yi
    Wang, Jianxin
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2022, 19 (01) : 168 - 179
  • [5] Predicting Adverse Drug-Drug Interactions via Semi-supervised Variational Autoencoders
    Hou, Meihao
    Yang, Fan
    Cui, Lizhen
    Guo, Wei
    WEB AND BIG DATA, PT II, APWEB-WAIM 2020, 2020, 12318 : 132 - 140
  • [6] A semi-supervised learning approach for RNA secondary structure prediction
    Yonemoto, Haruka
    Asai, Kiyoshi
    Hamada, Michiaki
    COMPUTATIONAL BIOLOGY AND CHEMISTRY, 2015, 57 : 72 - 79
  • [7] Predicting microbe-drug association based on similarity and semi-supervised learning
    Zhu L.
    Wang J.
    Li G.
    Hu X.
    Ge B.
    Zhang B.
    American Journal of Biochemistry and Biotechnology, 2021, 17 (01): : 50 - 58
  • [8] Semi-supervised discriminant analysis
    Cai, Deng
    He, Xiaofei
    Han, Jiawei
    2007 IEEE 11TH INTERNATIONAL CONFERENCE ON COMPUTER VISION, VOLS 1-6, 2007, : 222 - 228
  • [9] Predicting Protein Function by Multi-Label Correlated Semi-Supervised Learning
    Jiang, Jonathan Q.
    McQuay, Lisa J.
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2012, 9 (04) : 1059 - 1069
  • [10] Semi-supervised Component Analysis
    Watanabe, Kenji
    Wada, Toshikazu
    2015 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC 2015): BIG DATA ANALYTICS FOR HUMAN-CENTRIC SYSTEMS, 2015, : 3011 - 3016