Accurate clinical toxicity prediction using multi-task deep neural nets and contrastive molecular explanations

被引:21
|
作者
Sharma, Bhanushee [1 ]
Chenthamarakshan, Vijil [2 ]
Dhurandhar, Amit [2 ]
Pereira, Shiranee [3 ]
Hendler, James A. [4 ]
Dordick, Jonathan S. [1 ]
Das, Payel [2 ]
机构
[1] RPI, Chem & Biol Engn, Troy, NY 12180 USA
[2] IBM Res, Yorktown Hts, NY 10598 USA
[3] ICARE, Int Ctr Alternat Res & Educ, Chennai, India
[4] RPI, Comp Sci, Troy, NY USA
关键词
DRUG DISCOVERY; TOXICOLOGY; CARCINOGENICITY; 21ST-CENTURY; CONCORDANCE; REGRESSION; CHEMICALS; MACHINE; WELL;
D O I
10.1038/s41598-023-31169-8
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Explainable machine learning for molecular toxicity prediction is a promising approach for efficient drug development and chemical safety. A predictive ML model of toxicity can reduce experimental cost and time while mitigating ethical concerns by significantly reducing animal and clinical testing. Herein, we use a deep learning framework for simultaneously modeling in vitro, in vivo, and clinical toxicity data. Two different molecular input representations are used; Morgan fingerprints and pre-trained SMILES embeddings. A multi-task deep learning model accurately predicts toxicity for all endpoints, including clinical, as indicated by the area under the Receiver Operator Characteristic curve and balanced accuracy. In particular, pre-trained molecular SMILES embeddings as input to the multi-task model improved clinical toxicity predictions compared to existing models in MoleculeNet benchmark. Additionally, our multitask approach is comprehensive in the sense that it is comparable to state-of-the-art approaches for specific endpoints in in vitro, in vivo and clinical platforms. Through both the multi-task model and transfer learning, we were able to indicate the minimal need of in vivo data for clinical toxicity predictions. To provide confidence and explain the model's predictions, we adapt a post-hoc contrastive explanation method that returns pertinent positive and negative features, which correspond well to known mutagenic and reactive toxicophores, such as unsubstituted bonded heteroatoms, aromatic amines, and Michael receptors. Furthermore, toxicophore recovery by pertinent feature analysis captures more of the in vitro (53%) and in vivo (56%), rather than of the clinical (8%), endpoints, and indeed uncovers a preference in known toxicophore data towards in vitro and in vivo experimental data. To our knowledge, this is the first contrastive explanation, using both present and absent substructures, for predictions of clinical and in vivo molecular toxicity.
引用
收藏
页数:16
相关论文
共 50 条
  • [41] Leveraging auxiliary measures: a deep multi-task neural network for predictive modeling in clinical research
    Xiangrui Li
    Dongxiao Zhu
    Phillip Levy
    BMC Medical Informatics and Decision Making, 18
  • [42] Leveraging auxiliary measures: a deep multi-task neural network for predictive modeling in clinical research
    Li, Xiangrui
    Zhu, Dongxiao
    Levy, Phillip
    BMC MEDICAL INFORMATICS AND DECISION MAKING, 2018, 18
  • [43] Multi-Adaptive Optimization for multi-task learning with deep neural networks
    Hervella, alvaro S.
    Rouco, Jose
    Novo, Jorge
    Ortega, Marcos
    NEURAL NETWORKS, 2024, 170 : 254 - 265
  • [44] Multi-Task Deep Neural Networks for Multi-Document Reading Comprehension
    Liu, Chang
    Liu, Zhuang
    Lin, Wayne
    Zhao, Jun
    2021 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2021,
  • [45] Deep Convolutional Neural Networks for Multi-Instance Multi-Task Learning
    Zeng, Tao
    Ji, Shuiwang
    2015 IEEE INTERNATIONAL CONFERENCE ON DATA MINING (ICDM), 2015, : 579 - 588
  • [46] Multi-task deep neural networks for just-in-time software defect prediction on mobile apps
    Huang, Qiguo
    Li, Zhengliang
    Gu, Qing
    CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2024, 36 (10):
  • [47] DEEPred: Automated Protein Function Prediction with Multi-task Feed-forward Deep Neural Networks
    Rifaioglu, Ahmet Sureyya
    Dogan, Tunca
    Martin, Maria Jesus
    Cetin-Atalay, Rengul
    Atalay, Volkan
    SCIENTIFIC REPORTS, 2019, 9 (1)
  • [48] Multi-Task Learning Using Task Dependencies for Face Attributes Prediction
    Fan, Di
    Kim, Hyunwoo
    Kim, Junmo
    Liu, Yunhui
    Huang, Qiang
    APPLIED SCIENCES-BASEL, 2019, 9 (12):
  • [49] DEEPred: Automated Protein Function Prediction with Multi-task Feed-forward Deep Neural Networks
    Ahmet Sureyya Rifaioglu
    Tunca Doğan
    Maria Jesus Martin
    Rengul Cetin-Atalay
    Volkan Atalay
    Scientific Reports, 9
  • [50] Combined molecular subtyping, grading, and segmentation of glioma using multi-task deep learning
    van der Voort, Sebastian R.
    Incekara, Fatih
    Wijnenga, Maarten M. J.
    Kapsas, Georgios
    Gahrmann, Renske
    Schouten, Joost W.
    Tewarie, Rishi Nandoe
    Lycklama, Geert J.
    Hamer, Philip C. De Witt
    Eijgelaar, Roelant S.
    French, Pim J.
    Dubbink, Hendrikus J.
    Vincent, Arnaud J. P. E.
    Niessen, Wiro J.
    van den Bent, Martin J.
    Smits, Marion
    Klein, Stefan
    NEURO-ONCOLOGY, 2023, 25 (02) : 279 - 289