Drug-Target Interactions Prediction at Scale: The Komet Algorithm with the LCIdb Dataset

被引:0
|
作者
Guichaoua, Gwenn [1 ,2 ,3 ]
Pinel, Philippe [1 ,2 ,3 ,4 ]
Hoffmann, Brice [4 ]
Azencott, Chloe-Agathe [1 ,2 ,3 ]
Stoven, Veronique [1 ,2 ,3 ]
机构
[1] Mines Paris PSL, Ctr Computat Biol CBIO, F-75006 Paris, France
[2] Univ PSL, Inst Curie, F-75005 Paris, France
[3] INSERM, U900, F-75005 Paris, France
[4] Iktos SAS, F-75017 Paris, France
关键词
CHEMICAL-STRUCTURE; PROTEIN; DATABASE; LANGUAGE;
D O I
10.1021/acs.jcim.4c00422
中图分类号
R914 [药物化学];
学科分类号
100701 ;
摘要
Drug-target interactions (DTIs) prediction algorithms are used at various stages of the drug discovery process. In this context, specific problems such as deorphanization of a new therapeutic target or target identification of a drug candidate arising from phenotypic screens require large-scale predictions across the protein and molecule spaces. DTI prediction heavily relies on supervised learning algorithms that use known DTIs to learn associations between molecule and protein features, allowing for the prediction of new interactions based on learned patterns. The algorithms must be broadly applicable to enable reliable predictions, even in regions of the protein or molecule spaces where data may be scarce. In this paper, we address two key challenges to fulfill these goals: building large, high-quality training datasets and designing prediction methods that can scale, in order to be trained on such large datasets. First, we introduce LCIdb, a curated, large-sized dataset of DTIs, offering extensive coverage of both the molecule and druggable protein spaces. Notably, LCIdb contains a much higher number of molecules than publicly available benchmarks, expanding coverage of the molecule space. Second, we propose Komet (Kronecker Optimized METhod), a DTI prediction pipeline designed for scalability without compromising performance. Komet leverages a three-step framework, incorporating efficient computation choices tailored for large datasets and involving the Nystrom approximation. Specifically, Komet employs a Kronecker interaction module for (molecule, protein) pairs, which efficiently captures determinants in DTIs, and whose structure allows for reduced computational complexity and quasi-Newton optimization, ensuring that the model can handle large training sets, without compromising on performance. Our method is implemented in open-source software, leveraging GPU parallel computation for efficiency. We demonstrate the interest of our pipeline on various datasets, showing that Komet displays superior scalability and prediction performance compared to state-of-the-art deep learning approaches. Additionally, we illustrate the generalization properties of Komet by showing its performance on an external dataset, and on the publicly available L H benchmark designed for scaffold hopping problems.
引用
收藏
页码:6938 / 6956
页数:19
相关论文
共 50 条
  • [21] Unsupervised Prediction Method for Drug-Target Interactions Based on Structural Similarity
    Zhang, Xinyuan
    Lin, Xiaoli
    Hu, Jing
    Ding, Wenquan
    INTELLIGENT COMPUTING THEORIES AND APPLICATION, ICIC 2022, PT II, 2022, 13394 : 517 - 532
  • [22] Prediction of drug-target interactions using popular Collaborative Filtering methods
    Koohi, Arezou
    2013 IEEE INTERNATIONAL WORKSHOP ON GENOMIC SIGNAL PROCESSING AND STATISTICS (GENSIPS 2013), 2013, : 58 - 61
  • [23] Prediction of drug-target interactions through multi-task learning
    Moon, Chaeyoung
    Kim, Dongsup
    SCIENTIFIC REPORTS, 2022, 12 (01)
  • [24] DTIP: A Comparative Analytical Framework for Chemogenomic Drug-target Interactions Prediction
    Haddadi, Faraneh
    Keyvanpour, Mohammad Reza
    CURRENT COMPUTER-AIDED DRUG DESIGN, 2021, 17 (01) : 2 - 21
  • [25] Drug-target interactions prediction via deep collaborative filtering with multiembeddings
    Chen, Ruolan
    Xia, Feng
    Hu, Bing
    Jin, Shuting
    Liu, Xiangrong
    BRIEFINGS IN BIOINFORMATICS, 2022, 23 (02)
  • [26] Link prediction in drug-target interactions network using similarity indices
    Yiding Lu
    Yufan Guo
    Anna Korhonen
    BMC Bioinformatics, 18
  • [27] Significant Path Selection Improves the Prediction of Novel Drug-target Interactions
    Duc-Hau Le
    Dai-Phong Nguyen
    Anh-Minh Dao
    PROCEEDINGS OF THE SEVENTH SYMPOSIUM ON INFORMATION AND COMMUNICATION TECHNOLOGY (SOICT 2016), 2016, : 30 - 35
  • [28] Transformer and Graph Transformer-Based Prediction of Drug-Target Interactions
    Qian, Meiling
    Lu, Weizhong
    Zhang, Yu
    Liu, Junkai
    Wu, Hongjie
    Lu, Yaoyao
    Li, Haiou
    Fu, Qiming
    Shen, Jiyun
    Xiao, Yongbiao
    CURRENT BIOINFORMATICS, 2024, 19 (05) : 470 - 481
  • [29] Trader as a new optimization algorithm predicts drug-target interactions efficiently
    Masoudi-Sobhanzadeh, Yosef
    Omidi, Yadollah
    Amanlou, Massoud
    Masoudi-Nejad, Ali
    SCIENTIFIC REPORTS, 2019, 9 (1)
  • [30] Novel drug-target interactions via link prediction and network embedding
    Souri, E. Amiri
    Laddach, R.
    Karagiannis, S. N.
    Papageorgiou, L. G.
    Tsoka, S.
    BMC BIOINFORMATICS, 2022, 23 (01)