Predicting clinical trial outcomes using drug bioactivities through graph database integration and machine learning

被引:1
|
作者
Murali, Vidhya [1 ]
Muralidhar, Y. Pradyumna [2 ]
Koenigs, Cassandra [3 ]
Nair, Meera [4 ]
Madhu, Sethulekshmi [4 ]
Nedungadi, Prema [5 ]
Srinivasa, Gowri [2 ]
Athri, Prashanth [1 ]
机构
[1] Amrita Sch Engn, Dept Comp Sci & Engn, Bengaluru, India
[2] PES Univ, PES Ctr Pattern Recognit, Dept Comp Sci & Engn, Bengaluru, India
[3] Bielefeld Univ, Bioinformat & Med Informat, Northrhine Westphalia, Germany
[4] Amrita Vishwa Vidyapeetham, Amrita Sch Biotechnol, Amritapuri, Kerala, India
[5] Amrita Sch Engn, Dept Comp Sci & Engn, Vallikavu, Kerala, India
关键词
bioactivity; clinical trial; data integration; ensemble algorithms; graph database; machine learning; FINGERPRINT SIMILARITY SEARCH; CHEMBL; RISPERIDONE; SORAFENIB;
D O I
10.1111/cbdd.14092
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
The ability to estimate the probability of a drug to receive approval in clinical trials provides natural advantages to optimizing pharmaceutical research workflows. Success rates of clinical trials have deep implications for costs, duration of development, and under pressure due to stringent regulatory approval processes. We propose a machine learning approach that can predict the outcome of the trial with reliable accuracies, using biological activities, physicochemical properties of the compounds, target-related features, and NLP-based compound representation. In the above list, biological activities have never been used as an independent variable towards the prediction of clinical trial outcomes. We have extracted the drug-disease pair from clinical trials and mapped target(s) to that pair using multiple data sources. Empirical results demonstrate that ensemble learning outperforms independently trained, small-data ML models. We report results and inferences derived from a Random forest classifier with an average accuracy of 93%, and an F1 score of 0.96 for the "Pass" class. "Pass" refers to one of the two classes (Pass/Fail) of all clinical trials, and the model performed well in predicting the "Pass" category. Through the analysis of feature contributions to predictive capability, we have demonstrated that bioactivity plays a statistically significant role in predicting clinical trial outcome. A significant effort has gone into the production of the dataset that, for the first time, integrates clinical trial information with protein targets. Cleaned, organized, integrated data and code to map these entities, created as a part of this work, are available open-source. This reproducibility and the freely available code ensure that researchers with access to deep curated and proprietary clinical trial databases (we only use open-source data in this study) can further expand the scope of the results.
引用
收藏
页码:169 / 184
页数:16
相关论文
共 50 条
  • [1] Predicting Clinical Outcomes in Colorectal Cancer Using Machine Learning
    Gruendner, Julian
    Prokosch, Hans-Ulrich
    Stuerzl, Michael
    Croner, Roland
    Christoph, Jan
    Toddenroth, Dennis
    [J]. BUILDING CONTINENTS OF KNOWLEDGE IN OCEANS OF DATA: THE FUTURE OF CO-CREATED EHEALTH, 2018, 247 : 101 - 105
  • [2] Using machine learning for predicting outcomes in ACLF
    Tonon, Marta
    Moreau, Richard
    [J]. LIVER INTERNATIONAL, 2022, 42 (11) : 2354 - 2355
  • [3] PREDICTING INVESTOR SUCCESS USING GRAPH THEORY AND MACHINE LEARNING
    Glupker, Jeffrey
    Nair, Vinit
    Richman, Benjamin
    Riener, Kyle
    Sharma, Amrita
    [J]. JOURNAL OF INVESTMENT MANAGEMENT, 2019, 17 (01): : 92 - 103
  • [4] Predicting Immunotherapy Outcomes in Glioblastoma Patients through Machine Learning
    Mestrallet, Guillaume
    [J]. CANCERS, 2024, 16 (02)
  • [5] Predicting outcomes of pelvic exenteration using machine learning
    Dudurych, I.
    Kelly, M. E.
    Aalbers, A. G. J.
    Aziz, N. Abdul
    Abecasis, N.
    Abraham-Nordling, M.
    Akiyoshi, T.
    Alberda, W.
    Albert, M.
    Andric, M.
    Angenete, E.
    Antoniou, A.
    Auer, R.
    Austin, K. K.
    Aziz, O.
    Baker, R. P.
    Bali, M.
    Baseckas, G.
    Bebington, B.
    Bedford, M.
    Bednarski, B. K.
    Beets, G. L.
    Berg, P. L.
    Beynon, J.
    Biondo, S.
    Boyle, K.
    Bordeianou, L.
    Bremers, A. B.
    Brunner, M.
    Buchwald, P.
    Bui, A.
    Burgess, A.
    Burger, J. W. A.
    Burling, D.
    Burns, E.
    Campain, N.
    Carvalhal, S.
    Castro, L.
    Caycedo-Marulanda, A.
    Chan, K. K. L.
    Chang, G. J.
    Chew, M. H.
    Chok, A. K.
    Chong, P.
    Christensen, H. K.
    Clouston, H.
    Codd, M.
    Collins, D.
    Colquhoun, A. J.
    Corr, A.
    [J]. COLORECTAL DISEASE, 2020, 22 (12) : 1933 - 1940
  • [6] Clinical intelligence: New machine learning techniques for predicting clinical drug response
    Turki, Turki
    Wang, Jason T. L.
    [J]. COMPUTERS IN BIOLOGY AND MEDICINE, 2019, 107 : 302 - 322
  • [7] PREDICTING OUTCOMES IN MULTIPLE SCLEROSIS THROUGH MACHINE LEARNING USING DATA FROM PHARMACEUTICAL CONSULTATION
    Cardoso, P.
    Santos, C.
    Costa, F.
    [J]. VALUE IN HEALTH, 2022, 25 (01) : S200 - S200
  • [8] Predicting Dynamic Clinical Outcomes of the Chemotherapy for Canine Lymphoma Patients Using a Machine Learning Model
    Koo, Jamin
    Choi, Kyucheol
    Lee, Peter
    Polley, Amanda
    Pudupakam, Raghavendra Sumanth
    Tsang, Josephine
    Fernandez, Elmer
    Han, Enyang James
    Park, Stanley
    Swartzfager, Deanna
    Qi, Nicholas Seah Xi
    Jung, Melody
    Ocnean, Mary
    Kim, Hyun Uk
    Lim, Sungwon
    [J]. VETERINARY SCIENCES, 2021, 8 (12)
  • [9] Predicting Clinical Outcomes of Large Vessel Occlusion Before Mechanical Thrombectomy Using Machine Learning
    Nishi, Hidehisa
    Oishi, Naoya
    Ishii, Akira
    Ono, Isao
    Ogura, Takenori
    Sunohara, Tadashi
    Chihara, Hideo
    Fukumitsu, Ryu
    Okawa, Masakazu
    Yamana, Norikazu
    Imamura, Hirotoshi
    Sadamasa, Nobutake
    Hatano, Taketo
    Nakahara, Ichiro
    Sakai, Nobuyuki
    Miyamoto, Susumu
    [J]. STROKE, 2019, 50 (09) : 2379 - 2388
  • [10] Drug repositioning: a machine-learning approach through data integration
    Francesco Napolitano
    Yan Zhao
    Vânia M Moreira
    Roberto Tagliaferri
    Juha Kere
    Mauro D’Amato
    Dario Greco
    [J]. Journal of Cheminformatics, 5