Augmented drug combination dataset to improve the performance of machine learning models predicting synergistic anticancer effects

Cited by: 2
Authors
Liu, Mengmeng [1]
Srivastava, Gopal [2]
Ramanujam, J. [1,3]
Brylinski, Michal [2,3]
Affiliations
[1] Louisiana State Univ, Div Elect & Comp Engn, Baton Rouge, LA 70803 USA
[2] Louisiana State Univ, Dept Biol Sci, Baton Rouge, LA 70803 USA
[3] Louisiana State Univ, Ctr Computat & Technol, Baton Rouge, LA 70803 USA
Funding
National Science Foundation (USA); National Institutes of Health (USA);
Keywords
CANCER; INHIBITOR; EFFICACY; SCREEN;
DOI
10.1038/s41598-024-51940-9
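Chinese Library Classification (CLC) number
O [Mathematical sciences and chemistry]; P [Astronomy and earth sciences]; Q [Biological sciences]; N [General natural sciences];
Discipline classification code
07 ; 0710 ; 09 ;
Abstract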
Combination therapy has gained popularity in cancer treatment as it enhances the treatment efficacy and overcomes drug resistance. Although machine learning (ML) techniques have become an indispensable tool for discovering new drug combinations, the data on drug combination therapy currently available may be insufficient to build high-precision models. We developed a data augmentation protocol to unbiasedly scale up the existing anti-cancer drug synergy dataset. Using a new drug similarity metric, we augmented the synergy data by substituting a compound in a drug combination instance with another molecule that exhibits highly similar pharmacological effects. Using this protocol, we were able to upscale the AZ-DREAM Challenges dataset from 8798 to 6,016,697 drug combinations. Comprehensive performance evaluations show that ML models trained on the augmented data consistently achieve higher accuracy than those trained solely on the original dataset. Our data augmentation protocol provides a systematic and unbiased approach to generating more diverse and larger-scale drug combination datasets, enabling the development of more precise and effective ML models. The protocol presented in this study could serve as a foundation for future research aimed at discovering novel and effective drug combinations for cancer treatment.
Pages: 14
Related papers
50 in total
  • [41] Drug Clearance in Neonates: A Combination of Population Pharmacokinetic Modelling and Machine Learning Approaches to Improve Individual Prediction
    Tang, Bo-Hao
    Guan, Zheng
    Allegaert, Karel
    Wu, Yue-E.
    Manolis, Efthymios
    Leroux, Stephanie
    Yao, Bu-Fan
    Shi, Hai-Yan
    Li, Xiao
    Huang, Xin
    Wang, Wen-Qi
    Shen, A. -Dong
    Wang, Xiao-Ling
    Wang, Tian-You
    Kou, Chen
    Xu, Hai-Yan
    Zhou, Yue
    Zheng, Yi
    Hao, Guo-Xiang
    Xu, Bao-Ping
    Thomson, Alison H.
    Capparelli, Edmund V.
    Biran, Valerie
    Simon, Nicolas
    Meibohm, Bernd
    Lo, Yoke-Lin
    Marques, Remedios
    Peris, Jose-Esteban
    Lutsar, Irja
    Saito, Jumpei
    Burggraaf, Jacobus
    Jacqz-Aigrain, Evelyne
    van den Anker, John
    Zhao, Wei
    CLINICAL PHARMACOKINETICS, 2021, 60 (11) : 1435 - 1448
  • [43] Increasing the performance of intrusion detection models developed using machine learning method with preprocessing applied to the dataset
    Ilgun, Esen Gul
    Samet, Refik
    JOURNAL OF THE FACULTY OF ENGINEERING AND ARCHITECTURE OF GAZI UNIVERSITY, 2024, 39 (02) : 679 - 692
  • [44] Synergistic effects between data corpora properties and machine learning performance in data pipelines
    Bertolini, Roberto
    Finch, Stephen J.
    INTERNATIONAL JOURNAL OF DATA MINING MODELLING AND MANAGEMENT, 2022, 14 (03) : 217 - 233
  • [45] Machine Learning Techniques for Predicting Drug-Related Side Effects: A Scoping Review
    Toni, Esmaeel
    Ayatollahi, Haleh
    Abbaszadeh, Reza
    Siahpirani, Alireza Fotuhi
    PHARMACEUTICALS, 2024, 17 (06)
  • [46] Machine Learning for Predicting Biologic Agent Efficacy in Ulcerative Colitis: An Analysis for Generalizability and Combination with Computational Models
    Pinton, Philippe
    DIAGNOSTICS, 2024, 14 (13)
  • [47] Predicting child occupant crash injury severity in the United Arab Emirates using machine learning models for imbalanced dataset
    Abdulazeez, Muhammad Uba
    Khan, Wasif
    Abdullah, Kassim Abdulrahman
    IATSS RESEARCH, 2023, 47 (02) : 134 - 159
  • [48] Network Effects on Dual Machine Learning Models Predicting Smart Home Sensor Measurements
    Almhairat, Saif
    Wallace, Bruce
    Lariviere-Chartier, Julien
    El-Haraki, Ali
    Goubran, Rafik
    Knoefel, Frank
    2022 IEEE INTERNATIONAL INSTRUMENTATION AND MEASUREMENT TECHNOLOGY CONFERENCE (I2MTC 2022), 2022,
  • [49] Machine learning models for predicting the performance of solar-geothermal desalination in different meteorological conditions
    Farahani, Somayeh Davoodabadi
    Farahani, Amir Davoodabadi
    AIN SHAMS ENGINEERING JOURNAL, 2024, 15 (03)
  • [50] Analysis of the Performance of Machine Learning Models in Predicting the Severity Level of Large-Truck Crashes
    Liu, Jinli
    Qi, Yi
    Tao, Jueqiang
    Tao, Tao
    FUTURE TRANSPORTATION, 2022, 2 (04) : 939 - 955