Evaluation of Machine Learning Models for Aqueous Solubility Prediction in Drug Discovery

被引:0
|
作者
Xue, Nian [1 ]
Zhang, Yuzhu [2 ]
Liu, Sensen [3 ]
机构
[1] NYU, Dept Comp Sc & Engn, New York, NY USA
[2] Carnegie Mellon Univ, Sch Comp Sc, Pittsburgh, PA 15213 USA
[3] Washington Univ, Dept Elect & Syst Engn, St Louis, MO 63110 USA
关键词
Machine Learning; Solubility Prediction; Drug Discovery; Feature Importance; DESCRIPTORS; QSAR;
D O I
10.1109/ICAIBD62003.2024.10604556
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Determining the aqueous solubility of the chemical compound is of great importance in-silico drug discovery. However, correctly and rapidly predicting the aqueous solubility remains a challenging task. This paper explores and evaluates the predictability of multiple machine learning models in the aqueous solubility of compounds. Specifically, we apply a series of machine learning algorithms, including Random Forest, XG-Boost, LightGBM, and CatBoost, on a well-established aqueous solubility dataset (i.e., the Huuskonen dataset) of over 1200 compounds. Experimental results show that even traditional machine learning algorithms can achieve satisfactory performance with high accuracy. In addition, our investigation goes beyond mere prediction accuracy, delving into the interpretability of models to identify key features and understand the molecular properties that influence the predicted outcomes. This study sheds light on the ability to use machine learning approaches to predict compound solubility, significantly shortening the time that researchers spend on new drug discovery.
引用
收藏
页码:26 / 33
页数:8
相关论文
共 50 条
  • [31] Recent development of machine learning models for the prediction of drug-drug interactions
    Hong, Eujin
    Jeon, Junhyeok
    Kim, Hyun Uk
    KOREAN JOURNAL OF CHEMICAL ENGINEERING, 2023, 40 (02) : 276 - 285
  • [32] Global and local computational models for aqueous solubility prediction of drug-like molecules
    Bergström, CAS
    Wassvik, CM
    Norinder, U
    Luthman, K
    Artursson, P
    JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 2004, 44 (04): : 1477 - 1488
  • [33] Scaling Machine Learning for Target Prediction in Drug Discovery using Apache Spark
    Harnie, Dries
    Vapirev, Alexander E.
    Wegner, Jorg Kurt
    Gedich, Andrey
    Steijaert, Marvin
    Wuyts, Roel
    De Meuter, Wolfgang
    2015 15TH IEEE/ACM INTERNATIONAL SYMPOSIUM ON CLUSTER, CLOUD AND GRID COMPUTING, 2015, : 871 - 879
  • [34] Machine learning in preclinical drug discovery
    Catacutan, Denise B.
    Alexander, Jeremie
    Arnold, Autumn
    Stokes, Jonathan M.
    NATURE CHEMICAL BIOLOGY, 2024, 20 (08) : 960 - 973
  • [35] Scaling machine learning for target prediction in drug discovery using Apache Spark
    Harnie, Dries
    Saey, Mathijs
    Vapirev, Alexander E.
    Wegner, Jorg Kurt
    Gedich, Andrey
    Steijaert, Marvin
    Ceulemans, Hugo
    Wuyts, Roel
    De Meuter, Wolfgang
    FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2017, 67 : 409 - 417
  • [36] Machine learning in chemoinformatics and drug discovery
    Lo, Yu-Chen
    Rensi, Stefano E.
    Torng, Wen
    Altman, Russ B.
    DRUG DISCOVERY TODAY, 2018, 23 (08) : 1538 - 1546
  • [37] Machine Learning in Drug Discovery: A Review
    Dara, Suresh
    Dhamercherla, Swetha
    Jadav, Surender Singh
    Babu, C. H. Madhu
    Ahsan, Mohamed Jawed
    ARTIFICIAL INTELLIGENCE REVIEW, 2022, 55 (03) : 1947 - 1999
  • [38] Machine Learning in Drug Discovery and Development
    Wale, Nikil
    DRUG DEVELOPMENT RESEARCH, 2011, 72 (01) : 112 - 119
  • [39] Machine Learning in Drug Discovery: A Review
    Suresh Dara
    Swetha Dhamercherla
    Surender Singh Jadav
    CH Madhu Babu
    Mohamed Jawed Ahsan
    Artificial Intelligence Review, 2022, 55 : 1947 - 1999
  • [40] Machine Learning Methods in Drug Discovery
    Patel, Lauv
    Shukla, Tripti
    Huang, Xiuzhen
    Ussery, David W.
    Wang, Shanzhi
    MOLECULES, 2020, 25 (22):