Evaluation of Machine Learning Models for Aqueous Solubility Prediction in Drug Discovery

被引:0
|
作者
Xue, Nian [1 ]
Zhang, Yuzhu [2 ]
Liu, Sensen [3 ]
机构
[1] NYU, Dept Comp Sc & Engn, New York, NY USA
[2] Carnegie Mellon Univ, Sch Comp Sc, Pittsburgh, PA 15213 USA
[3] Washington Univ, Dept Elect & Syst Engn, St Louis, MO 63110 USA
关键词
Machine Learning; Solubility Prediction; Drug Discovery; Feature Importance; DESCRIPTORS; QSAR;
D O I
10.1109/ICAIBD62003.2024.10604556
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Determining the aqueous solubility of the chemical compound is of great importance in-silico drug discovery. However, correctly and rapidly predicting the aqueous solubility remains a challenging task. This paper explores and evaluates the predictability of multiple machine learning models in the aqueous solubility of compounds. Specifically, we apply a series of machine learning algorithms, including Random Forest, XG-Boost, LightGBM, and CatBoost, on a well-established aqueous solubility dataset (i.e., the Huuskonen dataset) of over 1200 compounds. Experimental results show that even traditional machine learning algorithms can achieve satisfactory performance with high accuracy. In addition, our investigation goes beyond mere prediction accuracy, delving into the interpretability of models to identify key features and understand the molecular properties that influence the predicted outcomes. This study sheds light on the ability to use machine learning approaches to predict compound solubility, significantly shortening the time that researchers spend on new drug discovery.
引用
收藏
页码:26 / 33
页数:8
相关论文
共 50 条
  • [21] Comparing and validating machine learning models for Mycobacterium tuberculosis drug discovery
    Lane, Thomas
    Russo, Daniel
    Zorn, Kimberley
    Clark, Alex
    Korotcov, Alexandru
    Tkachenko, Valery
    Reynolds, Robert
    Perryman, Alexander
    Freundlich, Joel
    Ekins, Sean
    ABSTRACTS OF PAPERS OF THE AMERICAN CHEMICAL SOCIETY, 2018, 256
  • [22] Dual-event machine learning models to accelerate drug discovery
    Ekins, Sean
    Reynolds, Robert C.
    Kim, Hiyun
    Koo, Mi-Sun
    Ekonomidis, Marilyn
    Talaue, Meliza
    Paget, Steve
    Woolhiser, Lisa
    Lenaerts, Anne J.
    Bunin, Barry A.
    Connell, Nancy
    Freundlich, Joel S.
    ABSTRACTS OF PAPERS OF THE AMERICAN CHEMICAL SOCIETY, 2013, 245
  • [23] Comparing and Validating Machine Learning Models for Mycobacterium tuberculosis Drug Discovery
    Lane, Thomas
    Russo, Daniel P.
    Zorn, Kimberley M.
    Clark, Alex M.
    Korotcov, Alexandru
    Tkachenko, Valery
    Reynolds, Robert C.
    Perryman, Alexander L.
    Freundlich, Joel S.
    Ekins, Sean
    MOLECULAR PHARMACEUTICS, 2018, 15 (10) : 4346 - 4360
  • [24] Development and sharing of ADME/Tox and drug discovery machine learning models
    Clark, Alex
    Dole, Krishna
    Coulon-Spector, Anna
    McNutt, Andrew
    Grass, George
    Freundlich, Joel
    Reynolds, Robert
    Ekins, Sean
    ABSTRACTS OF PAPERS OF THE AMERICAN CHEMICAL SOCIETY, 2015, 250
  • [25] In silico prediction of aqueous solubility – classification models
    C Kramer
    B Beck
    T Clark
    Chemistry Central Journal, 2 (Suppl 1)
  • [26] Expression of Concern: Application of extreme learning machine for prediction of aqueous solubility of carbon dioxide
    Mohammadian, Erfan
    Motamedi, Shervin
    Shamshirband, Shahaboddin
    Hashim, Roslan
    Junin, Radzuan
    Roy, Chandrabhushan
    Azdarpour, Amin
    ENVIRONMENTAL EARTH SCIENCES, 2020, 79 (12)
  • [27] Recent development of machine learning models for the prediction of drug-drug interactions
    Eujin Hong
    Junhyeok Jeon
    Hyun Uk Kim
    Korean Journal of Chemical Engineering, 2023, 40 : 276 - 285
  • [28] A hybrid approach to aqueous solubility prediction using COSMO-RS and machine learning
    Fhionnlaoich, Niamh Mac
    Zeglinski, Jacek
    Simon, Melba
    Wood, Barbara
    Davin, Sharon
    Glennon, Brian
    CHEMICAL ENGINEERING RESEARCH & DESIGN, 2024, 209 : 67 - 71
  • [29] Expression of Concern: Application of extreme learning machine for prediction of aqueous solubility of carbon dioxide
    Erfan Mohammadian
    Shervin Motamedi
    Shahaboddin Shamshirband
    Roslan Hashim
    Radzuan Junin
    Chandrabhushan Roy
    Amin Azdarpour
    Environmental Earth Sciences, 2020, 79
  • [30] A machine learning approach for the prediction of aqueous solubility of pharmaceuticals: a comparative model and dataset analysis
    Ghanavati, Mohammad Amin
    Ahmadi, Soroush
    Rohani, Sohrab
    DIGITAL DISCOVERY, 2024, 3 (10): : 2085 - 2104