SolTranNet-A Machine Learning Tool for Fast Aqueous Solubility Prediction

被引:47
|
作者
Francoeur, Paul G. [1 ]
Koes, David R. [1 ]
机构
[1] Univ Pittsburgh, Dept Computat & Syst Biol, Pittsburgh, PA 15260 USA
关键词
FREE-ENERGIES;
D O I
10.1021/acs.jcim.1c00331
中图分类号
R914 [药物化学];
学科分类号
100701 ;
摘要
While accurate prediction of aqueous solubility remains a challenge in drug discovery, machine learning (ML) approaches have become increasingly popular for this task. For instance, in the Second Challenge to Predict Aqueous Solubility (SC2), all groups utilized machine learning methods in their submissions. We present SolTranNet, a molecule attention transformer to predict aqueous solubility from a molecule's SMILES representation. Atypically, we demonstrate that larger models perform worse at this task, with SolTranNet's final architecture having 3,393 parameters while outperforming linear ML approaches. SolTranNet has a 3-fold scaffold split cross-validation root-mean-square error (RMSE) of 1.459 on AqSolDB and an RMSE of 1.711 on a withheld test set. We also demonstrate that, when used as a classifier to filter out insoluble compounds, SolTranNet achieves a sensitivity of 94.8% on the SC2 data set and is competitive with the other methods submitted to the competition. SolTranNet is distributed via PIP, and its source code is available at https://github.com/gnina/SolTranNet.
引用
收藏
页码:2530 / 2536
页数:7
相关论文
共 50 条
  • [1] SolTranNet-A Machine Learning Tool for Fast Aqueous Solubility Prediction (vol 61, pg 2530, 2021)
    Francoeur, Paul G.
    Koes, David R.
    [J]. JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2021, 61 (08) : 4120 - 4123
  • [2] Application of extreme learning machine for prediction of aqueous solubility of carbon dioxide
    Erfan Mohammadian
    Shervin Motamedi
    Shahaboddin Shamshirband
    Roslan Hashim
    Radzuan Junin
    Chandrabhushan Roy
    Amin Azdarpour
    [J]. Environmental Earth Sciences, 2016, 75
  • [3] Evaluation of Machine Learning Models for Aqueous Solubility Prediction in Drug Discovery
    Xue, Nian
    Zhang, Yuzhu
    Liu, Sensen
    [J]. 2024 7TH INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND BIG DATA, ICAIBD 2024, 2024, : 26 - 33
  • [4] Application of extreme learning machine for prediction of aqueous solubility of carbon dioxide
    Mohammadian, Erfan
    Motamedi, Shervin
    Shamshirband, Shahaboddin
    Hashim, Roslan
    Junin, Radzuan
    Roy, Chandrabhushan
    Azdarpour, Amin
    [J]. ENVIRONMENTAL EARTH SCIENCES, 2016, 75 (03) : 1 - 11
  • [5] Expression of Concern: Application of extreme learning machine for prediction of aqueous solubility of carbon dioxide
    Mohammadian, Erfan
    Motamedi, Shervin
    Shamshirband, Shahaboddin
    Hashim, Roslan
    Junin, Radzuan
    Roy, Chandrabhushan
    Azdarpour, Amin
    [J]. ENVIRONMENTAL EARTH SCIENCES, 2020, 79 (12)
  • [6] Expression of Concern: Application of extreme learning machine for prediction of aqueous solubility of carbon dioxide
    Erfan Mohammadian
    Shervin Motamedi
    Shahaboddin Shamshirband
    Roslan Hashim
    Radzuan Junin
    Chandrabhushan Roy
    Amin Azdarpour
    [J]. Environmental Earth Sciences, 2020, 79
  • [7] A hybrid approach to aqueous solubility prediction using COSMO-RS and machine learning
    Fhionnlaoich, Niamh Mac
    Zeglinski, Jacek
    Simon, Melba
    Wood, Barbara
    Davin, Sharon
    Glennon, Brian
    [J]. CHEMICAL ENGINEERING RESEARCH & DESIGN, 2024, 209 : 67 - 71
  • [8] A machine learning approach for the prediction of aqueous solubility of pharmaceuticals: a comparative model and dataset analysis
    Ghanavati, Mohammad Amin
    Ahmadi, Soroush
    Rohani, Sohrab
    [J]. DIGITAL DISCOVERY, 2024,
  • [9] Prediction of CO2 solubility in aqueous amine solutions using machine learning method
    Liu, Bin
    Yu, Yanan
    Liu, Zijian
    Cui, Zhe
    Tian, Wende
    [J]. SEPARATION AND PURIFICATION TECHNOLOGY, 2025, 354
  • [10] Pruned Machine Learning Models to Predict Aqueous Solubility
    Perryman, Alexander L.
    Inoyama, Daigo
    Patel, Jimmy S.
    Ekins, Sean
    Freundlich, Joel S.
    [J]. ACS OMEGA, 2020, 5 (27): : 16562 - 16567