An accurate and interpretable deep learning model for environmental properties prediction using hybrid molecular representations

被引:25
|
作者
Zhang, Jun [1 ]
Wang, Qin [2 ]
Su, Yang [3 ]
Jin, Saimeng [1 ]
Ren, Jingzheng [4 ]
Eden, Mario [5 ]
Shen, Weifeng [1 ]
机构
[1] Chongqing Univ, Sch Chem & Chem Engn, Chongqing 400044, Peoples R China
[2] Chongqing Univ Sci & Technol, Sch Chem & Chem Engn, Chongqing, Peoples R China
[3] Chongqing Univ Sci & Technol, Sch Intelligent Technol & Engn, Chongqing, Peoples R China
[4] Hong Kong Polytech Univ, Dept Ind & Syst Engn, Hong Kong, Peoples R China
[5] Auburn Univ, Dept Chem Engn, Auburn, AL 36849 USA
基金
中国国家自然科学基金;
关键词
deep learning network; interpretability; lipophilicity; message-passing neural network; QSPR; WATER PARTITION-COEFFICIENTS; OCTANOL-WATER; EXTRACTIVE DISTILLATION; IONIC LIQUIDS; GREEN CHEMISTRY; LIPOPHILICITY; DESIGN; APPLICABILITY;
D O I
10.1002/aic.17634
中图分类号
TQ [化学工业];
学科分类号
0817 ;
摘要
Lipophilicity, as quantified by the decimal logarithm of the octanol-water partition coefficient (log K-OW), is an essential environmental property. Deep neural networks (DNNs) based quantitative structure-property relationship (QSPR) studies have received more and more attention because of their excellent performance for prediction. However, the black-box nature of DNNs limits the application range where interpretability is essential. Hence, this study aims to develop an accurate and interpretable deep neural network (AI-DNN) model for log K-OW prediction. A hybrid method of molecular representation was employed to guarantee the accuracy of the proposed AI-DNN model. The hybrid molecular representations are able to integrate the directed message passing neural networks (D-MPNNs) learned molecular representations and the fixed molecule-level features of CDK descriptors, and can capture both the local and the global features of overall molecule. The performance analysis shows that the proposed QSPR model exhibits promising predictive accuracy and discriminative power in the structural isomers and stereoisomers. Moreover, the Monte Carlo Tree Search (MCTS) approach was used to interpret the proposed AI-DNN model by identifying the molecular substructures contributed to the lipophilicity. This interpretability can be applied to critical fields where there is a high demand for interpretable deep networks, such as green solvent design and drug discovery.
引用
收藏
页数:13
相关论文
共 50 条
  • [1] A Deep Multimodal Representation Learning Framework for Accurate Molecular Properties Prediction
    Yang, Yuxin
    Wang, Zixu
    Ahadian, Pegah
    Jerger, Abby
    Zucker, Jeremy
    Feng, Song
    Cheng, Feixiong
    Guan, Qiang
    [J]. PROCEEDING OF THE GREAT LAKES SYMPOSIUM ON VLSI 2024, GLSVLSI 2024, 2024, : 760 - 765
  • [2] An Interpretable Machine Learning Model for Accurate Prediction of Sepsis in the ICU
    Nemati, Shamim
    Holder, Andre
    Razmi, Fereshteh
    Stanley, Matthew D.
    Clifford, Gari D.
    Buchman, Timothy G.
    [J]. CRITICAL CARE MEDICINE, 2018, 46 (04) : 547 - 553
  • [3] A Novel Interpretable Deep Learning Model for Ozone Prediction
    Chen, Xingguo
    Li, Yang
    Xu, Xiaoyan
    Shao, Min
    [J]. APPLIED SCIENCES-BASEL, 2023, 13 (21):
  • [4] Using an interpretable deep learning model for the prediction of riverine suspended sediment load
    Mohammadi-Raigani Z.
    Gholami H.
    Mohamadifar A.
    Samani A.N.
    Pradhan B.
    [J]. Environmental Science and Pollution Research, 2024, 31 (22) : 32480 - 32493
  • [5] Prediction of anticancer drug sensitivity using an interpretable model guided by deep learning
    Pang, Weixiong
    Chen, Ming
    Qin, Yufang
    [J]. BMC BIOINFORMATICS, 2024, 25 (01)
  • [6] Antibody structure prediction using interpretable deep learning
    Ruffolo, Jeffrey A.
    Sulam, Jeremias
    Gray, Jeffrey J.
    [J]. PATTERNS, 2022, 3 (02):
  • [7] Accurate prediction of somatic variants using deep learning model.
    Zhang, Peng
    Wang, Kai
    Yao, Ming
    Wang, Aodi
    Chen, Lijuan
    Liu, Angen
    Shi, Xiaoliang
    Zhang, Shiyue
    [J]. JOURNAL OF CLINICAL ONCOLOGY, 2020, 38 (15)
  • [8] Pretraining deep learning molecular representations for property prediction
    Liu, Bowen
    Hu, Weihua
    Leskovec, Jure
    Liang, Percy
    Pande, Vijay
    [J]. ABSTRACTS OF PAPERS OF THE AMERICAN CHEMICAL SOCIETY, 2019, 258
  • [9] Interpretable Deep Learning Prediction Model for Compressive Strength of Concrete
    Zhang, Wei-Qi
    Wang, Hui-Ming
    [J]. Dongbei Daxue Xuebao/Journal of Northeastern University, 2024, 45 (05): : 738 - 744
  • [10] An advanced hybrid deep learning model for accurate energy load prediction in smart building
    Sunder, R.
    Sreeraj, R.
    Paul, Vince
    Punia, Sanjeev Kumar
    Konduri, Bhagavan
    Nabilal, Khan Vajid
    Lilhore, Umesh Kumar
    Lohani, Tarun Kumar
    Ghith, Ehab
    Tlija, Mehdi
    [J]. ENERGY EXPLORATION & EXPLOITATION, 2024, : 2241 - 2269