Machine Learning-Guided Prediction of Cocrystals Using Point Cloud-Based Molecular Representation

Cited by: 5
Authors
Ahmadi, Soroush [1]
Ghanavati, Mohammad Amin [1]
Rohani, Sohrab [1]
Affiliations
[1] Western Univ, Chem & Biochem Engn, London, ON N6A 5B9, Canada
Funding
Natural Sciences and Engineering Research Council of Canada;
Keywords
Design for testability - Machine learning - Physicochemical properties - Salicylic acid - Synthesis (chemical);
DOI
10.1021/acs.chemmater.3c01437
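Chinese Library Classification number
O64 [Physical chemistry (theoretical chemistry), chemical physics];
Discipline classification code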
070304 ; 081704 ;
Abstract
The design and synthesis of cocrystals have emerged as promising crystal engineering strategies for enhancing the physicochemical properties of a diverse range of target molecules. A prediction strategy to identify whether a pair of target and auxiliary molecules would form a cocrystal can greatly accelerate the process of cocrystal discovery. In this study, we compiled and performed DFT calculations for 12,776 molecules (6,388 cocrystals). All entries in the database were obtained from experimental attempts reported in the literature. Electrostatic potential (ESP) surfaces were then extracted from the DFT results and used for the development of four machine learning models (PointNet, ANN, RF, Ensemble). The Ensemble model, leveraging the complementary strengths of the PointNet, ANN, and RF models, demonstrated superior discriminatory performance with a BACC (0.942) and an AUC (0.986) on the unseen test data subset. To assess the performance of the models on individual molecules, we separated the cocrystals of caffeine, fumaric acid, and salicylic acid from the overall database. The Ensemble model exhibited remarkable robustness, classifying the 312 cocrystals in this subset into their respective classes, with an average BACC of 98%. Furthermore, through conducting data analysis, 132 batches of cocrystal instances were gathered. After three batches were excluded, our proposed models were tested with these previously unseen molecules both before and after implementation of a batchwise retraining method.
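The abstract reports balanced accuracy (BACC) for an Ensemble that combines PointNet, ANN, and RF outputs, but it does not say how the three models are combined. The sketch below assumes simple soft voting (averaging each model's predicted cocrystal probability and thresholding at 0.5), which may differ from the authors' method. The function names and the toy probabilities are hypothetical and are not data from the paper.

```python
from typing import List, Sequence


def balanced_accuracy(y_true: Sequence[int], y_pred: Sequence[int]) -> float:
    """Mean of sensitivity (recall on class 1) and specificity (recall on class 0)."""
    pos = sum(1 for t in y_true if t == 1)
    neg = sum(1 for t in y_true if t == 0)
    if pos == 0 or neg == 0:
        raise ValueError("both classes must be present in y_true")
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
    tn = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 0)
    return 0.5 * (tp / pos + tn / neg)


def soft_vote(prob_lists: Sequence[Sequence[float]], threshold: float = 0.5) -> List[int]:
    """Assumed ensembling rule: average per-model probabilities, then threshold."""
    n_models = len(prob_lists)
    means = [sum(column) / n_models for column in zip(*prob_lists)]
    return [1 if m >= threshold else 0 for m in means]


# Toy labels: 1 = pair forms a cocrystal, 0 = it does not (illustrative only).
y_true = [1, 1, 1, 0, 0, 0]
pointnet_probs = [0.9, 0.4, 0.8, 0.2, 0.6, 0.1]
ann_probs = [0.8, 0.7, 0.3, 0.3, 0.2, 0.4]
rf_probs = [0.7, 0.6, 0.6, 0.1, 0.4, 0.2]

ensemble_pred = soft_vote([pointnet_probs, ann_probs, rf_probs])
print("Ensemble BACC:", balanced_accuracy(y_true, ensemble_pred))
```

In this toy case each model makes a different mistake, so the averaged probabilities can classify pairs that a single model gets wrong. That is one plausible reading of the "complementary strengths" the abstract mentions.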
Pages: 1153 - 1161
Page count: 9
Related papers
50 in total
  • [21] Machine learning-guided property prediction of energetic materials: Recent advances, challenges, and perspectives
    Tian, Xiao-lan
    Song, Si-wei
    Chen, Fang
    Qi, Xiu-juan
    Wang, Yi
    Zhang, Qing-hua
    ENERGETIC MATERIALS FRONTIERS, 2022, 3 (03): 177 - 186
  • [22] A Cloud-based Architecture for Condition Monitoring based on Machine Learning
    Arevalo, Fernando
    Diprasetya, Mochammad Rizky
    Schwung, Andreas
    2018 IEEE 16TH INTERNATIONAL CONFERENCE ON INDUSTRIAL INFORMATICS (INDIN), 2018, : 163 - 168
  • [23] Guarding the Cloud: An Effective Detection of Cloud-Based Cyber Attacks using Machine Learning Algorithms
    Rexha, Blerim
    Thaqi, Rrezearta
    Mazrekaj, Artan
    Vishi, Kamer
    INTERNATIONAL JOURNAL OF ONLINE AND BIOMEDICAL ENGINEERING, 2023, 19 (18) : 158 - 174
  • [24] Cloud-based email phishing attack using machine and deep learning algorithm
    Umer Ahmed Butt
    Rashid Amin
    Hamza Aldabbas
    Senthilkumar Mohan
    Bader Alouffi
    Ali Ahmadian
    Complex & Intelligent Systems, 2023, 9 : 3043 - 3070
  • [25] Cloud-Based Diabetes Decision Support System Using Machine Learning Fusion
    Aftab, Shabib
    Alanazi, Saad
    Ahmad, Munir
    Khan, Muhammad Adnan
    Fatima, Areej
    Elmitwally, Nouh Sabri
    CMC-COMPUTERS MATERIALS & CONTINUA, 2021, 68 (01): 1341 - 1357
  • [26] Cloud-based email phishing attack using machine and deep learning algorithm
    Butt, Umer Ahmed
    Amin, Rashid
    Aldabbas, Hamza
    Mohan, Senthilkumar
    Alouffi, Bader
    Ahmadian, Ali
    COMPLEX & INTELLIGENT SYSTEMS, 2023, 9 (03) : 3043 - 3070
  • [27] Collaborative machine learning-guided overall survival prediction of oral squamous cell carcinoma
    Alabi, Rasheed Omobolaji
    Elmusrati, Mohammed
    Leivo, Ilmo
    Almangush, Alhadi
    Makitie, Antti A.
    ACTA OTO-LARYNGOLOGICA, 2024,
  • [28] A Short Review on the Machine Learning-Guided Oxygen Uptake Prediction for Sport Science Applications
    Alzamer, Haneen
    Abuhmed, Tamer
    Hamad, Kotiba
    ELECTRONICS, 2021, 10 (16)
  • [29] Machine Learning-guided Observational Method for Prediction of Preloading-induced Consolidation Settlement
    Tian, Hua-Ming
    Lee, Siew-Wei
    Wang, Yu
    COMPUTERS AND GEOTECHNICS, 2025, 181
  • [30] Machine Learning-Guided Prediction of Desalination Capacity and Rate of Porous Carbons for Capacitive Deionization
    Wang, Hao
    Jiang, Mingxi
    Xu, Guangsheng
    Wang, Chenglong
    Xu, Xingtao
    Liu, Yong
    Li, Yuquan
    Lu, Ting
    Yang, Guang
    Pan, Likun
    SMALL, 2024, 20 (42)