Machine Learning-Guided Prediction of Cocrystals Using Point Cloud-Based Molecular Representation

被引:5
|
作者
Ahmadi, Soroush [1 ]
Ghanavati, Mohammad Amin [1 ]
Rohani, Sohrab [1 ]
机构
[1] Western Univ, Chem & Biochem Engn, London, ON N6A 5B9, Canada
基金
加拿大自然科学与工程研究理事会;
关键词
Design for testability - Machine learning - Physicochemical properties - Salicylic acid - Synthesis (chemical);
D O I
10.1021/acs.chemmater.3c01437
中图分类号
O64 [物理化学(理论化学)、化学物理学];
学科分类号
070304 ; 081704 ;
摘要
The design and synthesis of cocrystals have emerged as promising crystal engineering strategies for enhancing the physicochemical properties of a diverse range of target molecules. A prediction strategy to identify whether a pair of target and auxiliary molecules would form a cocrystal can greatly accelerate the process of cocrystal discovery. In this study, we compiled and performed DFT calculations for 12,776 molecules (6,388 cocrystals). All entries in the database were obtained from experimental attempts reported in the literature. Electrostatic potential (ESP) surfaces were then extracted from the DFT results and used for the development of four machine learning models (PointNet, ANN, RF, Ensemble). The Ensemble model, leveraging the complementary strengths of the PointNet, ANN, and RF models, demonstrated superior discriminatory performance with a BACC (0.942) and an AUC (0.986) on the unseen test data subset. To assess the performance of the models on individual molecules, we separated the cocrystals of caffeine, fumaric acid, and salicylic acid from the overall database. The Ensemble model exhibited remarkable robustness, classifying the 312 cocrystals in this subset into their respective classes, with an average BACC of 98%. Furthermore, through conducting data analysis, 132 batches of cocrystal instances were gathered. After three batches were excluded, our proposed models were tested with these previously unseen molecules both before and after implementation of a batchwise retraining method.
引用
收藏
页码:1153 / 1161
页数:9
相关论文
共 50 条
  • [41] Cloud-based machine learning for the detection of anonymous web proxies
    Miller, Shane
    Curran, Kevin
    Lunney, Tom
    2016 27TH IRISH SIGNALS AND SYSTEMS CONFERENCE (ISSC), 2016,
  • [42] MACHINE LEARNING-GUIDED EARLY PREDICTION OF PERSISTENT ACUTE KIDNEY INJURY AFTER CARDIAC SURGERY
    Jangda, Mateen
    Desman, Jacob
    Takkavatakarn, Kullaya
    Yimen, Mekeleya
    Kumar, Gagan
    McCarthy, Paul
    Kohli-Seth, Roopa
    Nadkarni, Girish
    Sakhuja, Ankit
    CRITICAL CARE MEDICINE, 2024, 52
  • [43] Machine Learning-Based Precipitation Prediction Using Cloud Properties
    Yakubu, Abdulaziz Tunde
    Abayomi, Abdultaofeek
    Chetty, Naven
    HYBRID INTELLIGENT SYSTEMS, HIS 2021, 2022, 420 : 243 - 252
  • [44] Advanced Cloud-Based Prediction Models for Cardiovascular Disease: Integrating Machine Learning and Feature Selection Techniques
    Dhiyanesh B.
    Ammal S.G.
    Saranya K.
    Narayana K.E.
    SN Computer Science, 5 (5)
  • [45] Rapid traversal of vast chemical space using machine learning-guided docking screens
    Luttens, Andreas
    de Vaca, Israel Cabeza
    Sparring, Leonard
    Brea, Jose
    Martinez, Anton Leandro
    Kahlous, Nour Aldin
    Radchenko, Dmytro S.
    Moroz, Yurii S.
    Loza, Maria Isabel
    Norinder, Ulf
    Carlsson, Jens
    NATURE COMPUTATIONAL SCIENCE, 2025, : 301 - 312
  • [46] Accelerating the discovery of novel magnetic materials using machine learning-guided adaptive feedback
    Xia, Weiyi
    Sakurai, Masahiro
    Balasubramanian, Balamurugan
    Liao, Timothy
    Wang, Renhai
    Zhang, Chao
    Sun, Huaijun
    Ho, Kai-Ming
    Chelikowsky, James R.
    Sellmyer, David J.
    Wang, Cai-Zhuang
    PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2022, 119 (47)
  • [47] Assessment of intracranial aneurysm rupture risk using a point cloud-based deep learning model
    Cao, Heshan
    Zeng, Hui
    Lv, Lei
    Wang, Qi
    Ouyang, Hua
    Gui, Long
    Hua, Ping
    Yang, Songran
    FRONTIERS IN PHYSIOLOGY, 2024, 15
  • [48] Countering Malware Evolution Using Cloud-Based Learning
    Ouellette, Jacob
    Pfeffer, Avi
    Lakhotia, Arun
    PROCEEDINGS OF THE 2013 8TH INTERNATIONAL CONFERENCE ON MALICIOUS AND UNWANTED SOFTWARE: THE AMERICAS (MALWARE), 2013, : 85 - 94
  • [49] Point cloud-based dimensional quality assessment of precast concrete components using deep learning
    Shu, Jiangpeng
    Li, Wenhao
    Zhang, Congguang
    Gao, Yifan
    Xiang, Yiqiang
    Ma, Ling
    JOURNAL OF BUILDING ENGINEERING, 2023, 70
  • [50] Using machine learning for service candidate sets retrieval in service composition of cloud-based manufacturing
    Hamed Bouzary
    F. Frank Chen
    Mohammad Shahin
    The International Journal of Advanced Manufacturing Technology, 2021, 115 : 941 - 948