One class classification as a practical approach for accelerating π-π co-crystal discovery

被引:18
|
作者
Vriza, Aikaterini [1 ,2 ,3 ]
Canaj, Angelos B. [1 ,2 ]
Vismara, Rebecca [1 ,2 ]
Cook, Laurence J. Kershaw [1 ,2 ]
Manning, Troy D. [1 ,2 ]
Gaultois, Michael W. [1 ,2 ,3 ]
Wood, Peter A. [4 ]
Kurlin, Vitaliy [5 ]
Berry, Neil [1 ,2 ]
Dyer, Matthew S. [1 ,2 ,3 ]
Rosseinsky, Matthew J. [1 ,2 ,3 ]
机构
[1] Univ Liverpool, Dept Chem, 51 Oxford St, Liverpool L7 3NY, Merseyside, England
[2] Univ Liverpool, Mat Innovat Factory, 51 Oxford St, Liverpool L7 3NY, Merseyside, England
[3] Univ Liverpool, Leverhulme Res Ctr Funct Mat Design, Oxford St, Oxford, England
[4] Cambridge Crystallog Data Ctr, 12 Union Rd, Cambridge CB2 1EZ, England
[5] Univ Liverpool, Dept Comp Sci, Mat Innovat Factory, Liverpool L69 3BX, Merseyside, England
基金
英国工程与自然科学研究理事会;
关键词
CHARGE-TRANSFER; ORGANIC COCRYSTALS; MOLECULAR-COMPLEX; DESIGN; ANTHRACENE; PYRENE; WILL;
D O I
10.1039/d0sc04263c
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
The implementation of machine learning models has brought major changes in the decision-making process for materials design. One matter of concern for the data-driven approaches is the lack of negative data from unsuccessful synthetic attempts, which might generate inherently imbalanced datasets. We propose the application of the one-class classification methodology as an effective tool for tackling these limitations on the materials design problems. This is a concept of learning based only on a well-defined class without counter examples. An extensive study on the different one-class classification algorithms is performed until the most appropriate workflow is identified for guiding the discovery of emerging materials belonging to a relatively small class, that being the weakly bound polyaromatic hydrocarbon co-crystals. The two-step approach presented in this study first trains the model using all the known molecular combinations that form this class of co-crystals extracted from the Cambridge Structural Database (1722 molecular combinations), followed by scoring possible yet unknown pairs from the ZINC15 database (21 736 possible molecular combinations). Focusing on the highest-ranking pairs predicted to have higher probability of forming co-crystals, materials discovery can be accelerated by reducing the vast molecular space and directing the synthetic efforts of chemists. Further on, using interpretability techniques a more detailed understanding of the molecular properties causing co-crystallization is sought after. The applicability of the current methodology is demonstrated with the discovery of two novel co-crystals, namely pyrene-6H-benzo[c]chromen-6-one (1) and pyrene-9,10-dicyanoanthracene (2).
引用
收藏
页码:1702 / 1719
页数:18
相关论文
共 50 条
  • [21] Coupling complementary strategy to flexible graph neural network for quick discovery of coformer in diverse co-crystal materials
    Jiang, Yuanyuan
    Yang, Zongwei
    Guo, Jiali
    Li, Hongzhen
    Liu, Yijing
    Guo, Yanzhi
    Li, Menglong
    Pu, Xuemei
    NATURE COMMUNICATIONS, 2021, 12 (01)
  • [22] A thermodynamic based approach on the investigation of a diflunisal pharmaceutical co-crystal with improved intrinsic dissolution rate
    Evora, Antonio O. L.
    Castro, Ricardo A. E.
    Maria, Teresa M. R.
    Silva, M. Ramos
    ter Horst, J. H.
    Canotilho, Joao
    Eusebio, M. Ermelinda S.
    INTERNATIONAL JOURNAL OF PHARMACEUTICS, 2014, 466 (1-2) : 68 - 75
  • [23] Unlocking the Potential of Machine Learning in Co-crystal Prediction by a Novel Approach Integrating Molecular Thermodynamics
    Song, Yutong
    Ding, Yewei
    Su, Junyi
    Li, Jian
    Ji, Yuanhui
    ANGEWANDTE CHEMIE-INTERNATIONAL EDITION, 2025,
  • [24] Coupling complementary strategy to flexible graph neural network for quick discovery of coformer in diverse co-crystal materials
    Yuanyuan Jiang
    Zongwei Yang
    Jiali Guo
    Hongzhen Li
    Yijing Liu
    Yanzhi Guo
    Menglong Li
    Xuemei Pu
    Nature Communications, 12
  • [25] Trustworthiness of X Users: A One-Class Classification Approach
    Khan, Tanveer
    Michalas, Fahad Sohrab Antonis
    Michalas, Antonis
    Gabbouj, Moncef
    ADVANCED INFORMATION NETWORKING AND APPLICATIONS, VOL 2, AINA 2024, 2024, 200 : 331 - 343
  • [26] A one-class classification approach for bot detection on Twitter
    Rodriguez-Ruiz, Jorge
    Israel Mata-Sanchez, Javier
    Monroy, Raul
    Loyola-Gonzalez, Octavio
    Lopez-Cuevas, Armando
    COMPUTERS & SECURITY, 2020, 91
  • [27] A One-Class Classification Approach for Protein Sequences and Structures
    Banhalmi, Andras
    Busa-Fekete, Robert
    Kegl, Balazs
    BIOINFORMATICS RESEARCH AND APPLICATIONS: 5TH INTERNATIONAL SYMPOSIUM, ISBRA 2009, 2009, 5542 : 310 - +
  • [28] [2+2] solid-state photodimerisation in a ternary co-crystal: a synthon approach
    Raju, K. Bal
    Reddy, Chilla Malla
    ACTA CRYSTALLOGRAPHICA A-FOUNDATION AND ADVANCES, 2017, 73 : C763 - C763
  • [29] Classification of cancerous cells based on the one-class problem approach
    Murshed, NA
    Bortolozzi, F
    Sabourin, R
    APPLICATIONS AND SCIENCE OF ARTIFICIAL NEURAL NETWORKS II, 1996, 2760 : 487 - 494
  • [30] Mapping raised bogs with an iterative one-class classification approach
    Mack, Benjamin
    Roscher, Ribana
    Stenzel, Stefanie
    Feilhauer, Hannes
    Schmidtlein, Sebastian
    Waske, Bjoern
    ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2016, 120 : 53 - 64