One class classification as a practical approach for accelerating π-π co-crystal discovery

被引:18
|
作者
Vriza, Aikaterini [1 ,2 ,3 ]
Canaj, Angelos B. [1 ,2 ]
Vismara, Rebecca [1 ,2 ]
Cook, Laurence J. Kershaw [1 ,2 ]
Manning, Troy D. [1 ,2 ]
Gaultois, Michael W. [1 ,2 ,3 ]
Wood, Peter A. [4 ]
Kurlin, Vitaliy [5 ]
Berry, Neil [1 ,2 ]
Dyer, Matthew S. [1 ,2 ,3 ]
Rosseinsky, Matthew J. [1 ,2 ,3 ]
机构
[1] Univ Liverpool, Dept Chem, 51 Oxford St, Liverpool L7 3NY, Merseyside, England
[2] Univ Liverpool, Mat Innovat Factory, 51 Oxford St, Liverpool L7 3NY, Merseyside, England
[3] Univ Liverpool, Leverhulme Res Ctr Funct Mat Design, Oxford St, Oxford, England
[4] Cambridge Crystallog Data Ctr, 12 Union Rd, Cambridge CB2 1EZ, England
[5] Univ Liverpool, Dept Comp Sci, Mat Innovat Factory, Liverpool L69 3BX, Merseyside, England
基金
英国工程与自然科学研究理事会;
关键词
CHARGE-TRANSFER; ORGANIC COCRYSTALS; MOLECULAR-COMPLEX; DESIGN; ANTHRACENE; PYRENE; WILL;
D O I
10.1039/d0sc04263c
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
The implementation of machine learning models has brought major changes in the decision-making process for materials design. One matter of concern for the data-driven approaches is the lack of negative data from unsuccessful synthetic attempts, which might generate inherently imbalanced datasets. We propose the application of the one-class classification methodology as an effective tool for tackling these limitations on the materials design problems. This is a concept of learning based only on a well-defined class without counter examples. An extensive study on the different one-class classification algorithms is performed until the most appropriate workflow is identified for guiding the discovery of emerging materials belonging to a relatively small class, that being the weakly bound polyaromatic hydrocarbon co-crystals. The two-step approach presented in this study first trains the model using all the known molecular combinations that form this class of co-crystals extracted from the Cambridge Structural Database (1722 molecular combinations), followed by scoring possible yet unknown pairs from the ZINC15 database (21 736 possible molecular combinations). Focusing on the highest-ranking pairs predicted to have higher probability of forming co-crystals, materials discovery can be accelerated by reducing the vast molecular space and directing the synthetic efforts of chemists. Further on, using interpretability techniques a more detailed understanding of the molecular properties causing co-crystallization is sought after. The applicability of the current methodology is demonstrated with the discovery of two novel co-crystals, namely pyrene-6H-benzo[c]chromen-6-one (1) and pyrene-9,10-dicyanoanthracene (2).
引用
收藏
页码:1702 / 1719
页数:18
相关论文
共 50 条
  • [1] Thermodynamic Approach for Co-crystal Screening
    Veith, Heiner
    Schleinitz, Miko
    Schauerte, Carsten
    Sadowski, Gabriele
    CRYSTAL GROWTH & DESIGN, 2019, 19 (06) : 3253 - 3264
  • [2] Co-crystal forms of the BCS class IV drug sulfamethoxazole
    Alsubaie, Moneerh
    Aljohani, Marwah
    Erxleben, Andrea
    McArdle, Patrick
    ACTA CRYSTALLOGRAPHICA A-FOUNDATION AND ADVANCES, 2018, 74 : E192 - E192
  • [3] The host guest co-crystal approach to supramolecular structure
    Kane, JJ
    Nguyen, T
    Xiao, J
    Fowler, FW
    Lauher, JW
    MOLECULAR CRYSTALS AND LIQUID CRYSTALS, 2001, 356 : 449 - 458
  • [4] The co-crystal of two GRAS substances: (citric acid)•(nicotinamide). Formation of four hydrogen bonding heterosynthons in one co-crystal
    Lemmerer, Andreas
    Bernstein, Joel
    CRYSTENGCOMM, 2010, 12 (07): : 2029 - 2033
  • [5] The utility of a ternary phase diagram in the discovery of new co-crystal forms
    Chadwick, Keith
    Davey, Roger
    Sadiq, Ghazala
    Cross, Wendy
    Pritchard, Robin
    CRYSTENGCOMM, 2009, 11 (03): : 412 - 414
  • [6] Resampling approach for one-Class classification; Resampling approach for one-Class classification
    Lee H.-H.
    Park S.
    Im J.
    Pattern Recognition, 2023, 143
  • [7] Drug discovery and optimization based on the co-crystal structure of natural product with target
    Chen, Xing
    Varghese, Swapna
    Zhang, Zhaoyan
    Du, Juncheng
    Ruan, Banfeng
    Baell, Jonathan B.
    Liu, Xinhua
    EUROPEAN JOURNAL OF MEDICINAL CHEMISTRY, 2024, 266
  • [8] The co-crystal approach to improve the exposure of a water-insoluble compound: AMG 517 sorbic acid co-crystal characterization and pharmacokinetics
    Bak, Annette
    Gore, Anu
    Yanez, Evelyn
    Stanton, Mary
    Tufekcic, Sunita
    Syed, Rashid
    Akrami, Anna
    Rose, Mark
    Surapaneni, Sekhar
    Bostick, Tracy
    King, Anthony
    Neervannan, Sesha
    Ostovic, Drazen
    Koparkar, Arun
    JOURNAL OF PHARMACEUTICAL SCIENCES, 2008, 97 (09) : 3942 - 3956
  • [9] A practical approach to novel class discovery in tabular data
    Colin, Troisemaine
    Alexandre, Reiffers-Masson
    Stephane, Gosselin
    Vincent, Lemaire
    Sandrine, Vaton
    DATA MINING AND KNOWLEDGE DISCOVERY, 2024, 38 (04) : 2087 - 2116
  • [10] Caffeine Co-Crystal Mechanics Evaluated with a Combined Structural and Spectroscopic Approach
    Singaraju, Aditya B.
    Iyer, Mamta
    Haware, Rahul V.
    Stevens, Lewis L.
    CRYSTAL GROWTH & DESIGN, 2016, 16 (08) : 4383 - 4391