Structure-Based Synthesizability Prediction of Crystals Using Partially Supervised Learning

被引:74
|
作者
Jang, Jidon [1 ]
Gu, Geun Ho [1 ]
Noh, Juhwan [1 ]
Kim, Juhwan [1 ]
Jung, Yousung [1 ]
机构
[1] Korea Adv Inst Sci & Technol, Dept Chem & Biomol Engn, Daejeon 34141, South Korea
基金
美国国家科学基金会;
关键词
DESIGN;
D O I
10.1021/jacs.0c07384
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
Predicting the synthesizability of inorganic materials is one of the major challenges in accelerated material discovery. A widely employed approximate approach is to consider the thermodynamic decomposition stability due to its simplicity of computing, but it is notorious for either producing too many candidates or missing important metastable materials. These results, however, are not unexcepted since the synthesizability is a complex phenomenon, and the thermodynamic stability is just one contributor. Here, we suggest a machine-learning model to quantify the probability of synthesis based on the partially supervised learning of materials database. We adapted the positive and unlabeled machine learning (PU learning) by implementing the graph convolutional neural network as a classifier in which the model outputs crystal-likeness scores (CLscore). The model shows 87.4% true positive (CLscore > 0.5) prediction accuracy for the test set of experimentally reported cases (9356 materials) in the Materials Project. We further validated the model by predicting the synthesizability of newly reported experimental materials in the last 5 years (2015-2019) with an 86.2% true positive rate using the model trained with the database as of the end of year 2014. Our analysis shows that our model captures the structural motif for synthesizability beyond what is possible by E-hull. We find that 71 materials among the top 100 high-scoring virtual materials have indeed been previously synthesized in the literature. With the proposed data-driven metric of the crystal-likeness score, high-throughput virtual screenings and generative models can benefit significantly by effectively reducing the chemical space that needs to be explored experimentally in the future toward more rational materials design.
引用
收藏
页码:18836 / 18843
页数:8
相关论文
共 50 条
  • [1] Synthesizability of materials stoichiometry using semi-supervised learning
    Jang, Jidon
    Noh, Juhwan
    Zhou, Lan
    Gu, Geun Ho
    Gregoire, John M.
    Jung, Yousung
    MATTER, 2024, 7 (06) : 2294 - 2312
  • [2] Interaction prediction in structure-based virtual screening using deep learning
    Gonczarek, Adam
    Tomczak, Jakub M.
    Zareba, Szymon
    Kaczmar, Joanna
    Dabrowski, Piotr
    Walczak, Michal J.
    COMPUTERS IN BIOLOGY AND MEDICINE, 2018, 100 : 253 - 258
  • [3] Physics-Guided Dual Self-Supervised Learning for Structure-Based Material Property Prediction
    Fu, Nihang
    Wei, Lai
    Hu, Jianjun
    JOURNAL OF PHYSICAL CHEMISTRY LETTERS, 2024, 15 (10): : 2841 - 2850
  • [4] Crystal synthesizability prediction using contrastive positive unlabeled learning
    Sun, Tao
    Yuan, Jianmei
    COMPUTER PHYSICS COMMUNICATIONS, 2025, 308
  • [5] Structure-based prediction of BRAF mutation classes using machine-learning approaches
    Krebs, Fanny S.
    Britschgi, Christian
    Pradervand, Sylvain
    Achermann, Rita
    Tsantoulis, Petros
    Haefliger, Simon
    Wicki, Andreas
    Michielin, Olivier
    Zoete, Vincent
    SCIENTIFIC REPORTS, 2022, 12 (01)
  • [6] Structure-based prediction of BRAF mutation classes using machine-learning approaches
    Fanny S. Krebs
    Christian Britschgi
    Sylvain Pradervand
    Rita Achermann
    Petros Tsantoulis
    Simon Haefliger
    Andreas Wicki
    Olivier Michielin
    Vincent Zoete
    Scientific Reports, 12
  • [7] Prediction of rifampicin resistance beyond the RRDR using structure-based machine learning approaches
    Portelli, Stephanie
    Myung, Yoochan
    Furnham, Nicholas
    Vedithi, Sundeep Chaitanya
    Pires, Douglas E. V.
    Ascher, David B.
    SCIENTIFIC REPORTS, 2020, 10 (01)
  • [8] Prediction of rifampicin resistance beyond the RRDR using structure-based machine learning approaches
    Stephanie Portelli
    Yoochan Myung
    Nicholas Furnham
    Sundeep Chaitanya Vedithi
    Douglas E. V. Pires
    David B. Ascher
    Scientific Reports, 10
  • [9] Structure-Based Approaches for Protein-Protein Interaction Prediction Using Machine Learning and Deep Learning
    Kiouri, Despoina P.
    Batsis, Georgios C.
    Chasapis, Christos T.
    BIOMOLECULES, 2025, 15 (01)
  • [10] SybilBelief: A Semi-Supervised Learning Approach for Structure-Based Sybil Detection
    Gong, Neil Zhenqiang
    Frank, Mario
    Mittal, Prateek
    IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2014, 9 (06) : 976 - 987