Crystal synthesizability prediction using contrastive positive unlabeled learning

被引:0
|
作者
Sun, Tao [1 ,3 ]
Yuan, Jianmei [1 ,2 ,3 ]
机构
[1] Xiangtan Univ, Sch Math & Computat Sci, Xiangtan 411105, Hunan, Peoples R China
[2] Xiangtan Univ, Minist Educ, Key Lab Intelligent Comp & Informat Proc, Xiangtan 411105, Hunan, Peoples R China
[3] Natl Ctr Appl Math Hunan, Xiangtan 411105, Hunan, Peoples R China
关键词
Perovskite materials; Photovoltaic applications; Contrastive learning; Positive unlabeled learning; PEROVSKITE; ALGORITHM;
D O I
10.1016/j.cpc.2024.109465
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
High-throughput screening or generative models rapidly identify crystal structures with the desired properties, but the synthesizable ratio is generally low. Experimentally verifying the synthesizability of individual virtual crystals would entail significant time and resources. Therefore, a method for automatically assessing the synthesizability of virtual crystals is urgently needed. This paper describes an approach that combines contrastive learning and positive unlabeled learning. The resulting contrastive positive unlabeled learning (CPUL) model predicts the crystal-likeness score (CLscore) of virtual materials. The model achieves a true positive (CLscore > 0.5) prediction accuracy of 93.95% on a test set containing 10,000 materials taken from the Materials Project (MP) database. We further validate the model by using all Fe-containing materials from the MP database as the test set, obtaining a true positive rate of 88.89%. This indicates that the CPUL model performs well, even with limited knowledge of the interactions between Fe and the atoms in the crystals. The CPUL model is then used to assess the CLscore of virtual crystals in the MP database and analyze their synthesizability by combining the energy above the hull. Finally, the synthesizability of perovskite materials is predicted using the proposed CPUL model, resulting in seven candidate halide perovskite materials for photovoltaic applications.
引用
收藏
页数:9
相关论文
共 50 条
  • [31] Learning from positive and unlabeled examples
    Denis, F
    Gilleron, R
    Letouzey, F
    THEORETICAL COMPUTER SCIENCE, 2005, 348 (01) : 70 - 83
  • [32] Positive and unlabeled learning in categorical data
    Ienco, Dino
    Pensa, Ruggero G.
    NEUROCOMPUTING, 2016, 196 : 113 - 124
  • [33] Positive and Unlabeled Learning with Label Disambiguation
    Zhang, Chuang
    Ren, Dexin
    Liu, Tongliang
    Yang, Jian
    Gong, Chen
    PROCEEDINGS OF THE TWENTY-EIGHTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2019, : 4250 - 4256
  • [34] Learning from positive and unlabeled examples
    Letouzey, F
    Denis, F
    Gilleron, R
    ALGORITHMIC LEARNING THEORY, PROCEEDINGS, 2000, 1968 : 71 - 85
  • [35] Robust and unbiased positive and unlabeled learning
    Liu, Yinjie
    Zhao, Jie
    Xu, Yitian
    KNOWLEDGE-BASED SYSTEMS, 2023, 277
  • [36] Multi-Positive and Unlabeled Learning
    Xu, Yixing
    Xu, Chang
    Xu, Chao
    Tao, Dacheng
    PROCEEDINGS OF THE TWENTY-SIXTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2017, : 3182 - 3188
  • [37] Positive unlabeled learning with tensor networks
    Zunkovic, Bojan
    NEUROCOMPUTING, 2023, 552
  • [38] False positive rate control for positive unlabeled learning
    Kong, Shuchen
    Shen, Weiwei
    Zheng, Yingbin
    Zhang, Ao
    Pu, Jian
    Wang, Jun
    NEUROCOMPUTING, 2019, 367 : 13 - 19
  • [39] Learning with Positive and Unlabeled Examples Using Topic-Sensitive PLSA
    Zhou, Ke
    Xue, Gui-Rong
    Yang, Qiang
    Yu, Yong
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2010, 22 (01) : 46 - 58
  • [40] Automatic Identification of Teachers in Social Media using Positive Unlabeled Learning
    Karimi, Hamid
    Tang, Jiliang
    Weiss, Xochitl
    Huang, Jiangtao
    2021 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2021, : 643 - 652