Inverse design of ligands using a deep generative model semi-supervised by a data-driven ligand field strength metric

被引:0
|
作者
Lee, Zhi-Hang [1 ]
Lin, Po Chuan [1 ]
Yang, Tzuhsiung [1 ]
机构
[1] Natl Tsing Hua Univ, Dept Chem, Hsinchu, Taiwan
关键词
crystal structures; data-driven discovery; deep generative model; inorganic complexes; ligand field strength; machine learning; TRANSITION-METAL-COMPLEXES; CATALYSTS; SERIES; STATES; SCALE;
D O I
10.1002/jccs.202300066
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
Transition metal (TM) complexes exhibit diverse structural and electronic properties. The properties of a TM complex can be tuned by modulating the ligand field strength (LFS) inflicted by its ligands. Current quantification of the LFS of a ligand is mainly derived from experimental measurements on a subset of highly symmetrical TM complexes and is limited in ligand scope. Herein, we report a data-driven method to quantify the LFS of ligands assigned from experimental crystal structures of TM complexes. We first show that the experimental metal-ligand bond lengths of over 4,000 mononuclear Fe, Co, and Mn complexes form bimodal distributions. Using Gaussian fits on the bimodal distributions, each TM complex is assigned a spin state (SS) label. These SS labels can then be used to calculate the LFS of the ligands of the complexes. Using the obtained data-driven LFS metric, we establish that a semi-supervised deep generative model, junction tree variational autoencoder (JTVAE), can be employed to predict LFS values. Our model exhibits a mean absolute error (MAE) of 0.047 and root mean squared error of 0.072 on the training set. The model also allows the generation of novel ligands with desirable LFS values.
引用
收藏
页码:1095 / 1101
页数:7
相关论文
共 50 条
  • [31] Semi-supervised COVID-19 CT image segmentation using deep generative models
    Judah Zammit
    Daryl L. X. Fung
    Qian Liu
    Carson Kai-Sang Leung
    Pingzhao Hu
    BMC Bioinformatics, 23
  • [32] Semi-supervised COVID-19 CT image segmentation using deep generative models
    Zammit, Judah
    Fung, Daryl L. X.
    Liu, Qian
    Leung, Carson Kai-Sang
    Hu, Pingzhao
    BMC BIOINFORMATICS, 2022, 23 (SUPPL 7)
  • [33] Data-driven Event Detection with Partial Knowledge: A Hidden Structure Semi-Supervised Learning Method
    Zhou, Yuxun
    Arghandeh, Reza
    Konstantakopoulos, Ioannis
    Abdullah, Shayaan
    Spanos, Costas J.
    2016 AMERICAN CONTROL CONFERENCE (ACC), 2016, : 5962 - 5968
  • [34] Metric learning for semi-supervised clustering using pairwise constraints and the geometrical structure of data
    Baghshah, Mahdieh Soleymani
    Shouraki, Saeed Bagheri
    INTELLIGENT DATA ANALYSIS, 2009, 13 (06) : 887 - 899
  • [35] A Data-Driven Semi-Supervised Soft-Sensor Method: Application on an Industrial Cracking Furnace
    Ma, Fangyuan
    Wang, Jingde
    Sun, Wei
    FRONTIERS IN CHEMICAL ENGINEERING, 2022, 4
  • [36] Semi-Supervised Learning Based Big Data-Driven Anomaly Detection in Mobile Wireless Networks
    Hussain, Bilal
    Du, Qinghe
    Ren, Pinyi
    CHINA COMMUNICATIONS, 2018, 15 (04) : 41 - 57
  • [37] Enhancing Classification of Energy Meters with Limited Labels using a Semi-Supervised Generative Model
    Fu, Chun
    Kazmi, Hussain
    Quintana, Matias
    Miller, Clayton
    PROCEEDINGS OF THE 10TH ACM INTERNATIONAL CONFERENCE ON SYSTEMS FOR ENERGY-EFFICIENT BUILDINGS, CITIES, AND TRANSPORTATION, BUILDSYS 2023, 2023, : 450 - 453
  • [38] Semi-Supervised Learning Based Big Data-Driven Anomaly Detection in Mobile Wireless Networks
    Bilal Hussain
    Qinghe Du
    Pinyi Ren
    China Communications, 2018, 15 (04) : 41 - 57
  • [39] Metric Learning Using Labeled and Unlabeled Data for Semi-Supervised/Domain Adaptation Classification
    Benisty, Hadas
    Crammer, Koby
    2014 IEEE 28TH CONVENTION OF ELECTRICAL & ELECTRONICS ENGINEERS IN ISRAEL (IEEEI), 2014,
  • [40] Semi-Supervised Multi-Label Learning from Crowds via Deep Sequential Generative Model
    Shi, Wanli
    Sheng, Victor S.
    Li, Xiang
    Gu, Bin
    KDD '20: PROCEEDINGS OF THE 26TH ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY & DATA MINING, 2020, : 1141 - 1149