Data-Driven Imputation of Miscibility of Aqueous Solutions via Graph-Regularized Logistic Matrix Factorization

被引:3
|
作者
Behnoudfar, Diba [1 ]
Simon, Cory M. [1 ]
Schrier, Joshua [2 ]
机构
[1] Oregon State Univ, Sch Chem Biol & Environm Engn, Corvallis, OR 97331 USA
[2] Fordham Univ, Dept Chem, The Bronx, NY 10458 USA
来源
JOURNAL OF PHYSICAL CHEMISTRY B | 2023年 / 127卷 / 37期
基金
美国国家科学基金会;
关键词
2-PHASE SYSTEMS; PHASE-SEPARATION; PREDICTION; COEFFICIENTS;
D O I
10.1021/acs.jpcb.3c03789
中图分类号
O64 [物理化学(理论化学)、化学物理学];
学科分类号
070304 ; 081704 ;
摘要
Aqueous, two-phase systems (ATPSs) may form upon mixing two solutions of independently water-soluble compounds. Many separation, purification, and extraction processes rely on ATPSs. Predicting the miscibility of solutions can accelerate and reduce the cost of the discovery of new ATPSs for these applications. Whereas previous machine learning approaches to ATPS prediction used physicochemical properties of each solute as a descriptor, in this work, we show how to impute missing miscibility outcomes directly from an incomplete collection of pairwise miscibility experiments. We use graph-regularized logistic matrix factorization (GR-LMF) to learn a latent vector of each solution from (i) the observed entries in the pairwise miscibility matrix and (ii) a graph where each node is a solution and edges are relationships indicating the general category of the solute (i.e., polymer, surfactant, salt, protein). For an experimental data set of the pairwise miscibility of 68 solutions from Peacock et al. [ACS Appl. Mater. Interfaces 2021, 13, 11449-11460], we find that GR-LMF more accurately predicts missing (im)miscibility outcomes of pairs of solutions than ordinary logistic matrix factorization and random forest classifiers that use physicochemical features of the solutes. GR-LMF obviates the need for features of the solutions and solutions to impute missing miscibility outcomes, but it cannot predict the miscibility of a new solution without some observations of its miscibility with other solutions in the training data set.
引用
收藏
页码:7964 / 7973
页数:10
相关论文
共 50 条
  • [1] ORTHOGONAL GRAPH-REGULARIZED MATRIX FACTORIZATION AND ITS APPLICATION FOR RECOMMENDATION
    Zhu, Zhenfeng
    Xin, Peilu
    Wei, Shikui
    Zhao, Yao
    2013 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME 2013), 2013,
  • [2] Dual graph-regularized Constrained Nonnegative Matrix Factorization for Image Clustering
    Sun, Jing
    Cai, Xibiao
    Sun, Fuming
    Hong, Richang
    KSII TRANSACTIONS ON INTERNET AND INFORMATION SYSTEMS, 2017, 11 (05): : 2607 - 2627
  • [3] Orthogonal Dual Graph-Regularized Nonnegative Matrix Factorization for Co-Clustering
    Jiayi Tang
    Zhong Wan
    Journal of Scientific Computing, 2021, 87
  • [4] Predicting lncRNA-Protein Interaction With Weighted Graph-Regularized Matrix Factorization
    Sun, Xibo
    Cheng, Leiming
    Liu, Jinyang
    Xie, Cuinan
    Yang, Jiasheng
    Li, Fu
    FRONTIERS IN GENETICS, 2021, 12
  • [5] Orthogonal Dual Graph-Regularized Nonnegative Matrix Factorization for Co-Clustering
    Tang, Jiayi
    Wan, Zhong
    JOURNAL OF SCIENTIFIC COMPUTING, 2021, 87 (03)
  • [6] Total Variation Constrained Graph-Regularized Convex Non-Negative Matrix Factorization for Data Representation
    Tian, Miao
    Leng, Chengcai
    Wu, Haonan
    Basu, Anup
    IEEE SIGNAL PROCESSING LETTERS, 2021, 28 : 126 - 130
  • [7] Sparse Dual Graph-Regularized Deep Nonnegative Matrix Factorization for Image Clustering
    Guo, Weiyu
    IEEE ACCESS, 2021, 9 : 39926 - 39938
  • [8] Sparse robust graph-regularized non-negative matrix factorization based on correntropy
    Wang, Chuan-Yuan
    Gao, Ying-Lian
    Liu, Jin-Xing
    Dai, Ling-Yun
    Shang, Junliang
    JOURNAL OF BIOINFORMATICS AND COMPUTATIONAL BIOLOGY, 2021, 19 (01)
  • [9] Determining Functional Units of Tongue Motion via Graph-Regularized Sparse Non-negative Matrix Factorization
    Woo, Jonghye
    Xing, Fangxu
    Lee, Junghoon
    Stone, Maureen
    Prince, Jerry L.
    MEDICAL IMAGE COMPUTING AND COMPUTER-ASSISTED INTERVENTION - MICCAI 2014, PT II, 2014, 8674 : 146 - 153
  • [10] Orthogonal Graph-regularized Non-negative Matrix Factorization for Hyperspectral Image Clustering
    Tian, Long
    Du, Qian
    Kopriva, Ivica
    Younan, Nicolas
    2019 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM (IGARSS 2019), 2019, : 795 - 798