Multi-task learning for pKa prediction

被引:0
|
作者
Grigorios Skolidis
Katja Hansen
Guido Sanguinetti
Matthias Rupp
机构
[1] University College London,Department of Statistical Science
[2] Fritz Haber Institute of the Max Planck Society,Theory Department
[3] University of Edinburgh,School of Informatics
[4] Machine Learning Group,undefined
[5] TU Berlin,undefined
[6] Institute of Pharmaceutical Sciences,undefined
[7] ETH Zurich,undefined
关键词
pK; prediction; Multi-task learning; Quantitative structure–property relationships; Gaussian processes;
D O I
暂无
中图分类号
学科分类号
摘要
Many compound properties depend directly on the dissociation constants of its acidic and basic groups. Significant effort has been invested in computational models to predict these constants. For linear regression models, compounds are often divided into chemically motivated classes, with a separate model for each class. However, sometimes too few measurements are available for a class to build a reasonable model, e.g., when investigating a new compound series. If data for related classes are available, we show that multi-task learning can be used to improve predictions by utilizing data from these other classes. We investigate performance of linear Gaussian process regression models (single task, pooling, and multi-task models) in the low sample size regime, using a published data set (n = 698, mostly monoprotic, in aqueous solution) divided beforehand into 15 classes. A multi-task regression model using the intrinsic model of co-regionalization and incomplete Cholesky decomposition performed best in 85 % of all experiments. The presented approach can be applied to estimate other molecular properties where few measurements are available.
引用
收藏
页码:883 / 895
页数:12
相关论文
共 50 条
  • [1] Multi-task learning for pKa prediction
    Skolidis, Grigorios
    Hansen, Katja
    Sanguinetti, Guido
    Rupp, Matthias
    [J]. JOURNAL OF COMPUTER-AIDED MOLECULAR DESIGN, 2012, 26 (07) : 883 - 895
  • [2] Multi-task gradient descent for multi-task learning
    Bai, Lu
    Ong, Yew-Soon
    He, Tiantian
    Gupta, Abhishek
    [J]. MEMETIC COMPUTING, 2020, 12 (04) : 355 - 369
  • [3] Multi-task gradient descent for multi-task learning
    Lu Bai
    Yew-Soon Ong
    Tiantian He
    Abhishek Gupta
    [J]. Memetic Computing, 2020, 12 : 355 - 369
  • [4] Structured Multi-task Learning for Molecular Property Prediction
    Liu, Shengchao
    Qu, Meng
    Zhang, Zuobai
    Cai, Huiyu
    Tang, Jian
    [J]. INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 151, 2022, 151
  • [5] Water Quality Prediction Based on Multi-Task Learning
    Wu, Huan
    Cheng, Shuiping
    Xin, Kunlun
    Ma, Nian
    Chen, Jie
    Tao, Liang
    Gao, Min
    [J]. INTERNATIONAL JOURNAL OF ENVIRONMENTAL RESEARCH AND PUBLIC HEALTH, 2022, 19 (15)
  • [6] Multi-Task Learning for Dense Prediction Tasks: A Survey
    Vandenhende, Simon
    Georgoulis, Stamatios
    Van Gansbeke, Wouter
    Proesmans, Marc
    Dai, Dengxin
    Van Gool, Luc
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2022, 44 (07) : 3614 - 3633
  • [7] Multi-Task Learning with Knowledge Distillation for Dense Prediction
    Xu, Yangyang
    Yang, Yibo
    Zhang, Lefei
    [J]. 2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 21493 - 21502
  • [8] Enhancement of acute toxicity prediction by multi-task learning
    Sosnin, Sergey
    Karlov, Dmitry
    Tetko, Igor
    Fedorov, Maxim
    [J]. ABSTRACTS OF PAPERS OF THE AMERICAN CHEMICAL SOCIETY, 2019, 257
  • [9] Situation Aware Multi-Task Learning for Traffic Prediction
    Deng, Dingxiong
    Shahabi, Cyrus
    Demiryurek, Ugur
    Zhu, Linhong
    [J]. 2017 17TH IEEE INTERNATIONAL CONFERENCE ON DATA MINING (ICDM), 2017, : 81 - 90
  • [10] Deep Multi-task Learning for Air Quality Prediction
    Wang, Bin
    Yan, Zheng
    Lu, Jie
    Zhang, Guangquan
    Li, Tianrui
    [J]. NEURAL INFORMATION PROCESSING (ICONIP 2018), PT V, 2018, 11305 : 93 - 103