Synergy conformal prediction applied to large-scale bioactivity datasets and in federated learning

被引:0
|
作者
Norinder, Ulf [1 ,2 ,3 ,4 ]
Spjuth, Ola [1 ,2 ]
Svensson, Fredrik [5 ]
机构
[1] Uppsala Univ, Dept Pharmaceut Biosci, Box 591, SE-75124 Uppsala, Sweden
[2] Uppsala Univ, Sci Life Lab, Box 591, SE-75124 Uppsala, Sweden
[3] Stockholm Univ, Dept Comp & Syst Sci, Box 7003, S-16407 Kista, Sweden
[4] Orebro Univ, MTM Res Ctr, Sch Sci & Technol, S-70182 Orebro, Sweden
[5] UCL, Alzheimers Res UK UCL Drug Discovery Inst, Cruciform Bldg,Gower St, London WC1E 6BT, England
关键词
Conformal prediction; Federated learning; Confidence; Machine learning;
D O I
10.1186/s13321-021-00555-7
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
Confidence predictors can deliver predictions with the associated confidence required for decision making and can play an important role in drug discovery and toxicity predictions. In this work we investigate a recently introduced version of conformal prediction, synergy conformal prediction, focusing on the predictive performance when applied to bioactivity data. We compare the performance to other variants of conformal predictors for multiple partitioned datasets and demonstrate the utility of synergy conformal predictors for federated learning where data cannot be pooled in one location. Our results show that synergy conformal predictors based on training data randomly sampled with replacement can compete with other conformal setups, while using completely separate training sets often results in worse performance. However, in a federated setup where no method has access to all the data, synergy conformal prediction is shown to give promising results. Based on our study, we conclude that synergy conformal predictors are a valuable addition to the conformal prediction toolbox.
引用
收藏
页数:11
相关论文
共 50 条
  • [21] A fast diagonal distance metric learning approach for large-scale datasets
    Li, Tie
    Kou, Gang
    Peng, Yi
    Yu, Philip S.
    [J]. INFORMATION SCIENCES, 2021, 571 : 225 - 245
  • [22] Research on machine learning based processing strategies for large-scale datasets
    Yang, Longfei
    Zheng, Kai
    Xiao, Hui
    Yang, Zhiqiang
    Li, Shufang
    Fan, Lei
    [J]. Applied Mathematics and Nonlinear Sciences, 2024, 9 (01)
  • [23] Large-Scale Learning with Structural Kernels for Class-Imbalanced Datasets
    Severyn, Aliaksei
    Moschitti, Alessandro
    [J]. ETERNAL SYSTEMS, 2012, 255 : 34 - 41
  • [24] Harnessing Large-Scale Herbarium Image Datasets Through Representation Learning
    Walker, Barnaby E.
    Tucker, Allan
    Nicolson, Nicky
    [J]. FRONTIERS IN PLANT SCIENCE, 2022, 12
  • [25] Visualization of large-scale trajectory datasets
    Zachar, Gergely
    [J]. 2023 CYBER-PHYSICAL SYSTEMS AND INTERNET-OF-THINGS WEEK, CPS-IOT WEEK WORKSHOPS, 2023, : 152 - 157
  • [26] Large-Scale Machine Learning for Business Sector Prediction
    Angenent, Mitch N.
    Barata, Antonio Pereira
    Takes, Frank W.
    [J]. PROCEEDINGS OF THE 35TH ANNUAL ACM SYMPOSIUM ON APPLIED COMPUTING (SAC'20), 2020, : 1143 - 1146
  • [27] An Incremental Learning framework for Large-scale CTR Prediction
    Katsileros, Petros
    Mandilaras, Nikiforos
    Mallis, Dimitrios
    Pitsikalis, Vassilis
    Theodorakis, Stavros
    Chamiel, Gil
    [J]. PROCEEDINGS OF THE 16TH ACM CONFERENCE ON RECOMMENDER SYSTEMS, RECSYS 2022, 2022, : 490 - 493
  • [28] Ensemble learning for large-scale crowd flow prediction
    Karbovskii, Vladislav
    Lees, Michael
    Presbitero, Alva
    Kurilkin, Alexey
    Voloshin, Daniil
    Derevitskii, Ivan
    Karsakov, Andrey
    Sloot, Peter M. A.
    [J]. ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2021, 106
  • [29] Prediction of Intellectual Disability From Developmental Milestones in Large-Scale Autism Datasets
    Nadig, Ajay
    van der Merwe, Celia
    Robinson, Elise
    [J]. NEUROPSYCHOPHARMACOLOGY, 2021, 46 (SUPPL 1) : 135 - 135
  • [30] CFL: Cluster Federated Learning in Large-Scale Peer-to-Peer Networks
    Chen, Qian
    Wang, Zilong
    Zhou, Yilin
    Chen, Jiawei
    Xiao, Dan
    Lin, Xiaodong
    [J]. INFORMATION SECURITY, ISC 2022, 2022, 13640 : 464 - 472