Coreset-based Conformal Prediction for Large-scale Learning

被引:0
|
作者
Riquelme-Granada, Nery [1 ]
Khuong An Nguyen [1 ]
Luo, Zhiyuan [1 ]
机构
[1] Royal Holloway Univ London, Dept Comp Sci, Egham TW20 0EX, Surrey, England
关键词
Coreset; logistic regression; importance sampling; conformal predictors; ALGORITHMS; SETS;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
As the volume of data increase rapidly, most traditional machine learning algorithms become computationally prohibitive. Furthermore, the available data can be so big that a single machine's memory can easily be overflown. We propose Coreset-Based Conformal Prediction, a strategy for dealing with big data by applying conformal predictors to a weighted summary of data - namely the coreset. We compare our approach against standalone inductive conformal predictors over three large competition-grade datasets to demonstrate that our coreset-based strategy may not only significantly improve the learning speed, but also retains predictions validity and the predictors' efficiency.
引用
收藏
页数:21
相关论文
共 50 条
  • [41] Machine-learning-based multiple abnormality prediction with large-scale chest computed tomography volumes
    Draelos, Rachel Lea
    Dov, David
    Mazurowski, Maciej A.
    Lo, Joseph Y.
    Henao, Ricardo
    Rubin, Geoffrey D.
    Carin, Lawrence
    MEDICAL IMAGE ANALYSIS, 2021, 67
  • [42] Deep learning-based coagulant dosage prediction for extreme events leveraging large-scale data
    Kim, Jiwoong
    Hua, Chuanbo
    Lin, Subin
    Kang, Seoktae
    Kang, Joo-Hyon
    Park, Mi-Hyun
    JOURNAL OF WATER PROCESS ENGINEERING, 2024, 66
  • [43] The Prediction Method of Large-Scale Land Subsidence Based on Multi-Temporal InSAR and Machine Learning
    Luo H.
    Xu Q.
    Jiang Y.
    Meng R.
    Pu C.
    Diqiu Kexue - Zhongguo Dizhi Daxue Xuebao/Earth Science - Journal of China University of Geosciences, 2024, 49 (05): : 1736 - 1745
  • [44] Kernel-Based Autoencoders for Large-Scale Representation Learning
    Bao, Jinzhou
    Zhao, Bo
    Guo, Ping
    2021 7TH INTERNATIONAL CONFERENCE ON ROBOTICS AND ARTIFICIAL INTELLIGENCE, ICRAI 2021, 2021, : 112 - 117
  • [45] Large-scale asynchronous distributed learning based on parameter exchanges
    Joshi, Bikash
    Iutzeler, Franck
    Amini, Massih-Reza
    INTERNATIONAL JOURNAL OF DATA SCIENCE AND ANALYTICS, 2018, 5 (04) : 223 - 232
  • [46] Large-scale comparison of machine learning algorithms for target prediction of natural products
    Liang, Lu
    Liu, Ye
    Kang, Bo
    Wang, Ru
    Sun, Meng-Yu
    Wu, Qi
    Meng, Xiang-Fei
    Lin, Jian-Ping
    BRIEFINGS IN BIOINFORMATICS, 2022, 23 (05)
  • [47] Large-scale comparison of machine learning methods for profiling prediction of kinase inhibitors
    Jiangxia Wu
    Yihao Chen
    Jingxing Wu
    Duancheng Zhao
    Jindi Huang
    MuJie Lin
    Ling Wang
    Journal of Cheminformatics, 16
  • [48] Large-scale comparison of machine learning methods for drug target prediction on ChEMBL
    Mayr, Andreas
    Klambauer, Guenter
    Unterthiner, Thomas
    Steijaert, Marvin
    Wegner, Jorg K.
    Ceulemans, Hugo
    Clevert, Djork-Arne
    Hochreiter, Sepp
    CHEMICAL SCIENCE, 2018, 9 (24) : 5441 - 5451
  • [49] A study of dynamic meta-learning for failure prediction in large-scale systems
    Lan, Zhiling
    Gu, Jiexing
    Zheng, Ziming
    Thakur, Rajeev
    Coghlan, Susan
    JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 2010, 70 (06) : 630 - 643
  • [50] Hybrid deep learning models for traffic prediction in large-scale road networks
    Zheng, Ge
    Chai, Wei Koong
    Duanmu, Jing-Lin
    Katos, Vasilis
    INFORMATION FUSION, 2023, 92 : 93 - 114