Coreset-based Conformal Prediction for Large-scale Learning

Cited by: 0
Authors
Riquelme-Granada, Nery [1]
Khuong An Nguyen [1]
Luo, Zhiyuan [1]
Affiliations
[1] Royal Holloway Univ London, Dept Comp Sci, Egham TW20 0EX, Surrey, England
Keywords
Coreset; logistic regression; importance sampling; conformal predictors; ALGORITHMS; SETS
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
As the volume of data increases rapidly, most traditional machine learning algorithms become computationally prohibitive. Furthermore, the available data can be so large that it easily exceeds a single machine's memory. We propose Coreset-Based Conformal Prediction, a strategy for dealing with big data by applying conformal predictors to a weighted summary of the data, namely the coreset. We compare our approach against standalone inductive conformal predictors on three large competition-grade datasets to demonstrate that our coreset-based strategy may not only significantly improve learning speed but also retain the validity of the predictions and the efficiency of the predictors.
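The strategy described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: it assumes one-dimensional inputs with binary labels, a placeholder importance score in place of a real coreset sensitivity bound, and a plain gradient-descent logistic regression. All function names (`sample_weighted_summary`, `fit_weighted_logistic`, `icp_p_values`, `prediction_set`, `make_data`) are hypothetical and chosen only for illustration.

```python
import math
import random


def sigmoid(z):
    """Numerically stable logistic function."""
    if z >= 0:
        return 1.0 / (1.0 + math.exp(-z))
    e = math.exp(z)
    return e / (1.0 + e)


def make_data(n, rng):
    """Synthetic 1-D data: class 0 centred at -2, class 1 centred at +2."""
    labels = [rng.randint(0, 1) for _ in range(n)]
    return [(rng.gauss(2.0 if y else -2.0, 1.0), y) for y in labels]


def sample_weighted_summary(data, m, rng):
    """Importance-sample m points, weighting each draw by 1 / (m * p_i).

    ASSUMPTION: the score 1 + |x - mean| is a placeholder for a real
    coreset sensitivity bound, which this sketch does not compute.
    """
    mean_x = sum(x for x, _ in data) / len(data)
    scores = [1.0 + abs(x - mean_x) for x, _ in data]
    total = sum(scores)
    probs = [s / total for s in scores]
    picks = rng.choices(range(len(data)), weights=probs, k=m)
    return [(data[i], 1.0 / (m * probs[i])) for i in picks]


def fit_weighted_logistic(summary, epochs=200, lr=0.5):
    """Fit a weighted 1-D logistic regression by batch gradient descent."""
    w, b = 0.0, 0.0
    total_weight = sum(wt for _, wt in summary)
    for _ in range(epochs):
        grad_w = grad_b = 0.0
        for (x, y), wt in summary:
            err = sigmoid(w * x + b) - y
            grad_w += wt * err * x
            grad_b += wt * err
        w -= lr * grad_w / total_weight
        b -= lr * grad_b / total_weight
    return w, b


def nonconformity(model, x, y):
    """One minus the model's probability for label y (higher = stranger)."""
    w, b = model
    p1 = sigmoid(w * x + b)
    return 1.0 - (p1 if y == 1 else 1.0 - p1)


def icp_p_values(model, calibration, x):
    """Inductive conformal p-value for each candidate label of x."""
    cal_scores = [nonconformity(model, cx, cy) for cx, cy in calibration]
    p_values = {}
    for label in (0, 1):
        a = nonconformity(model, x, label)
        at_least = sum(1 for s in cal_scores if s >= a)
        p_values[label] = (at_least + 1) / (len(cal_scores) + 1)
    return p_values


def prediction_set(p_values, epsilon):
    """Labels whose p-value exceeds the significance level epsilon."""
    return {label for label, p in p_values.items() if p > epsilon}


rng = random.Random(0)
data = make_data(5000, rng)
proper_train, calibration = data[:4000], data[4000:]
summary = sample_weighted_summary(proper_train, 200, rng)  # 200 of 4000 points
model = fit_weighted_logistic(summary)
print(prediction_set(icp_p_values(model, calibration, 1.5), epsilon=0.1))
```

In this sketch the underlying model is trained only on the 200-point weighted summary rather than on all 4000 proper-training points, while the calibration set is used unchanged to compute p-values; that split mirrors the inductive conformal setting named in the abstract, under the assumptions stated above.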
Pages: 21
Related papers
50 in total
  • [21] Geospatial learning for large-scale transport infrastructure depth prediction
    Zhang, Pengcheng
    Yi, Wen
    Song, Yongze
    Thomson, Giles
    Wu, Peng
    Aghamohammadi, Nasrin
    INTERNATIONAL JOURNAL OF APPLIED EARTH OBSERVATION AND GEOINFORMATION, 2024, 132
  • [22] Rich Punctuations Prediction Using Large-scale Deep Learning
    Wu, Xueyang
    Zhu, Su
    Wu, Yue
    Yu, Kai
    2016 10TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING (ISCSLP), 2016
  • [23] Large-scale conformal rigidity in dimension three
    Sylvain Maillot
    Mathematische Annalen, 2007, 337 : 613 - 630
  • [24] Hashing Based Prediction for Large-Scale Kernel Machine
    Lu, Lijing
    Yin, Rong
    Liu, Yong
    Wang, Weiping
    COMPUTATIONAL SCIENCE - ICCS 2020, PT II, 2020, 12138 : 496 - 509
  • [25] Large-scale testing of chemical shift prediction algorithms and improved learning-based approaches to shift prediction
    Arun, K
    Langmead, CJ
    2004 IEEE COMPUTATIONAL SYSTEMS BIOINFORMATICS CONFERENCE, PROCEEDINGS, 2004 : 712 - 713
  • [26] Transfer learning based hybrid model for power demand prediction of large-scale electric vehicles
    Tian, Chenlu
    Liu, Yechun
    Zhang, Guiqing
    Yang, Yalong
    Yan, Yi
    Li, Chengdong
    ENERGY, 2024, 300
  • [27] A large-scale microblog dataset and stock movement prediction based on Supervised Contrastive Learning model
    Yang, Song
    Tang, Daniel
    NEUROCOMPUTING, 2024, 584
  • [28] Traffic matrix prediction and estimation based on deep learning in large-scale IP backbone networks
    Nie, Laisen
    Jiang, Dingde
    Guo, Lei
    Yu, Shui
    JOURNAL OF NETWORK AND COMPUTER APPLICATIONS, 2016, 76 : 16 - 22
  • [29] GOLabeler: improving sequence-based large-scale protein function prediction by learning to rank
    You, Ronghui
    Zhang, Zihan
    Xiong, Yi
    Sun, Fengzhu
    Mamitsuka, Hiroshi
    Zhu, Shanfeng
    BIOINFORMATICS, 2018, 34 (14) : 2465 - 2473
  • [30] Deep Learning Method for RNA Secondary Structure Prediction with Pseudoknots Based on Large-Scale Data
    Shen, Bowen
    Zhang, Hao
    Li, Cong
    Zhao, Tianheng
    Liu, Yuanning
    JOURNAL OF HEALTHCARE ENGINEERING, 2021, 2021