Efficient Active Learning for Gaussian Process Classification by Error Reduction

被引:0
|
作者
Zhao, Guang [1 ]
Dougherty, Edward R. [1 ]
Yoon, Byung-Jun [1 ,3 ]
Alexander, Francis J. [3 ]
Qian, Xiaoning [1 ,2 ,3 ]
机构
[1] Texas A&M Univ, Dept Elect & Comp Engn, College Stn, TX 77843 USA
[2] Texas A&M Univ, Dept Comp Sci & Engn, College Stn, TX 77843 USA
[3] Brookhaven Natl Lab, Computat Sci Initiat, Upton, NY 11973 USA
基金
美国国家科学基金会;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Active learning sequentially selects the best instance for labeling by optimizing an acquisition function to enhance data/label efficiency. The selection can be either from a discrete instance set (pool-based scenario) or a continuous instance space (query synthesis scenario). In this work, we study both active learning scenarios for Gaussian Process Classification (GPC). The existing active learning strategies that maximize the Estimated Error Reduction (EER) aim at reducing the classification error after training with the new acquired instance in a one-step-look-ahead manner. The computation of EER-based acquisition functions is typically prohibitive as it requires retraining the GPC with every new query. Moreover, as the EER is not smooth, it can not be combined with gradient-based optimization techniques to efficiently explore the continuous instance space for query synthesis. To overcome these critical limitations, we develop computationally efficient algorithms for EER-based active learning with GPC. We derive the joint predictive distribution of label pairs as a one-dimensional integral, as a result of which the computation of the acquisition function avoids retraining the GPC for each query, remarkably reducing the computational overhead. We also derive the gradient chain rule to efficiently calculate the gradient of the acquisition function, which leads to the first query synthesis active learning algorithm implementing EER-based strategies. Our experiments clearly demonstrate the computational efficiency of the proposed algorithms. We also benchmark our algorithms on both synthetic and real-world datasets, which show superior performance in terms of sampling efficiency compared to the existing state-of-the-art algorithms.
引用
收藏
页数:13
相关论文
共 50 条
  • [1] Gaussian Process Classification and Active Learning with Multiple Annotators
    Rodrigues, Filipe
    Pereira, Francisco C.
    Ribeiro, Bernardete
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 32 (CYCLE 2), 2014, 32 : 433 - 441
  • [2] Active Learning With Gaussian Process Classifier for Hyperspectral Image Classification
    Sun, Shujin
    Zhong, Ping
    Xiao, Huaitie
    Wang, Runsheng
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2015, 53 (04): : 1746 - 1760
  • [3] Efficient approaches to Gaussian Process classification
    Csató, L
    Fokoué, E
    Opper, M
    Schottky, B
    Winther, O
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 12, 2000, 12 : 251 - 257
  • [4] Efficient Seismic Fragility Assessment Through Active Learning and Gaussian Process Regression
    Ning, Chunxiao
    Xie, Yazhou
    PROCEEDINGS OF THE CANADIAN SOCIETY FOR CIVIL ENGINEERING ANNUAL CONFERENCE 2023, VOL 10, CSCE 2023, 2024, 504 : 1 - 13
  • [5] LEARNING FILTERS IN GAUSSIAN PROCESS CLASSIFICATION PROBLEMS
    Ruiz, Pablo
    Mateos, Javier
    Molina, Rafael
    Katsaggelos, Aggelos K.
    2014 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2014, : 2913 - 2917
  • [6] Active Learning for Deep Gaussian Process Surrogates
    Sauer, Annie
    Gramacy, Robert B.
    Higdon, David
    TECHNOMETRICS, 2023, 65 (01) : 4 - 18
  • [7] The error bar estimation for the soft classification with Gaussian process models
    Gao, Junbin
    Zhang, Lei
    APPLIED SOFT COMPUTING TECHNOLOGIES: THE CHALLENGE OF COMPLEXITY, 2006, 34 : 675 - 684
  • [8] Scalable Active Learning by Approximated Error Reduction
    Fu, Weijie
    Wang, Meng
    Hao, Shijie
    Wu, Xindong
    KDD'18: PROCEEDINGS OF THE 24TH ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY & DATA MINING, 2018, : 1396 - 1405
  • [9] Efficient TMS-Based Motor Cortex Mapping Using Gaussian Process Active Learning
    Faghihpirayesh, Razieh
    Yarossi, Mathew
    Imbiriba, Tales
    Brooks, Dana H.
    Tunik, Eugene
    Erdogmus, Deniz
    IEEE TRANSACTIONS ON NEURAL SYSTEMS AND REHABILITATION ENGINEERING, 2021, 29 : 1679 - 1689
  • [10] PAC-Bayesian generalisation error bounds for Gaussian process classification
    Seeger, M
    JOURNAL OF MACHINE LEARNING RESEARCH, 2003, 3 (02) : 233 - 269