Efficient Active Learning for Gaussian Process Classification by Error Reduction

被引：0

作者：

Zhao, Guang ^{[1
]}

Dougherty, Edward R. ^{[1
]}

Yoon, Byung-Jun ^{[1
,3
]}

Alexander, Francis J. ^{[3
]}

Qian, Xiaoning ^{[1
,2
,3
]}

机构：

[1] Texas A&M Univ, Dept Elect & Comp Engn, College Stn, TX 77843 USA

[2] Texas A&M Univ, Dept Comp Sci & Engn, College Stn, TX 77843 USA

[3] Brookhaven Natl Lab, Computat Sci Initiat, Upton, NY 11973 USA

来源：

ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021) | 2021年 / 34卷

基金：

美国国家科学基金会;

关键词：

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Active learning sequentially selects the best instance for labeling by optimizing an acquisition function to enhance data/label efficiency. The selection can be either from a discrete instance set (pool-based scenario) or a continuous instance space (query synthesis scenario). In this work, we study both active learning scenarios for Gaussian Process Classification (GPC). The existing active learning strategies that maximize the Estimated Error Reduction (EER) aim at reducing the classification error after training with the new acquired instance in a one-step-look-ahead manner. The computation of EER-based acquisition functions is typically prohibitive as it requires retraining the GPC with every new query. Moreover, as the EER is not smooth, it can not be combined with gradient-based optimization techniques to efficiently explore the continuous instance space for query synthesis. To overcome these critical limitations, we develop computationally efficient algorithms for EER-based active learning with GPC. We derive the joint predictive distribution of label pairs as a one-dimensional integral, as a result of which the computation of the acquisition function avoids retraining the GPC for each query, remarkably reducing the computational overhead. We also derive the gradient chain rule to efficiently calculate the gradient of the acquisition function, which leads to the first query synthesis active learning algorithm implementing EER-based strategies. Our experiments clearly demonstrate the computational efficiency of the proposed algorithms. We also benchmark our algorithms on both synthetic and real-world datasets, which show superior performance in terms of sampling efficiency compared to the existing state-of-the-art algorithms.

引用

页数：13

共 50 条

[1] Gaussian Process Classification and Active Learning with Multiple Annotators
Rodrigues, Filipe
Pereira, Francisco C.
Ribeiro, Bernardete
INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 32 (CYCLE 2), 2014, 32 : 433 - 441
[2] Active Learning With Gaussian Process Classifier for Hyperspectral Image Classification
Sun, Shujin
Zhong, Ping
Xiao, Huaitie
Wang, Runsheng
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2015, 53 (04): : 1746 - 1760
[3] Efficient approaches to Gaussian Process classification
Csató, L
Fokoué, E
Opper, M
Schottky, B
Winther, O
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 12, 2000, 12 : 251 - 257
[4] Efficient Seismic Fragility Assessment Through Active Learning and Gaussian Process Regression
Ning, Chunxiao
Xie, Yazhou
PROCEEDINGS OF THE CANADIAN SOCIETY FOR CIVIL ENGINEERING ANNUAL CONFERENCE 2023, VOL 10, CSCE 2023, 2024, 504 : 1 - 13
[5] LEARNING FILTERS IN GAUSSIAN PROCESS CLASSIFICATION PROBLEMS
Ruiz, Pablo
Mateos, Javier
Molina, Rafael
Katsaggelos, Aggelos K.
2014 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2014, : 2913 - 2917
[6] Active Learning for Deep Gaussian Process Surrogates
Sauer, Annie
Gramacy, Robert B.
Higdon, David
TECHNOMETRICS, 2023, 65 (01) : 4 - 18
[7] The error bar estimation for the soft classification with Gaussian process models
Gao, Junbin
Zhang, Lei
APPLIED SOFT COMPUTING TECHNOLOGIES: THE CHALLENGE OF COMPLEXITY, 2006, 34 : 675 - 684
[8] Scalable Active Learning by Approximated Error Reduction
Fu, Weijie
Wang, Meng
Hao, Shijie
Wu, Xindong
KDD'18: PROCEEDINGS OF THE 24TH ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY & DATA MINING, 2018, : 1396 - 1405
[9] Efficient TMS-Based Motor Cortex Mapping Using Gaussian Process Active Learning
Faghihpirayesh, Razieh
Yarossi, Mathew
Imbiriba, Tales
Brooks, Dana H.
Tunik, Eugene
Erdogmus, Deniz
IEEE TRANSACTIONS ON NEURAL SYSTEMS AND REHABILITATION ENGINEERING, 2021, 29 : 1679 - 1689
[10] PAC-Bayesian generalisation error bounds for Gaussian process classification
Seeger, M
JOURNAL OF MACHINE LEARNING RESEARCH, 2003, 3 (02) : 233 - 269

← 1 2 3 4 5 →