Online Kernel Selection via Grouped Adversarial Bandit Model

被引：0

作者：

Li, Junfan ^{[1
]}

Liao, Shizhong ^{[1
]}

机构：

[1] Tianjin Univ, Coll Intelligence & Comp, Tianjin 300350, Peoples R China

来源：

2019 IEEE 31ST INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE (ICTAI 2019) | 2019年

基金：

中国国家自然科学基金;

关键词：

online kernel selection; sequential decision; group bandit model; regret; time complexity; REGRET;

D O I：

10.1109/ICTAI.2019.00100

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

We study kernel selection for online kernel learning, also known as online kernel selection which can be treated as a sequential decision problem and thus must balance the regret and the time complexity. Existing online kernel selection approaches via expert advice and classical adversarial bandit model can not meet the issue. In this work, we propose a novel grouped adversarial bandit solution to the problem. We first correspond each candidate kernel to a basic arm of an adversarial bandit problem. Then, all of the kernels are divided into several groups where each group is abstracted as a super arm. At each round, we choose a super arm and a basic kernel within the selected super arm, and make prediction by an online kernel learning algorithm. Besides, we introduce a Bernoulli random variable to decide whether to choose all of the rest super arms. Theoretical analysis shows the proposed approach balances the regret and the time complexity explicitly, which could enjoy better pseudo-regret and high probability regret bound than classical adversarial bandit model and lighter time complexity than expert advice model. Experimental results on benchmark datasets verify that the proposed approach balances the efficiency and effectiveness better.

引用

页码：682 / 689

页数：8

共 50 条

[1] An Online Kernel Selection Wrapper via Multi-Armed Bandit Model
Li, Junfan
Liao, Shizhong
2018 24TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2018, : 1307 - 1312
[2] Improved Regret Bounds for Online Kernel Selection Under Bandit Feedback
Li, Junfan
Liao, Shizhong
MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, ECML PKDD 2022, PT IV, 2023, 13716 : 333 - 348
[3] Online Kernel Selection with Multiple Bandit Feedbacks in Random Feature Space
Li, Junfan
Liao, Shizhong
KNOWLEDGE SCIENCE, ENGINEERING AND MANAGEMENT, KSEM 2018, PT II, 2018, 11062 : 301 - 312
[4] Online Vehicle Selection for Task Replication via Bandit Learning
Qian, Yongfeng
Zuo, Zhoutong
Hao, Yixue
2021 IEEE 45TH ANNUAL COMPUTERS, SOFTWARE, AND APPLICATIONS CONFERENCE (COMPSAC 2021), 2021, : 1627 - 1632
[5] Online Kernel Selection via Incremental Sketched Kernel Alignment
Zhang, Xiao
Liao, Shizhong
PROCEEDINGS OF THE TWENTY-SEVENTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2018, : 3118 - 3124
[6] Online Kernel Selection via Tensor Sketching
Liao, Shizhong
Zhang, Xiao
PROCEEDINGS OF THE 28TH ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT (CIKM '19), 2019, : 801 - 810
[7] Relay Selection with No Side Information: An Adversarial Bandit Approach
Maghsudi, Setareh
Stanczak, Slawomir
2013 IEEE WIRELESS COMMUNICATIONS AND NETWORKING CONFERENCE (WCNC), 2013, : 715 - 720
[8] A survey on online kernel selection for online kernel learning
Zhang, Xiao
Liao, Yun
Liao, Shizhong
WILEY INTERDISCIPLINARY REVIEWS-DATA MINING AND KNOWLEDGE DISCOVERY, 2019, 9 (02)
[9] An Online Minimax Optimal Algorithm for Adversarial Multiarmed Bandit Problem
Gokcesu, Kaan
Kozat, Suleyman Serdar
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2018, 29 (11) : 5565 - 5580
[10] Robust Regression via Online Feature Selection under Adversarial Data Corruption
Zhang, Xuchao
Lei, Shuo
Zhao, Liang
Boedihardjo, Arnold P.
Lu, Chang-Tien
2018 IEEE INTERNATIONAL CONFERENCE ON DATA MINING (ICDM), 2018, : 1440 - 1445

← 1 2 3 4 5 →