Online Kernel Selection via Grouped Adversarial Bandit Model

Authors
Li, Junfan [1]
Liao, Shizhong [1]
Affiliations
[1] Tianjin Univ, Coll Intelligence & Comp, Tianjin 300350, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
online kernel selection; sequential decision; group bandit model; regret; time complexity
DOI
10.1109/ICTAI.2019.00100
Chinese Library Classification
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
We study kernel selection for online kernel learning, also known as online kernel selection. It can be treated as a sequential decision problem and must therefore balance regret against time complexity. Existing approaches based on expert advice or the classical adversarial bandit model do not achieve this balance. In this work, we propose a novel grouped adversarial bandit solution. We first map each candidate kernel to a basic arm of an adversarial bandit problem. The kernels are then divided into several groups, each of which is abstracted as a super arm. At each round, we choose a super arm and a basic kernel within it, and make a prediction with an online kernel learning algorithm. In addition, we introduce a Bernoulli random variable that decides whether to also choose all of the remaining super arms. Theoretical analysis shows that the proposed approach balances regret and time complexity explicitly: it can enjoy better pseudo-regret and high-probability regret bounds than the classical adversarial bandit model, and lower time complexity than the expert advice model. Experimental results on benchmark datasets verify that the proposed approach achieves a better balance between efficiency and effectiveness.
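The selection step described in the abstract can be sketched roughly as follows. This is a minimal illustration, not the authors' algorithm: the abstract does not give the sampling distributions or update rules, so the Exp3-style exponential weights, the importance-weighted loss estimates, the one-kernel-per-extra-group choice, and all names (`sample`, `select`, `update`, `delta`, `eta`, the kernel labels) are assumptions made for this example.

```python
import math
import random


def sample(weights, rng):
    """Draw an index with probability proportional to its weight."""
    threshold = rng.random() * sum(weights)
    acc = 0.0
    for i, w in enumerate(weights):
        acc += w
        if threshold < acc:
            return i
    return len(weights) - 1  # guard against floating-point round-off


def select(group_w, kernel_w, delta, rng):
    """Pick (group, kernel) pairs for one round.

    One super arm is always sampled. With probability delta (the Bernoulli
    variable) one kernel from every remaining super arm is sampled as well.
    Sampling a single kernel per extra group is an assumption of this sketch.
    """
    g = sample(group_w, rng)
    chosen = [(g, sample(kernel_w[g], rng))]
    if rng.random() < delta:
        for h in range(len(group_w)):
            if h != g:
                chosen.append((h, sample(kernel_w[h], rng)))
    return chosen


def update(chosen, losses, group_w, kernel_w, delta, eta):
    """Assumed Exp3-style update with importance-weighted loss estimates."""
    group_total = sum(group_w)  # normaliser taken before any weight changes
    for (g, k), loss in zip(chosen, losses):
        p_group = group_w[g] / group_total
        # Probability that group g was evaluated in this round.
        p_seen = p_group + (1.0 - p_group) * delta
        q_kernel = kernel_w[g][k] / sum(kernel_w[g])
        kernel_w[g][k] *= math.exp(-eta * loss / (p_seen * q_kernel))
        group_w[g] *= math.exp(-eta * loss / p_seen)


if __name__ == "__main__":
    # Hypothetical candidate kernels, split into two groups (super arms).
    groups = [["rbf_0.1", "rbf_1", "rbf_10"], ["poly_2", "poly_3"]]
    group_w = [1.0 for _ in groups]
    kernel_w = [[1.0 for _ in grp] for grp in groups]
    rng = random.Random(0)
    for _ in range(100):
        chosen = select(group_w, kernel_w, delta=0.2, rng=rng)
        # Placeholder losses in [0, 1]; a real run would use the loss of an
        # online kernel learner trained with each chosen kernel.
        losses = [rng.random() for _ in chosen]
        update(chosen, losses, group_w, kernel_w, delta=0.2, eta=0.05)
    print(group_w)
```

In this sketch, `delta` is the knob that trades time for information: `delta = 0` evaluates a single kernel per round, as in a plain bandit, while `delta = 1` evaluates one kernel from every group in every round.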
Pages: 682-689
Number of pages: 8
Related papers
50 in total
  • [1] An Online Kernel Selection Wrapper via Multi-Armed Bandit Model
    Li, Junfan
    Liao, Shizhong
    2018 24TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2018, : 1307 - 1312
  • [2] Improved Regret Bounds for Online Kernel Selection Under Bandit Feedback
    Li, Junfan
    Liao, Shizhong
    MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, ECML PKDD 2022, PT IV, 2023, 13716 : 333 - 348
  • [3] Online Kernel Selection with Multiple Bandit Feedbacks in Random Feature Space
    Li, Junfan
    Liao, Shizhong
    KNOWLEDGE SCIENCE, ENGINEERING AND MANAGEMENT, KSEM 2018, PT II, 2018, 11062 : 301 - 312
  • [4] Online Vehicle Selection for Task Replication via Bandit Learning
    Qian, Yongfeng
    Zuo, Zhoutong
    Hao, Yixue
    2021 IEEE 45TH ANNUAL COMPUTERS, SOFTWARE, AND APPLICATIONS CONFERENCE (COMPSAC 2021), 2021, : 1627 - 1632
  • [5] Online Kernel Selection via Incremental Sketched Kernel Alignment
    Zhang, Xiao
    Liao, Shizhong
    PROCEEDINGS OF THE TWENTY-SEVENTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2018, : 3118 - 3124
  • [6] Online Kernel Selection via Tensor Sketching
    Liao, Shizhong
    Zhang, Xiao
    PROCEEDINGS OF THE 28TH ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT (CIKM '19), 2019, : 801 - 810
  • [7] Relay Selection with No Side Information: An Adversarial Bandit Approach
    Maghsudi, Setareh
    Stanczak, Slawomir
    2013 IEEE WIRELESS COMMUNICATIONS AND NETWORKING CONFERENCE (WCNC), 2013, : 715 - 720
  • [8] A survey on online kernel selection for online kernel learning
    Zhang, Xiao
    Liao, Yun
    Liao, Shizhong
    WILEY INTERDISCIPLINARY REVIEWS-DATA MINING AND KNOWLEDGE DISCOVERY, 2019, 9 (02)
  • [9] An Online Minimax Optimal Algorithm for Adversarial Multiarmed Bandit Problem
    Gokcesu, Kaan
    Kozat, Suleyman Serdar
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2018, 29 (11) : 5565 - 5580
  • [10] Robust Regression via Online Feature Selection under Adversarial Data Corruption
    Zhang, Xuchao
    Lei, Shuo
    Zhao, Liang
    Boedihardjo, Arnold P.
    Lu, Chang-Tien
    2018 IEEE INTERNATIONAL CONFERENCE ON DATA MINING (ICDM), 2018, : 1440 - 1445