Weighted Gaussian Process Bandits for Non-stationary Environments

Citations: 0
Authors
Deng, Yuntian [1 ]
Zhou, Xingyu [2 ]
Kim, Baekjin [3 ]
Tewari, Ambuj [3 ]
Gupta, Abhishek [1 ]
Shroff, Ness [1 ]
Affiliations
[1] Ohio State Univ, Columbus, OH 43210 USA
[2] Wayne State Univ, Detroit, MI 48202 USA
[3] Univ Michigan, Ann Arbor, MI 48109 USA
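Keywords
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract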
In this paper, we consider the Gaussian process (GP) bandit optimization problem in a non-stationary environment. To capture external changes, the black-box function is allowed to be time-varying within a reproducing kernel Hilbert space (RKHS). To this end, we develop WGP-UCB, a novel UCB-type algorithm based on weighted Gaussian process regression. A key challenge is how to cope with infinite-dimensional feature maps. To that end, we leverage kernel approximation techniques to prove a sublinear regret bound, which is the first (frequentist) sublinear regret guarantee on weighted time-varying bandits with general nonlinear rewards. This result generalizes both non-stationary linear bandits and standard GP-UCB algorithms. Further, a novel concentration inequality is achieved for weighted Gaussian process regression with general weights. We also provide universal upper bounds and weight-dependent upper bounds for weighted maximum information gains. These results are of independent interest for applications such as news ranking and adaptive pricing, where weights can be adopted to capture the importance or quality of data. Finally, we conduct experiments to highlight the favorable gains of the proposed algorithm in many cases when compared to existing methods.
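The abstract does not spell out the WGP-UCB estimator, so the following is only a generic illustration of weighting observations in GP regression and then choosing a point by an upper confidence bound. It assumes an RBF kernel, exponential discount weights, a regularizer `lam`, and a fixed exploration parameter `beta`; all of these names and values are hypothetical and not taken from the paper. The variance used here is the standard GP variance with per-sample noise `lam / w`, which may differ from the form the paper analyzes.

```python
import numpy as np


def rbf_kernel(A, B, lengthscale=0.2):
    """RBF kernel matrix between rows of A (n, d) and rows of B (m, d)."""
    d2 = ((A[:, None, :] - B[None, :, :]) ** 2).sum(-1)
    return np.exp(-0.5 * d2 / lengthscale**2)


def discount_weights(n, gamma=0.9):
    """Hypothetical weights: the newest of n observations gets weight 1,
    older ones are discounted geometrically by gamma."""
    return gamma ** np.arange(n - 1, -1, -1)


def weighted_gp_posterior(X, y, w, X_query, lam=1.0, lengthscale=0.2):
    """Posterior mean and variance of a weighted kernel ridge / GP fit.

    The mean minimizes sum_s w_s (y_s - f(x_s))^2 + lam * ||f||^2,
    which gives f(x) = k(x)^T (K + lam * W^{-1})^{-1} y with W = diag(w).
    """
    K = rbf_kernel(X, X, lengthscale)
    A = K + lam * np.diag(1.0 / w)
    k_q = rbf_kernel(X, X_query, lengthscale)      # shape (n, m)
    sol = np.linalg.solve(A, k_q)                  # A^{-1} k(x), shape (n, m)
    mean = sol.T @ y
    var = 1.0 - np.sum(k_q * sol, axis=0)          # k(x, x) = 1 for RBF
    return mean, np.maximum(var, 0.0)


def ucb_choice(X, y, w, candidates, beta=2.0, lam=1.0, lengthscale=0.2):
    """Index of the candidate maximizing mean + beta * std."""
    mean, var = weighted_gp_posterior(X, y, w, candidates, lam, lengthscale)
    return int(np.argmax(mean + beta * np.sqrt(var)))
```

With weights close to zero on old observations, the fit tracks the most recent rewards, which is the intuition behind using weights in a time-varying setting; uniform weights recover ordinary GP regression with noise level `lam`.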
Pages: 24
Related papers
50 in total
  • [41] Online Nonstationary and Nonlinear Bandits with Recursive Weighted Gaussian Process
    Miyake, Yusuke
    Watanabe, Ryuji
    Mine, Tsunenori
    2024 IEEE 48TH ANNUAL COMPUTERS, SOFTWARE, AND APPLICATIONS CONFERENCE, COMPSAC 2024, 2024, : 11 - 20
  • [42] Non-stationary Gaussian models with physical barriers
    Bakka, Haakon
    Vanhatalo, Jarno
    Illian, Janine B.
    Simpson, Daniel
    Rue, Havard
    SPATIAL STATISTICS, 2019, 29 : 268 - 288
  • [43] A class of models for non-stationary Gaussian processes
    Grigoriu, M
    PROBABILISTIC ENGINEERING MECHANICS, 2003, 18 (03) : 203 - 213
  • [44] Non-stationary data reorganization for weighted wind turbine icing monitoring with Gaussian mixture model
    Jing, Hua
    Zhao, Chunhui
    Gao, Furong
    COMPUTERS & CHEMICAL ENGINEERING, 2021, 147
  • [45] Optimal and Efficient Dynamic Regret Algorithms for Non-Stationary Dueling Bandits
    Saha, Aadirupa
    Gupta, Shubham
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 162, 2022, : 19027 - 19049
  • [46] Non-stationary Continuum-armed Bandits for Online Hyperparameter Optimization
    Lu, Shiyin
    Zhou, Yu-Hang
    Shi, Jing-Cheng
    Zhu, Wenya
    Yu, Qingtao
    Chen, Qing-Guo
    Da, Qing
    Zhang, Lijun
    WSDM'22: PROCEEDINGS OF THE FIFTEENTH ACM INTERNATIONAL CONFERENCE ON WEB SEARCH AND DATA MINING, 2022, : 618 - 627
  • [47] An Ensemble Method for Incremental Classification in Stationary and Non-stationary Environments
    Nanculef, Ricardo
    Lopez, Erick
    Allende, Hector
    Allende-Cid, Hector
    PROGRESS IN PATTERN RECOGNITION, IMAGE ANALYSIS, COMPUTER VISION, AND APPLICATIONS, 2011, 7042 : 541 - 548
  • [48] META-GRADIENTS IN NON-STATIONARY ENVIRONMENTS
    Luketina, Jelena
    Flennerhag, Sebastian
    Schroecker, Yannick
    Abel, David
    Zahavy, Tom
    Singh, Satinder
    CONFERENCE ON LIFELONG LEARNING AGENTS, VOL 199, 2022, 199
  • [49] Speech recognition in non-stationary adverse environments
    Wang, ZH
    Kenny, P
    PROCEEDINGS OF THE 1998 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-6, 1998, : 265 - 268
  • [50] Evolutionary Multiobjective Optimization in Non-Stationary Environments
    Aragon, Victoria
    Esquivel, Susana
    Coello Coello, Carlos A.
    JOURNAL OF COMPUTER SCIENCE & TECHNOLOGY, 2005, 5 (03) : 133 - 143