Expert Selection in High-Dimensional Markov Decision Processes

Cited by: 0
Authors
Rubies-Royo, Vicenc [1]
Mazumdar, Eric [1]
Dong, Roy [1]
Tomlin, Claire [1]
Sastry, S. Shankar [1]
Affiliations
[1] Univ Calif Berkeley, Dept Elect Engn & Comp Sci, Berkeley, CA 94720 USA
Keywords
DOI
Not available
Chinese Library Classification
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
In this work, we present a multi-armed bandit framework for online expert selection in Markov decision processes and demonstrate its use in high-dimensional settings. Our method takes a set of candidate expert policies and switches between them to rapidly identify the best-performing expert, using a variant of the classical upper confidence bound algorithm, and thereby ensures low regret in the overall performance of the system. This is useful in applications where several expert policies are available and one must be selected at run-time for the underlying environment.
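As a rough illustration of the idea in the abstract, the sketch below applies the textbook UCB1 rule to choose among candidate experts. It is not the paper's variant, whose details are not given here. The names `ucb1_select` and `run_expert_selection`, the exploration constant `c`, and the Bernoulli "episode returns" standing in for running an expert policy in an MDP are all assumptions made for this example.

```python
import math
import random


def ucb1_select(counts, means, t, c=2.0):
    """Return the index of the expert with the highest UCB1 score.

    counts[i] is how often expert i has been run, means[i] its average
    return so far, and t the current round (1-based). c is an assumed
    exploration constant (c=2 gives the textbook UCB1 bonus).
    """
    # Run every expert once before comparing confidence bounds.
    for i, n in enumerate(counts):
        if n == 0:
            return i
    scores = [m + math.sqrt(c * math.log(t) / n) for m, n in zip(means, counts)]
    return max(range(len(scores)), key=scores.__getitem__)


def run_expert_selection(expert_returns, rounds, seed=0):
    """Switch between experts for `rounds` episodes using UCB1.

    expert_returns[i](rng) is a hypothetical stand-in for running expert
    policy i for one episode and observing a return in [0, 1].
    """
    rng = random.Random(seed)
    k = len(expert_returns)
    counts = [0] * k
    means = [0.0] * k
    for t in range(1, rounds + 1):
        i = ucb1_select(counts, means, t)
        reward = expert_returns[i](rng)
        counts[i] += 1
        # Incremental update of the running mean return for expert i.
        means[i] += (reward - means[i]) / counts[i]
    return counts, means


# Three hypothetical experts whose episode returns are Bernoulli with
# success probabilities 0.2, 0.5, and 0.8 (illustrative values only).
experts = [
    (lambda rng, p=p: 1.0 if rng.random() < p else 0.0)
    for p in (0.2, 0.5, 0.8)
]
counts, means = run_expert_selection(experts, rounds=2000)
best_expert = counts.index(max(counts))
```

In this toy setting, most episodes end up assigned to the expert with the highest average return, which is the behaviour that keeps regret low: weaker experts are run only often enough to rule them out.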
Pages: 3604-3610
Page count: 7