ML-based Arm Recommendation in Short-Horizon MABs

被引:0
|
作者
Zipori, Or [1 ]
Sarne, David [1 ]
机构
[1] Bar Ilan Univ, Ramat Gan, Israel
关键词
HAI experimental methods; human-virtual agent interaction; Multi Armed Bandit; Machine learning; Monte-Carlo Simulation; Recommender Agents;
D O I
10.1145/3472307.3484673
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
In many settings where an agent needs to suggest or recommend a course of action to its user, the agent's goal may not fully align with the user's goal. In particular, the agent may maximize its benefit if the user chooses specific alternatives that are not necessarily the ones that maximize her own individual benefit. In this paper we study such setting in the context of providing advice in two-armed bandit problems. We explore a potential strategy for the agent aiming to influence the arm to be picked. In particular we focus on a somehow naive recommendation strategy that always recommend the preferred arm and a strategy that recommends based on various Machine Learning models that aim to guide the decision regarding when to switch to the agent's least preferred arm. Based on extensive evaluation we find that both recommendation strategies results in better performance compared to not making any recommendation, and that the naive recommendation strategy performs slightly better than the ML-based recommendations, despite using a substantial amount of training data for the latter.
引用
收藏
页码:377 / 381
页数:5
相关论文
共 50 条
  • [41] On ML-Based Program Translation: Perils and Promises
    Malyala, Aniketh
    Zhou, Katelyn
    Ray, Baishakhi
    Chakraborty, Saikat
    2023 IEEE/ACM 45TH INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING-NEW IDEAS AND EMERGING RESULTS, ICSE-NIER, 2023, : 60 - 65
  • [42] Application of four machine-learning methods to predict short-horizon wind energy
    Bouabdallaoui, Doha
    Haidi, Touria
    Elmariami, Faissal
    Derri, Mounir
    Mellouli, El Mehdi
    GLOBAL ENERGY INTERCONNECTION-CHINA, 2023, 6 (06): : 726 - 737
  • [43] Application of four machine-learning methods to predict short-horizon wind energy
    Bouabdallaoui D.
    Haidi T.
    Elmariami F.
    Derri M.
    Mellouli E.M.
    Global Energy Interconnection, 2023, 6 (06) : 726 - 737
  • [44] Empirical analyses of three explanations for the positive autocorrelation of short-horizon stock index returns
    Ogden J.P.
    Review of Quantitative Finance and Accounting, 1997, 9 (2) : 203 - 217
  • [45] Short-Horizon Model Predictive Modulation of Three-Phase Voltage Source Inverters
    Townsend, Christopher D.
    Mirzaeva, Galina
    Goodwin, Graham C.
    IEEE TRANSACTIONS ON INDUSTRIAL ELECTRONICS, 2018, 65 (04) : 2945 - 2955
  • [46] Asymmetric reverting behavior of short-horizon stock returns: An evidence of stock market overreaction
    Nam, K
    Pyun, CS
    Avard, SL
    JOURNAL OF BANKING & FINANCE, 2001, 25 (04) : 807 - 824
  • [47] Application of four machine-learning methods to predict short-horizon wind energy
    Doha Bouabdallaoui
    Touria Haidi
    Faissal Elmariami
    Mounir Derri
    El Mehdi Mellouli
    Global Energy Interconnection, 2023, 6 (06) : 726 - 737
  • [48] Simulation of human motion data using short-horizon model-predictive control
    da Silva, M.
    Abe, Y.
    Popovic, J.
    COMPUTER GRAPHICS FORUM, 2008, 27 (02) : 371 - 380
  • [49] Learning Surrogate Functions for the Short-Horizon Planning in Same-Day Delivery Problems
    Bracher, Adrian
    Frohner, Nikolaus
    Raidl, Gunther R.
    INTEGRATION OF CONSTRAINT PROGRAMMING, ARTIFICIAL INTELLIGENCE, AND OPERATIONS RESEARCH, 2021, 12735 : 283 - 298
  • [50] Robust Energy Management System With Safe Reinforcement Learning Using Short-Horizon Forecasts
    Hong, Seong-Hyun
    Lee, Hyun-Suk
    IEEE TRANSACTIONS ON SMART GRID, 2023, 14 (03) : 2485 - 2488