ML-based Arm Recommendation in Short-Horizon MABs

Authors
Zipori, Or [1]
Sarne, David [1]
Affiliations
[1] Bar Ilan Univ, Ramat Gan, Israel
Keywords
HAI experimental methods; human-virtual agent interaction; Multi Armed Bandit; Machine learning; Monte-Carlo Simulation; Recommender Agents
DOI
10.1145/3472307.3484673
Chinese Library Classification (CLC)
TP3 [Computing technology, computer technology]
Discipline classification code
0812
Abstract
In many settings where an agent needs to suggest or recommend a course of action to its user, the agent's goal may not fully align with the user's goal. In particular, the agent may maximize its benefit if the user chooses specific alternatives that are not necessarily the ones that maximize her own individual benefit. In this paper we study such a setting in the context of providing advice in two-armed bandit problems. We explore potential strategies for an agent aiming to influence which arm is picked. In particular, we focus on a somewhat naive recommendation strategy that always recommends the agent's preferred arm, and a strategy that recommends based on various machine learning models that aim to guide the decision regarding when to switch to the agent's least preferred arm. Based on extensive evaluation, we find that both recommendation strategies result in better performance compared to not making any recommendation, and that the naive recommendation strategy performs slightly better than the ML-based recommendations, despite the substantial amount of training data used for the latter.
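The setting in the abstract can be illustrated with a small Monte-Carlo sketch. This is not the authors' experimental setup: the Bernoulli rewards, the arm means, the horizon, and the simulated user (who follows advice with probability `follow_prob` and otherwise picks greedily by empirical success rate) are all assumptions made for illustration, and the names `simulate`, `naive_recommender`, and `no_recommender` are hypothetical. The ML-based recommender is not modeled here.

```python
import random

PREFERRED_ARM = 0  # assumption: the agent benefits when the user pulls arm 0


def naive_recommender(pulls, wins):
    """Naive strategy: always recommend the agent's preferred arm."""
    return PREFERRED_ARM


def no_recommender(pulls, wins):
    """Baseline: the agent gives no advice."""
    return None


def user_choice(pulls, wins, rng):
    """Hypothetical user model: greedy on empirical success rate.

    Untried arms get an optimistic estimate of 1.0; ties are broken at random.
    """
    est = [wins[a] / pulls[a] if pulls[a] else 1.0 for a in (0, 1)]
    if est[0] == est[1]:
        return rng.randrange(2)
    return 0 if est[0] > est[1] else 1


def simulate(recommend, horizon=10, follow_prob=0.5, means=(0.3, 0.7),
             runs=2000, seed=0):
    """Return the fraction of all pulls that went to the agent's preferred arm."""
    rng = random.Random(seed)
    preferred_pulls = 0
    for _ in range(runs):
        pulls, wins = [0, 0], [0, 0]
        for _ in range(horizon):
            advice = recommend(pulls, wins)
            if advice is not None and rng.random() < follow_prob:
                arm = advice  # the user follows the recommendation
            else:
                arm = user_choice(pulls, wins, rng)
            pulls[arm] += 1
            if rng.random() < means[arm]:  # Bernoulli reward
                wins[arm] += 1
        preferred_pulls += pulls[PREFERRED_ARM]
    return preferred_pulls / (runs * horizon)


if __name__ == "__main__":
    print("no recommendation:", simulate(no_recommender))
    print("naive recommendation:", simulate(naive_recommender))
```

Under these assumptions, the agent's performance is measured as the share of pulls on its preferred arm, which is deliberately the arm with the lower mean reward for the user, mirroring the misaligned goals the abstract describes.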
Pages: 377-381
Page count: 5