ML-based Arm Recommendation in Short-Horizon MABs

Authors
Zipori, Or [1]
Sarne, David [1]
Affiliations
[1] Bar Ilan Univ, Ramat Gan, Israel
Keywords
HAI experimental methods; human-virtual agent interaction; Multi Armed Bandit; Machine learning; Monte-Carlo Simulation; Recommender Agents
DOI
10.1145/3472307.3484673
Chinese Library Classification (CLC) number
TP3 [Computing technology, computer technology]
Discipline classification code
0812
Abstract
In many settings where an agent needs to suggest or recommend a course of action to its user, the agent's goal may not fully align with the user's goal. In particular, the agent may maximize its benefit if the user chooses specific alternatives that are not necessarily the ones that maximize her own individual benefit. In this paper we study such a setting in the context of providing advice in two-armed bandit problems. We explore potential strategies for an agent aiming to influence which arm is picked. In particular, we focus on a somewhat naive recommendation strategy that always recommends the preferred arm, and on a strategy that recommends based on various machine learning models that aim to guide the decision regarding when to switch to the agent's least preferred arm. Based on extensive evaluation, we find that both recommendation strategies result in better performance compared to not making any recommendation, and that the naive recommendation strategy performs slightly better than the ML-based recommendations, despite the use of a substantial amount of training data for the latter.
Pages: 377-381
Page count: 5