ML-based Arm Recommendation in Short-Horizon MABs

Authors
Zipori, Or [1]
Sarne, David [1]
Affiliations
[1] Bar Ilan Univ, Ramat Gan, Israel
Keywords
HAI experimental methods; human-virtual agent interaction; Multi Armed Bandit; Machine learning; Monte-Carlo Simulation; Recommender Agents
DOI
10.1145/3472307.3484673
Chinese Library Classification number
TP3 [Computing technology, computer technology]
Discipline classification code
0812
Abstract
In many settings where an agent needs to suggest or recommend a course of action to its user, the agent's goal may not fully align with the user's goal. In particular, the agent may maximize its benefit if the user chooses specific alternatives that are not necessarily the ones that maximize the user's own benefit. In this paper we study such a setting in the context of providing advice in two-armed bandit problems. We explore potential strategies for an agent aiming to influence which arm is picked. In particular, we focus on a somewhat naive recommendation strategy that always recommends the agent's preferred arm, and on a strategy that recommends based on various machine learning models that guide the decision of when to switch to the agent's least preferred arm. Based on extensive evaluation, we find that both recommendation strategies result in better performance than making no recommendation, and that the naive recommendation strategy performs slightly better than the ML-based recommendations, despite the substantial amount of training data used for the latter.
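The setting in the abstract can be illustrated with a small Monte Carlo sketch. It is not the authors' experimental setup: the user model (greedy on empirical success rates, following advice with probability follow_prob), the Bernoulli arm means, the horizon, and every function name below are assumptions made purely for illustration. The sketch compares only the naive strategy (always recommend the agent's preferred arm, here arm 0) against giving no recommendation, and measures the share of pulls that land on the preferred arm.

```python
import random


def simulate(recommend, horizon, follow_prob, means, rng):
    """Run one short-horizon two-armed bandit episode.

    Returns how many times the user pulled arm 0, which this sketch
    treats as the agent's preferred arm (an assumption).
    """
    pulls = [0, 0]
    wins = [0, 0]
    for _ in range(horizon):
        advice = recommend(pulls, wins)  # None means "no recommendation"
        if advice is not None and rng.random() < follow_prob:
            arm = advice
        else:
            # Hypothetical user model: try untried arms first, then act
            # greedily on empirical success rates, breaking ties at random.
            untried = [a for a in (0, 1) if pulls[a] == 0]
            if untried:
                arm = rng.choice(untried)
            else:
                rates = [wins[a] / pulls[a] for a in (0, 1)]
                if rates[0] == rates[1]:
                    arm = rng.choice((0, 1))
                else:
                    arm = 0 if rates[0] > rates[1] else 1
        reward = 1 if rng.random() < means[arm] else 0
        pulls[arm] += 1
        wins[arm] += reward
    return pulls[0]


def no_recommendation(pulls, wins):
    """Baseline: the agent stays silent."""
    return None


def naive_recommendation(pulls, wins):
    """Naive strategy: always recommend the agent's preferred arm (arm 0)."""
    return 0


def monte_carlo(recommend, runs=5000, horizon=10, follow_prob=0.5,
                means=(0.4, 0.6), seed=0):
    """Estimate the share of pulls on the preferred arm over many episodes."""
    rng = random.Random(seed)
    total = sum(simulate(recommend, horizon, follow_prob, means, rng)
                for _ in range(runs))
    return total / (runs * horizon)


if __name__ == "__main__":
    print("no recommendation:", monte_carlo(no_recommendation))
    print("naive recommendation:", monte_carlo(naive_recommendation))
```

Under these assumed parameters the preferred arm is the worse one for the user (mean 0.4 versus 0.6), so the gap between the two printed shares reflects only how much the advice shifts the user's choices in this toy model. An ML-based strategy would replace naive_recommendation with a function that sometimes returns arm 1; that part is not sketched here.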
Pages: 377-381
Page count: 5