ML-based Arm Recommendation in Short-Horizon MABs

被引：0

作者：

Zipori, Or ^{[1
]}

Sarne, David ^{[1
]}

机构：

[1] Bar Ilan Univ, Ramat Gan, Israel

来源：

PROCEEDINGS OF THE 9TH INTERNATIONAL USER MODELING, ADAPTATION AND PERSONALIZATION HUMAN-AGENT INTERACTION, HAI 2021 | 2021年

关键词：

HAI experimental methods; human-virtual agent interaction; Multi Armed Bandit; Machine learning; Monte-Carlo Simulation; Recommender Agents;

D O I：

10.1145/3472307.3484673

中图分类号：

TP3 [计算技术、计算机技术];

学科分类号：

0812 ;

摘要：

In many settings where an agent needs to suggest or recommend a course of action to its user, the agent's goal may not fully align with the user's goal. In particular, the agent may maximize its benefit if the user chooses specific alternatives that are not necessarily the ones that maximize her own individual benefit. In this paper we study such setting in the context of providing advice in two-armed bandit problems. We explore a potential strategy for the agent aiming to influence the arm to be picked. In particular we focus on a somehow naive recommendation strategy that always recommend the preferred arm and a strategy that recommends based on various Machine Learning models that aim to guide the decision regarding when to switch to the agent's least preferred arm. Based on extensive evaluation we find that both recommendation strategies results in better performance compared to not making any recommendation, and that the naive recommendation strategy performs slightly better than the ML-based recommendations, despite using a substantial amount of training data for the latter.

引用

页码：377 / 381

页数：5

共 50 条

[31] Short Paper: Static and Microarchitectural ML-Based Approaches For Detecting Spectre Vulnerabilities and Attacks
Biringa, Chidera
Baye, Gaspard
Kul, Gokhan
PROCEEDINGS OF THE 11TH INTERNATIONAL WORKSHOP ON HARDWARE AND ARCHITECTURAL SUPPORT FOR SECURITY AND PRIVACY, HASP 2022, 2022, : 53 - 57
[32] Short-horizon event study estimation with a STAR model and real contaminated events
Andreou P.C.
Louca C.
Savva C.S.
Review of Quantitative Finance and Accounting, 2016, 47 (3) : 673 - 697
[33] Regression Oracles and Exploration Strategies for Short-Horizon Multi-Armed Bandits
Gray, Robert C.
Zhu, Jichen
Ontanon, Santiago
2020 IEEE CONFERENCE ON GAMES (IEEE COG 2020), 2020, : 312 - 319
[34] ML-based Demand Forecast with External Factors
Hellmers López D.
Julia Kramer K.
Schmidt M.
ZWF Zeitschrift fuer Wirtschaftlichen Fabrikbetrieb, 2023, 118 (05): : 324 - 329
[35] ML-Based Teaching Systems: A Conceptual Framework
Spitzer P.
Kühl N.
Heinz D.
Satzger G.
Proceedings of the ACM on Human-Computer Interaction, 2023, 7 (CSCW2)
[36] ML-based Expert Products Scoring System
Mendori, Patryk
Pelc, Mariusz
Kawala-Sterniuk, Aleksandra
Gola, Mariusz
2024 PROGRESS IN APPLIED ELECTRICAL ENGINEERING, PAEE 2024, 2024,
[37] ML-based Power Seat Control system
Hong, Kang-Woon
Park, Dong-Hwan
2019 10TH INTERNATIONAL CONFERENCE ON INFORMATION AND COMMUNICATION TECHNOLOGY CONVERGENCE (ICTC): ICT CONVERGENCE LEADING THE AUTONOMOUS FUTURE, 2019, : 1260 - 1261
[38] ML-based EDA from Research to Production
Liu, Wen-Hao
Ren, Haoxing
2024 INTERNATIONAL VLSI SYMPOSIUM ON TECHNOLOGY, SYSTEMS AND APPLICATIONS, VLSI TSA, 2024,
[39] ML-Based Early Detection of IoT Botnets
Kumar, Ayush
Shridhar, Mrinalini
Swaminathan, Sahithya
Lim, Teng Joon
SECURITY AND PRIVACY IN COMMUNICATION NETWORKS (SECURECOMM 2020), PT II, 2020, 336 : 254 - 260
[40] Robustify ML-Based Lithography Hotspot Detectors
Pan, Jingyu
Chang, Chen-Chia
Xie, Zhiyao
Hu, Jiang
Chen, Yiran
2022 IEEE/ACM INTERNATIONAL CONFERENCE ON COMPUTER AIDED DESIGN, ICCAD, 2022,

← 1 2 3 4 5 →