Model-Free Preference Elicitation

被引：0

作者：

Martinet, Carlos ^{[2
,6
]}

Boutilieri, Craig ^{[1
]}

Meshil, Ofer ^{[1
]}

Sandholm, Tuomas ^{[2
,3
,4
,5
]}

机构：

[1] Google Res, Mountain View, CA 94043 USA

[2] Carnegie Mellon Univ, Pittsburgh, PA 15213 USA

[3] Strategy Robot Inc, Pittsburgh, PA USA

[4] Optimized Markets Inc, Pittsburgh, PA USA

[5] Strateg Machine Inc, Pittsburgh, PA USA

[6] Google, Mountain View, CA 94043 USA

来源：

PROCEEDINGS OF THE THIRTY-THIRD INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2024 | 2024年

基金：

美国国家科学基金会;

关键词：

D O I：

暂无

中图分类号：

学科分类号：

摘要：

In recommender systems, preference elicitation (PE) is an effective way to learn about a user's preferences to improve recommendation quality. Expected value of information (EVOI), a Bayesian technique that computes expected gain in user utility, has proven to be effective in selecting useful PE queries. Most EVOI methods use probabilistic models of user preferences and query responses to compute posterior utilities. By contrast, we develop model-free variants of EVOI that rely on function approximation to obviate the need for specific modeling assumptions. Specifically, we learn user response and utility models from existing data (often available in real-world recommender systems), which are used to estimate EVOI rather than relying on explicit probabilistic inference. We augment our approach by using online planning, specifically, Monte Carlo tree search, to further enhance our elicitation policies. We show that our approach offers significant improvement in recommendation quality over standard baselines on several PE tasks.

引用

页码：3493 / 3503

页数：11

共 50 条

[31] Constructive Preference Elicitation
Dragone, Paolo
Teso, Stefano
Passerini, Andrea
FRONTIERS IN ROBOTICS AND AI, 2018, 4
[32] Preference elicitation and learning
Mousseau, Vincent
Pirlot, Marc
EURO JOURNAL ON DECISION PROCESSES, 2015, 3 (1-2) : 1 - 3
[33] Preference Elicitation for DCOPs
Tabakhi, Atena M.
Le, Tiep
Fioretto, Ferdinando
Yeoh, William
PRINCIPLES AND PRACTICE OF CONSTRAINT PROGRAMMING (CP 2017), 2017, 10416 : 278 - 296
[34] Stakeholder preference elicitation
Cooke, R. M.
Environmental Security in Harbors and Coastal Areas: MANAGEMENT USING COMPARATIVE RISK ASSESSMENT AND MULTI-CRITERIA DECISION ANALYSIS, 2007, : 149 - 160
[35] Preference anomalies, preference elicitation and the discovered preference hypothesis
Braga, J
Starmer, C
ENVIRONMENTAL & RESOURCE ECONOMICS, 2005, 32 (01): : 55 - 89
[36] Preference Anomalies, Preference Elicitation and the Discovered Preference Hypothesis
Jacinto Braga
Chris Starmer
Environmental and Resource Economics, 2005, 32 : 55 - 89
[37] Model-free adaptive (MFA) control
Cheng, GS
COMPUTING & CONTROL ENGINEERING JOURNAL, 2004, 15 (03): : 28 - 33
[38] A Proof of Stability of Model-Free Control
Delaleau, Emmanuel
2014 IEEE CONFERENCE ON NORBERT WIENER IN THE 21ST CENTURY (21CW), 2014,
[39] A model-free identification of relative risk
Kuzmina, Olga
ECONOMICS LETTERS, 2020, 190
[40] Sequential Model-free Hyperparameter Tuning
Wistuba, Martin
Schilling, Nicolas
Schmidt-Thieme, Lars
2015 IEEE INTERNATIONAL CONFERENCE ON DATA MINING (ICDM), 2015, : 1033 - 1038

← 1 2 3 4 5 →