Model-Free Preference Elicitation

被引:0
|
作者
Martinet, Carlos [2 ,6 ]
Boutilieri, Craig [1 ]
Meshil, Ofer [1 ]
Sandholm, Tuomas [2 ,3 ,4 ,5 ]
机构
[1] Google Res, Mountain View, CA 94043 USA
[2] Carnegie Mellon Univ, Pittsburgh, PA 15213 USA
[3] Strategy Robot Inc, Pittsburgh, PA USA
[4] Optimized Markets Inc, Pittsburgh, PA USA
[5] Strateg Machine Inc, Pittsburgh, PA USA
[6] Google, Mountain View, CA 94043 USA
基金
美国国家科学基金会;
关键词
D O I
暂无
中图分类号
学科分类号
摘要
In recommender systems, preference elicitation (PE) is an effective way to learn about a user's preferences to improve recommendation quality. Expected value of information (EVOI), a Bayesian technique that computes expected gain in user utility, has proven to be effective in selecting useful PE queries. Most EVOI methods use probabilistic models of user preferences and query responses to compute posterior utilities. By contrast, we develop model-free variants of EVOI that rely on function approximation to obviate the need for specific modeling assumptions. Specifically, we learn user response and utility models from existing data (often available in real-world recommender systems), which are used to estimate EVOI rather than relying on explicit probabilistic inference. We augment our approach by using online planning, specifically, Monte Carlo tree search, to further enhance our elicitation policies. We show that our approach offers significant improvement in recommendation quality over standard baselines on several PE tasks.
引用
收藏
页码:3493 / 3503
页数:11
相关论文
共 50 条
  • [31] Constructive Preference Elicitation
    Dragone, Paolo
    Teso, Stefano
    Passerini, Andrea
    FRONTIERS IN ROBOTICS AND AI, 2018, 4
  • [32] Preference elicitation and learning
    Mousseau, Vincent
    Pirlot, Marc
    EURO JOURNAL ON DECISION PROCESSES, 2015, 3 (1-2) : 1 - 3
  • [33] Preference Elicitation for DCOPs
    Tabakhi, Atena M.
    Le, Tiep
    Fioretto, Ferdinando
    Yeoh, William
    PRINCIPLES AND PRACTICE OF CONSTRAINT PROGRAMMING (CP 2017), 2017, 10416 : 278 - 296
  • [34] Stakeholder preference elicitation
    Cooke, R. M.
    Environmental Security in Harbors and Coastal Areas: MANAGEMENT USING COMPARATIVE RISK ASSESSMENT AND MULTI-CRITERIA DECISION ANALYSIS, 2007, : 149 - 160
  • [35] Preference anomalies, preference elicitation and the discovered preference hypothesis
    Braga, J
    Starmer, C
    ENVIRONMENTAL & RESOURCE ECONOMICS, 2005, 32 (01): : 55 - 89
  • [36] Preference Anomalies, Preference Elicitation and the Discovered Preference Hypothesis
    Jacinto Braga
    Chris Starmer
    Environmental and Resource Economics, 2005, 32 : 55 - 89
  • [37] Model-free adaptive (MFA) control
    Cheng, GS
    COMPUTING & CONTROL ENGINEERING JOURNAL, 2004, 15 (03): : 28 - 33
  • [38] A Proof of Stability of Model-Free Control
    Delaleau, Emmanuel
    2014 IEEE CONFERENCE ON NORBERT WIENER IN THE 21ST CENTURY (21CW), 2014,
  • [39] A model-free identification of relative risk
    Kuzmina, Olga
    ECONOMICS LETTERS, 2020, 190
  • [40] Sequential Model-free Hyperparameter Tuning
    Wistuba, Martin
    Schilling, Nicolas
    Schmidt-Thieme, Lars
    2015 IEEE INTERNATIONAL CONFERENCE ON DATA MINING (ICDM), 2015, : 1033 - 1038