Uncertainty quantification and exploration-exploitation trade-off in humans

Cited by: 1
Authors
Candelieri, Antonio [1]
Ponti, Andrea [2]
Archetti, Francesco [2]
Affiliations
[1] Univ Milano Bicocca, Dept Econ Management & Stat, Milan, Italy
[2] Univ Milano Bicocca, Dept Comp Sci Syst & Commun, Milan, Italy
Keywords
Active learning; Pareto analysis; Uncertainty quantification; Human learning; Exploration-exploitation dilemma; INFORMATION; OPTIMIZATION; DOPAMINE
DOI
10.1007/s12652-021-03547-5
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
The main objective of this paper is to outline a theoretical framework for analysing how humans' decision-making strategies under uncertainty manage the trade-off between information gathering (exploration) and reward seeking (exploitation). A key observation motivating this line of research is that human learners are remarkably fast and effective at adapting to unfamiliar environments and incorporating new knowledge: this behaviour is intriguing for the cognitive sciences and an important challenge for Machine Learning. The target problem is active learning in a black-box optimization task, and more specifically how the exploration/exploitation dilemma can be modelled within the Gaussian Process based Bayesian Optimization framework, which is in turn based on uncertainty quantification. The main contribution is an analysis of humans' decisions with respect to Pareto rationality, where the two objectives are expected improvement and uncertainty quantification. According to this Pareto rationality model, if a decision set contains a Pareto efficient (dominant) strategy, a rational decision maker should always select the dominant strategy over its dominated alternatives. The distance from the Pareto frontier determines whether a choice is (Pareto) rational (i.e., lies on the frontier) or is associated with "exasperate" exploration. However, since uncertainty is one of the two objectives defining the Pareto frontier, we investigated three different uncertainty quantification measures and selected the one most compliant with the proposed Pareto rationality model. The key result is an analytical framework that characterizes how deviations from "rationality" depend on uncertainty quantification and on the evolution of the reward-seeking process.
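The Pareto rationality idea in the abstract can be illustrated with a minimal sketch. It assumes each candidate decision is scored by two objectives to be maximized (expected improvement and an uncertainty measure), and it measures "distance from the frontier" as the Euclidean distance to the nearest non-dominated candidate. The function names, the toy scores, and this particular distance are illustrative assumptions, not the authors' definitions.

```python
import numpy as np

def pareto_mask(points):
    """Boolean mask of non-dominated rows; every column is maximized."""
    pts = np.asarray(points, dtype=float)
    mask = np.ones(pts.shape[0], dtype=bool)
    for i in range(pts.shape[0]):
        # Row j dominates row i if it is >= in all objectives and > in at least one.
        at_least_as_good = np.all(pts >= pts[i], axis=1)
        strictly_better = np.any(pts > pts[i], axis=1)
        if np.any(at_least_as_good & strictly_better):
            mask[i] = False
    return mask

def distance_to_frontier(choice, points):
    """Euclidean distance from `choice` to the nearest non-dominated point
    (an assumed, simplified notion of distance from the Pareto frontier)."""
    pts = np.asarray(points, dtype=float)
    front = pts[pareto_mask(pts)]
    diffs = front - np.asarray(choice, dtype=float)
    return float(np.min(np.linalg.norm(diffs, axis=1)))

# Hypothetical candidate decisions: (expected improvement, uncertainty).
candidates = np.array([
    [0.9, 0.1],  # mostly exploitative
    [0.6, 0.5],  # balanced
    [0.2, 0.9],  # mostly explorative
    [0.5, 0.4],  # dominated by [0.6, 0.5]
    [0.1, 0.2],  # dominated by several candidates
])

mask = pareto_mask(candidates)
for point, on_front in zip(candidates, mask):
    d = distance_to_frontier(point, candidates)
    print(point, "Pareto rational" if on_front else "off the frontier", round(d, 4))
```

In this toy setting a choice with distance zero lies on the frontier and would count as Pareto rational, while a larger distance marks a stronger deviation. How the paper actually computes the two objectives and the distance is not stated in this record.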
Pages: 6843-6876
Page count: 34
Related papers
50 in total
  • [1] Uncertainty quantification and exploration–exploitation trade-off in humans
    Antonio Candelieri
    Andrea Ponti
    Francesco Archetti
    [J]. Journal of Ambient Intelligence and Humanized Computing, 2023, 14 : 6843 - 6876
  • [2] Uncertainty avoidance and the exploration-exploitation trade-off
    Broekhuizen, Thijs L. J.
    Giarratana, Marco S.
    Torres, Anna
    [J]. EUROPEAN JOURNAL OF MARKETING, 2017, 51 (11-12) : 2080 - 2100
  • [3] Exploration-exploitation Trade-off in a Treasure Hunting Game
    Volchenkov, Dimitri
    Helbach, Jonathan
    Tscherepanow, Marko
    Kueheel, Sina
    [J]. ELECTRONIC NOTES IN THEORETICAL COMPUTER SCIENCE, 2013, 299 : 101 - 121
  • [4] The Exploration-Exploitation Trade-off in Interactive Recommender Systems
    Barraza-Urbina, Andrea
    [J]. PROCEEDINGS OF THE ELEVENTH ACM CONFERENCE ON RECOMMENDER SYSTEMS (RECSYS'17), 2017: 431 - 435
  • [5] Dopamine blockade impairs the exploration-exploitation trade-off in rats
    François Cinotti
    Virginie Fresno
    Nassim Aklil
    Etienne Coutureau
    Benoît Girard
    Alain R. Marchand
    Mehdi Khamassi
    [J]. Scientific Reports, 9
  • [6] Dopamine blockade impairs the exploration-exploitation trade-off in rats
    Cinotti, Francois
    Fresno, Virginie
    Aklil, Nassim
    Coutureau, Etienne
    Girard, Benoit
    Marchand, Alain R.
    Khamassi, Mehdi
    [J]. SCIENTIFIC REPORTS, 2019, 9 (1)
  • [7] Energetic state regulates the exploration-exploitation trade-off in honeybees
    Katz, Keziah
    Naug, Dhruba
    [J]. BEHAVIORAL ECOLOGY, 2015, 26 (04) : 1045 - 1050
  • [8] The implied exploration-exploitation trade-off in human motor learning
    Holly N Phillips
    Nikhil A Howai
    Guy-Bart V Stan
    Aldo A Faisal
    [J]. BMC Neuroscience, 12 (Suppl 1)
  • [9] Dynamic Sparse Training via Balancing the Exploration-Exploitation Trade-off
    Huang, Shaoyi
    Lei, Bowen
    Xu, Dongkuan
    Peng, Hongwu
    Sun, Yue
    Xie, Mimi
    Ding, Caiwen
    [J]. 2023 60TH ACM/IEEE DESIGN AUTOMATION CONFERENCE, DAC, 2023
  • [10] The role of the noradrenergic system in the exploration-exploitation trade-off: a psychopharmacological study
    Jepma, Marieke
    Beek, Erik T. Te
    Wagenmakers, Eric-Jan
    van Gerven, Joop M. A.
    Nieuwenhuis, Sander
    [J]. FRONTIERS IN HUMAN NEUROSCIENCE, 2010, 4