Uncertainty quantification and exploration-exploitation trade-off in humans

被引：1

作者：

Candelieri, Antonio ^{[1
]}

Ponti, Andrea ^{[2
]}

Archetti, Francesco ^{[2
]}

机构：

[1] Univ Milano Bicocca, Dept Econ Management & Stat, Milan, Italy

[2] Univ Milano Bicocca, Dept Comp Sci Syst & Commun, Milan, Italy

来源：

JOURNAL OF AMBIENT INTELLIGENCE AND HUMANIZED COMPUTING | 2021年 / 14卷 / 6期

关键词：

Active learning; Pareto analysis; Uncertainty quantification; Human learning; Exploration; exploitation dilemma; INFORMATION; OPTIMIZATION; DOPAMINE;

D O I：

10.1007/s12652-021-03547-5

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

The main objective of this paper is to outline a theoretical framework to analyse how humans' decision-making strategies under uncertainty manage the trade-off between information gathering (exploration) and reward seeking (exploitation). A key observation, motivating this line of research, is the awareness that human learners are amazingly fast and effective at adapting to unfamiliar environments and incorporating upcoming knowledge: this is an intriguing behaviour for cognitive sciences as well as an important challenge for Machine Learning. The target problem considered is active learning in a black-box optimization task and more specifically how the exploration/exploitation dilemma can be modelled within Gaussian Process based Bayesian Optimization framework, which is in turn based on uncertainty quantification. The main contribution is to analyse humans' decisions with respect to Pareto rationality where the two objectives are improvement expected and uncertainty quantification. According to this Pareto rationality model, if a decision set contains a Pareto efficient (dominant) strategy, a rational decision maker should always select the dominant strategy over its dominated alternatives. The distance from the Pareto frontier determines whether a choice is (Pareto) rational (i.e., lays on the frontier) or is associated to "exasperate" exploration. However, since the uncertainty is one of the two objectives defining the Pareto frontier, we have investigated three different uncertainty quantification measures and selected the one resulting more compliant with the Pareto rationality model proposed. The key result is an analytical framework to characterize how deviations from "rationality" depend on uncertainty quantifications and the evolution of the reward seeking process.

引用

页码：6843 / 6876

页数：34

共 50 条

[1] Uncertainty quantification and exploration–exploitation trade-off in humans
Antonio Candelieri
Andrea Ponti
Francesco Archetti
[J]. Journal of Ambient Intelligence and Humanized Computing, 2023, 14 : 6843 - 6876
[2] Uncertainty avoidance and the exploration-exploitation trade-off
Broekhuizen, Thijs L. J.
Giarratana, Marco S.
Torres, Anna
[J]. EUROPEAN JOURNAL OF MARKETING, 2017, 51 (11-12) : 2080 - 2100
[3] Exploration-exploitation Trade-off in a Treasure Hunting Game
Volchenkov, Dimitri
Helbach, Jonathan
Tscherepanow, Marko
Kueheel, Sina
[J]. ELECTRONIC NOTES IN THEORETICAL COMPUTER SCIENCE, 2013, 299 : 101 - 121
[4] The Exploration-Exploitation Trade-off in Interactive Recommender Systems
Barraza-Urbina, Andrea
[J]. PROCEEDINGS OF THE ELEVENTH ACM CONFERENCE ON RECOMMENDER SYSTEMS (RECSYS'17), 2017, : 431 - 435
[5] Dopamine blockade impairs the exploration-exploitation trade-off in rats
François Cinotti
Virginie Fresno
Nassim Aklil
Etienne Coutureau
Benoît Girard
Alain R. Marchand
Mehdi Khamassi
[J]. Scientific Reports, 9
[6] Dopamine blockade impairs the exploration-exploitation trade-off in rats
Cinotti, Francois
Fresno, Virginie
Aklil, Nassim
Coutureau, Etienne
Girard, Benoit
Marchand, Alain R.
Khamassi, Mehdi
[J]. SCIENTIFIC REPORTS, 2019, 9 (1)
[7] Energetic state regulates the exploration-exploitation trade-off in honeybees
Katz, Keziah
Naug, Dhruba
[J]. BEHAVIORAL ECOLOGY, 2015, 26 (04) : 1045 - 1050
[8] The implied exploration-exploitation trade-off in human motor learning
Holly N Phillips
Nikhil A Howai
Guy-Bart V Stan
Aldo A Faisal
[J]. BMC Neuroscience, 12 (Suppl 1)
[9] Dynamic Sparse Training via Balancing the Exploration-Exploitation Trade-off
Huang, Shaoyi
Lei, Bowen
Xu, Dongkuan
Peng, Hongwu
Sun, Yue
Xie, Mimi
Ding, Caiwen
[J]. 2023 60TH ACM/IEEE DESIGN AUTOMATION CONFERENCE, DAC, 2023,
[10] The role of the noradrenergic system in the exploration-exploitation trade-off: a psychopharmacological study
Jepma, Marieke
Beek, Erik T. Te
Wagenmakers, Eric-Jan
van Gerven, Joop M. A.
Nieuwenhuis, Sander
[J]. FRONTIERS IN HUMAN NEUROSCIENCE, 2010, 4

← 1 2 3 4 5 →