An adaptive approach for the exploration-exploitation dilemma for learning agents

被引:0
|
作者
Rejeb, L
Guessoum, Z
M'Hallah, R
机构
[1] CReSTIC, MODECO Team, Reims 2, France
[2] Univ Paris 06, LIP6, OASIS Team, F-75252 Paris 5, France
[3] Kuwait Univ, Dept Stat & Operat Res, Kuwait 13060, Kuwait
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Learning agents have to deal with the exploration-exploitation dilemma. The choice between exploration and exploitation is very difficult in dynamic systems; in particular in large scale ones such as economic systems. Recent research shows that there is neither an optimal nor a unique solution for this problem. In this paper, we propose an adaptive approach based on meta-rules to adapt the choice between exploration and exploitation. This new adaptive approach relies on the variations of the performance of the agents. To validate the approach, we apply it to economic systems and compare it to two adaptive methods: one local and one global. Herein, we adapt these two methods, which were originally proposed by Wilson, to economic systems. Moreover, we compare different exploration strategies and focus on their influence on the performance of the agents.
引用
收藏
页码:316 / 325
页数:10
相关论文
共 50 条
  • [21] The role of intolerance of uncertainty when solving the exploration-exploitation dilemma1
    Krypotos, Angelos-Miltiadis
    Alves, Maryna
    Crombez, Geert
    Vlaeyen, Johan W. S.
    [J]. INTERNATIONAL JOURNAL OF PSYCHOPHYSIOLOGY, 2022, 181 : 33 - 39
  • [22] Explaining Exploration-Exploitation in Humans
    Candelieri, Antonio
    Ponti, Andrea
    Archetti, Francesco
    [J]. BIG DATA AND COGNITIVE COMPUTING, 2022, 6 (04)
  • [23] Stream-based Joint Exploration-Exploitation Active Learning
    Loy, Chen Change
    Hospedales, Timothy M.
    Xiang, Tao
    Gong, Shaogang
    [J]. 2012 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2012, : 1560 - 1567
  • [24] Bayesian Deep Learning based Exploration-Exploitation for Personalized Recommendations
    Wang, Xin
    Kadioglu, Serdar
    [J]. 2019 IEEE 31ST INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE (ICTAI 2019), 2019, : 1715 - 1719
  • [25] Organizational success, human resources practices and exploration-exploitation learning
    Ubeda-Garcia, Mercedes
    Claver-Cortes, Enrique
    Marco-Lajara, Bartolome
    Garcia-Lillo, Francisco
    Zaragoza-Saez, Patrocinio
    [J]. EMPLOYEE RELATIONS, 2019, 41 (06) : 1379 - 1397
  • [26] Neuronal Mechanisms Underlying Exploration-Exploitation Strategies in Operant Learning
    Lew, Sergio
    Rey, Hernan G.
    Silvano Zanutto, B.
    [J]. 2013 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2013,
  • [27] Adaptive Genetic Algorithm with Exploration-Exploitation Tradeoff for Preprocessing Microarray Datasets
    Rajappan, Sivaraj
    Rangasamy, DeviPriya
    [J]. CURRENT BIOINFORMATICS, 2017, 12 (05) : 441 - 451
  • [28] Shareholder returns and the exploration-exploitation dilemma: R&D announcements by biotechnology firms
    Mc Namara, Peter
    Baden-Fuller, Charles
    [J]. RESEARCH POLICY, 2007, 36 (04) : 548 - 565
  • [29] Unpacking the exploration-exploitation tradeoff on Snapchat: The relationships between users' exploration-exploitation interests and server log data
    Gomez-Zara, Diego
    Liu, Yozen
    Neves, Leonardo
    Shah, Neil
    Bos, Maarten W.
    [J]. COMPUTERS IN HUMAN BEHAVIOR, 2024, 150
  • [30] The Exploration-Exploitation Dilemma: A Multidisciplinary Framework (vol 9, e95693, 2014)
    Berger-Tal, O.
    Nathan, J.
    Meron, E.
    Saltz, D.
    [J]. PLOS ONE, 2015, 10 (03):