The two facets of the exploration-exploitation dilemma

Cited by: 1
Authors
Zhang, Kaifu [1 ]
Pan, Wei [1 ]
Affiliations
[1] Tsinghua Univ, Beijing 100084, Peoples R China
DOI
10.1109/IAT.2006.120
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
This paper proposes an algorithm to better solve the exploration-exploitation dilemma faced by model-less reinforcement learning agents. The main contribution is twofold: (1) The two facets of the exploration-exploitation dilemma are distinguished: in some cases, the agent faces a non-stationary environment, and it therefore needs to choose the best moment to explore in order to adapt to the changes; in other cases, the agent faces a relatively large state-action space, and it therefore needs to choose the most promising subset of states/actions to explore. Within this two-facet framework, we compared the relative advantages and limitations of two previously proposed algorithms in different situations. (2) We unified these two algorithms to produce a new algorithm that works fairly well in all testing situations.
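The two facets described in the abstract can be illustrated with a minimal sketch. This is not the paper's algorithm, and the abstract does not name the two algorithms it compares. The sketch assumes two standard textbook techniques instead: epsilon-greedy selection with a constant step size, which keeps exploring over time and so can adapt when a non-stationary environment changes, and a UCB1-style rule, which directs exploration toward the actions that look most promising. The function names, the two-armed bandit setup, and all parameter values are hypothetical.

```python
import math
import random


def epsilon_greedy(q, epsilon, rng):
    """Pick a random action with probability epsilon, else the greedy one."""
    if rng.random() < epsilon:
        return rng.randrange(len(q))
    return max(range(len(q)), key=q.__getitem__)


def ucb(q, counts, t, c=2.0):
    """Pick the action with the highest optimistic estimate (UCB1-style)."""
    for a, n in enumerate(counts):
        if n == 0:
            return a  # try every action once before comparing bonuses
    return max(
        range(len(q)),
        key=lambda a: q[a] + c * math.sqrt(math.log(t) / counts[a]),
    )


def run_nonstationary(steps=2000, switch_at=1000, epsilon=0.1, alpha=0.1, seed=0):
    """Two-armed bandit whose best arm changes at `switch_at` (assumed setup)."""
    rng = random.Random(seed)
    means = [1.0, 0.0]  # arm 0 is best at first
    q = [0.0, 0.0]      # action-value estimates
    for t in range(steps):
        if t == switch_at:
            means = [0.0, 1.0]  # the environment changes: arm 1 is now best
        a = epsilon_greedy(q, epsilon, rng)
        reward = means[a] + rng.gauss(0.0, 0.1)
        # A constant step size weights recent rewards more heavily,
        # so the estimates can track the change.
        q[a] += alpha * (reward - q[a])
    return q


if __name__ == "__main__":
    print(run_nonstationary())
    print(ucb([0.5, 0.4], [100, 1], t=101))
```

In this sketch, `run_nonstationary` corresponds to the first facet (when to explore) and `ucb` to the second (what to explore): a rarely tried action receives a large bonus and is selected even though its current estimate is slightly lower.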
Pages: 371 / +
Page count: 2
Related papers
50 in total
  • [1] The exploration-exploitation dilemma in pain
    Krypotos, Angelos
    [J]. PSYCHOSOMATIC MEDICINE, 2020, 82 (06) : A166 - A166
  • [2] The Exploration-Exploitation Dilemma: A Multidisciplinary Framework
    Berger-Tal, Oded
    Nathan, Jonathan
    Meron, Ehud
    Saltz, David
    [J]. PLOS ONE, 2014, 9 (04):
  • [3] The dynamics of pain avoidance: the exploration-exploitation dilemma
    Krypotos, Angelos-Miltiadis
    Crombez, Geert
    Vlaeyen, Johan W. S.
    [J]. PAIN, 2024, 165 (10) : 2145 - 2149
  • [4] The exploration-exploitation dilemma in pain: an experimental investigation
    Krypotos, Angelos-Miltiadis
    Crombez, Geert
    Alves, Maryna
    Claes, Nathalie
    Vlaeyen, Johan W. S.
    [J]. PAIN, 2022, 163 (02) : E215 - E233
  • [5] Exploration-exploitation: A cognitive dilemma still unresolved
    James, Russell N., III
    [J]. COGNITIVE NEUROSCIENCE, 2015, 6 (04) : 219 - 221
  • [6] An adaptive approach for the exploration-exploitation dilemma for learning agents
    Rejeb, L
    Guessoum, Z
    M'Hallah, R
    [J]. MULTI-AGENT SYSTEMS AND APPLICATIONS IV, PROCEEDINGS, 2005, 3690 : 316 - 325
  • [7] Probing the temporal dynamics of the exploration-exploitation dilemma of eye movements
    Ehinger, Benedikt V.
    Kaufhold, Lilli
    Koenig, Peter
    [J]. JOURNAL OF VISION, 2018, 18 (03):
  • [8] An adaptive approach for the exploration-exploitation dilemma and its application to economic systems
    Rejeb, Lilia
    Guessoum, Zahia
    M'Hallah, Rym
    [J]. LEARNING AND ADAPTION IN MULTI-AGENT SYSTEMS, 2006, 3898 : 165 - 176
  • [9] CSDSE: Apply Cooperative Search to Solve the Exploration-Exploitation Dilemma of Design Space Exploration
    Feng, Kaijie
    Fan, Xiaoya
    An, Jianfeng
    Wang, Haoyang
    Li, Chuxi
    [J]. ALGORITHMS AND ARCHITECTURES FOR PARALLEL PROCESSING, ICA3PP 2023, PT IV, 2024, 14490 : 1 - 23
  • [10] Overtaking Method based on Variance of Values: Resolving the Exploration-Exploitation Dilemma
    Ochi, Kento
    Kamiura, Moto
    [J]. 17TH ASIA PACIFIC SYMPOSIUM ON INTELLIGENT AND EVOLUTIONARY SYSTEMS, IES2013, 2013, 24 : 126 - 136