THE STATE-ACTION PROBLEM

Cited by: 0
Author
FREUND, PA
DOI
Not available
Chinese Library Classification (CLC) number
C [General Social Sciences];
Discipline classification code
03; 0303;
Pages: 3 - 12
Page count: 10
Related papers
50 in total
  • [31] Estimation of the Change of Agents Behavior Strategy Using State-Action History
    Uchida, Shihori
    Oba, Sigeyuki
    Ishii, Shin
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING, PT II, 2017, 10614 : 100 - 107
  • [32] Extracting important patterns for building state-action evaluation function in Othello
    Huy Nguyen
    Ikeda, Kokolo
    Le, Bac
    2012 CONFERENCE ON TECHNOLOGIES AND APPLICATIONS OF ARTIFICIAL INTELLIGENCE (TAAI), 2012, : 278 - 283
  • [33] Swarm Reinforcement Learning Methods for Problems with Continuous State-Action Space
    Iima, Hitoshi
    Kuroe, Yasuaki
    Emoto, Kazuo
    2011 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC), 2011, : 2173 - 2180
  • [34] Scaling Up Q-Learning via Exploiting State-Action Equivalence
    Lyu, Yunlian
    Come, Aymeric
    Zhang, Yijie
    Talebi, Mohammad Sadegh
    ENTROPY, 2023, 25 (04)
  • [35] A Plume-Tracing Strategy via Continuous State-action Reinforcement Learning
    Niu, Lvyin
    Song, Shiji
    You, Keyou
    2017 CHINESE AUTOMATION CONGRESS (CAC), 2017, : 759 - 764
  • [36] Mutual Information Based Knowledge Transfer Under State-Action Dimension Mismatch
    Wan, Michael
    Gangwani, Tanmay
    Peng, Jian
    CONFERENCE ON UNCERTAINTY IN ARTIFICIAL INTELLIGENCE (UAI 2020), 2020, 124 : 1218 - 1227
  • [37] Using Memory-Based Learning to Solve Tasks with State-Action Constraints
    Verghese, Mrinal
    Atkeson, Christopher
    2023 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2023), 2023, : 9558 - 9565
  • [38] SA-Net: Robust State-Action Recognition for Learning from Observations
    Soans, Nihal
    Asali, Ehsan
    Hong, Yi
    Doshi, Prashant
    2020 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2020, : 2153 - 2159
  • [39] Convergence of Markov decision processes with constraints and state-action dependent discount factors
    Wu, Xiao
    Guo, Xianping
    SCIENCE CHINA-MATHEMATICS, 2020, 63 (01) : 167 - 182
  • [40] Convergence of Markov decision processes with constraints and state-action dependent discount factors
    Xiao Wu
    Xianping Guo
    Science China Mathematics, 2020, 63 : 167 - 182