THE STATE-ACTION PROBLEM

Cited by: 0

Author:

FREUND, PA

Source:

PROCEEDINGS OF THE AMERICAN PHILOSOPHICAL SOCIETY | 1991 / Vol. 135 / No. 1

DOI:

Not available

Chinese Library Classification (CLC) number:

C [Social Sciences, General];

Discipline classification code:

03 ; 0303 ;

Pages: 3-12

Page count: 10

[31] Estimation of the Change of Agents Behavior Strategy Using State-Action History
Uchida, Shihori
Oba, Sigeyuki
Ishii, Shin
ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING, PT II, 2017, 10614 : 100 - 107
[32] Extracting important patterns for building state-action evaluation function in Othello
Huy Nguyen
Ikeda, Kokolo
Le, Bac
2012 CONFERENCE ON TECHNOLOGIES AND APPLICATIONS OF ARTIFICIAL INTELLIGENCE (TAAI), 2012, : 278 - 283
[33] Swarm Reinforcement Learning Methods for Problems with Continuous State-Action Space
Iima, Hitoshi
Kuroe, Yasuaki
Emoto, Kazuo
2011 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC), 2011, : 2173 - 2180
[34] Scaling Up Q-Learning via Exploiting State-Action Equivalence
Lyu, Yunlian
Come, Aymeric
Zhang, Yijie
Talebi, Mohammad Sadegh
ENTROPY, 2023, 25 (04)
[35] A Plume-Tracing Strategy via Continuous State-action Reinforcement Learning
Niu, Lvyin
Song, Shiji
You, Keyou
2017 CHINESE AUTOMATION CONGRESS (CAC), 2017, : 759 - 764
[36] Mutual Information Based Knowledge Transfer Under State-Action Dimension Mismatch
Wan, Michael
Gangwani, Tanmay
Peng, Jian
CONFERENCE ON UNCERTAINTY IN ARTIFICIAL INTELLIGENCE (UAI 2020), 2020, 124 : 1218 - 1227
[37] Using Memory-Based Learning to Solve Tasks with State-Action Constraints
Verghese, Mrinal
Atkeson, Christopher
2023 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2023), 2023, : 9558 - 9565
[38] SA-Net: Robust State-Action Recognition for Learning from Observations
Soans, Nihal
Asali, Ehsan
Hong, Yi
Doshi, Prashant
2020 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2020, : 2153 - 2159
[39] Convergence of Markov decision processes with constraints and state-action dependent discount factors
Wu, Xiao
Guo, Xianping
SCIENCE CHINA-MATHEMATICS, 2020, 63 (01) : 167 - 182
