A two-step algorithm for learning from unspecific reinforcement

被引:0
|
作者
Kuhn, R
Stamatescu, IO
机构
[1] Univ Heidelberg, Inst Theoret Phys, D-69120 Heidelberg, Germany
[2] FEST, D-69118 Heidelberg, Germany
来源
关键词
D O I
暂无
中图分类号
O4 [物理学];
学科分类号
0702 ;
摘要
We study a simple learning model based on the Hebb rule to cope with 'delayed', unspecific reinforcement. In spite of the unspecific nature of the information-feedback convergence to asymptotically perfect generalization is observed, with a rate depending, however, in a nonuniversal way on learning parameters. Asymptotic convergence can be as fast as that of Hebbian learning, but may be slower. Morever, for a certain range of parameter settings, it depends on initial conditions whether the system can reach the regime of asymptotically perfect generalization, or rather approaches a stationary state of poor generalization.
引用
收藏
页码:5749 / 5762
页数:14
相关论文
共 50 条
  • [1] A novel dynamic target tracking algorithm for image based on two-step reinforcement learning
    [J]. Wang, Xiaokun, 1600, International Frequency Sensor Association (166):
  • [2] Two-step reinforcement learning for multistage strategy card game
    Godlewski, Konrad
    Sawicki, Bartosz
    [J]. Bulletin of the Polish Academy of Sciences: Technical Sciences, 2024, 72 (06)
  • [3] Two-step gradient-based reinforcement learning for underwater robotics behavior learning
    El-Fakdi, Andres
    Carreras, Marc
    [J]. ROBOTICS AND AUTONOMOUS SYSTEMS, 2013, 61 (03) : 271 - 282
  • [4] A simple model for learning from unspecific reinforcement
    Stamatescu, IO
    [J]. ADAPTIVITY AND LEARNING: AN INTERDISCIPLINARY DEBATE, 2003, : 265 - 280
  • [5] A Two-Step Approach of Feature Construction for a Genetic Learning Algorithm
    Garcia, David
    Gonzalez, Antonio
    Perez, Raul
    [J]. IEEE INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS (FUZZ 2011), 2011, : 1255 - 1262
  • [6] Learning structured data from unspecific reinforcement
    Biehl, M
    Kühn, R
    Stamatescu, IO
    [J]. JOURNAL OF PHYSICS A-MATHEMATICAL AND GENERAL, 2000, 33 (39): : 6843 - 6857
  • [7] Coronary artery decision algorithm trained by two-step machine learning algorithm
    Kim, Young Woo
    Yu, Hee-Jin
    Kim, Jung-Sun
    Ha, Jinyong
    Choi, Jongeun
    Lee, Joon Sang
    [J]. RSC ADVANCES, 2020, 10 (07) : 4014 - 4022
  • [8] Two-Step FORCE Learning Algorithm for Fast Convergence in Reservoir Computing
    Tamura, Hiroto
    Tanaka, Gouhei
    [J]. ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING, ICANN 2020, PT II, 2020, 12397 : 459 - 469
  • [9] A new two-step learning vector quantization algorithm for image compression
    Liu, Ruochen
    Li, Bingjie
    Zhang, Lang
    Jiao, Licheng
    [J]. TRANSACTIONS OF THE INSTITUTE OF MEASUREMENT AND CONTROL, 2015, 37 (01) : 3 - 14
  • [10] A Two-Step Iris Recognition Algorithm
    Liu Jin
    Liu Cangming
    Liu Tingting
    [J]. APPLIED SCIENCE, MATERIALS SCIENCE AND INFORMATION TECHNOLOGIES IN INDUSTRY, 2014, 513-517 : 559 - 562