Comments on "Co-Evolution in the successful learning of backgammon strategy"

Cited by: 10
Authors
Tesauro, G [1]
Affiliations
[1] IBM Corp, Thomas J Watson Res Ctr, Yorktown Heights, NY 10598 USA
Keywords
co-evolution; backgammon; temporal difference learning;
DOI
10.1023/A:1007469231743
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Subject classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
The results obtained by Pollack and Blair substantially underperform my 1992 TD Learning results. This is shown by directly benchmarking the 1992 TD nets against Pubeval. A plausible hypothesis for this underperformance is that, unlike TD learning, the hillclimbing algorithm fails to capture nonlinear structure inherent in the problem, and despite the presence of hidden units, only obtains a linear approximation to the optimal policy for backgammon. Two lines of evidence supporting this hypothesis are discussed, the first coming from the structure of the Pubeval benchmark program, and the second coming from experiments replicating the Pollack and Blair results.
Pages: 241-243
Page count: 3
Related papers
50 in total
  • [1] Comments on “Co-Evolution in the Successful Learning of Backgammon Strategy”
    Gerald Tesauro
    [J]. Machine Learning, 1998, 32 : 241 - 243
  • [2] Co-Evolution in the Successful Learning of Backgammon Strategy
    Jordan B. Pollack
    Alan D. Blair
    [J]. Machine Learning, 1998, 32 : 225 - 240
  • [3] Co-evolution in the successful learning of backgammon strategy
    Pollack, JB
    Blair, AD
    [J]. MACHINE LEARNING, 1998, 32 (03) : 225 - 240
  • [4] The Co-evolution of Learning and Internationalization Strategy in International New Ventures
    Juan M. Pellegrino
    Rod B. McNaughton
    [J]. Management International Review, 2015, 55 : 457 - 483
  • [5] The Co-evolution of Learning and Internationalization Strategy in International New Ventures
    Pellegrino, Juan M.
    McNaughton, Rod B.
    [J]. MANAGEMENT INTERNATIONAL REVIEW, 2015, 55 (04) : 457 - 483
  • [6] Why Co-Evolution beats Temporal Difference learning at Backgammon for a linear architecture, but not a non-linear architecture
    Darwen, PJ
    [J]. PROCEEDINGS OF THE 2001 CONGRESS ON EVOLUTIONARY COMPUTATION, VOLS 1 AND 2, 2001, : 1003 - 1010
  • [7] A Novel Differential Evolution with Co-evolution Strategy
    Lee, Wei-Ping
    Chien, Wan-Jou
    [J]. JOURNAL OF COMPUTERS, 2011, 6 (03) : 594 - 602
  • [8] Analyzing the co-evolution of comments and source code
    Fluri, Beat
    Wuersch, Michael
    Giger, Emanuel
    Gall, Harald C.
    [J]. SOFTWARE QUALITY JOURNAL, 2009, 17 (04) : 367 - 394
  • [9] Analyzing the co-evolution of comments and source code
    Beat Fluri
    Michael Würsch
    Emanuel Giger
    Harald C. Gall
    [J]. Software Quality Journal, 2009, 17 : 367 - 394
  • [10] The Co-evolution of Digital Platform Strategy and Platform Architecture
    Kovacevic-Opacic, Lana
    Marjanovic, Olivera
    [J]. AMCIS 2020 PROCEEDINGS, 2020,