Compression Progress, Pseudorandomness, & Hyperbolic Discounting

被引:0
|
作者
Looks, Moshe [1 ]
机构
[1] Google Inc, 1600 Amphitheatre Pkwy, Mountain View, CA 94043 USA
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
General intelligence requires open-ended exploratory learning. The principle of compression progress proposes that agents should derive intrinsic reward from maximizing "interestingness", the first derivative of compression progress over the agent's history. Schmid-huber posits that such a drive can explain "essential aspects of ... curiosity, creativity, art, science, music, [and] jokes", implying that such phenomena might be replicated in an artificial general intelligence programmed with such a drive. I pose two caveats: 1) as pointed out by Rayhawk, not everything that can be considered "interesting" according to this definition is interesting to humans; 2) because of (irrational) hyperbolic discounting of future rewards, humans have an additional preference for rewards that are structured to prevent premature satiation, often superseding intrinsic preferences for compression progress.
引用
收藏
页码:186 / 187
页数:2
相关论文
共 50 条