LEARNING SKILLS DIVERSE IN VALUE-RELEVANT FEATURES

被引：0

作者：

Smith, Matthew J. A. ^{[1
]}

Luketina, Jelena ^{[1
]}

Hartikainen, Kristian ^{[1
]}

Igl, Maximilian ^{[1
]}

Whiteson, Shimon ^{[1
]}

机构：

[1] Univ Oxford, Oxford, England

来源：

CONFERENCE ON LIFELONG LEARNING AGENTS, VOL 199 | 2022年 / 199卷

基金：

英国工程与自然科学研究理事会;

关键词：

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Behavioural abstraction via temporally extended actions is vital to solving large-scale reinforcement learning problems. Skills structure exploration, speed up credit assignment, and can be used in transfer learning. However, such abstraction is often difficult or expensive for experts to craft by hand. Unsupervised information-theoretic methods (Gregor et al., 2016; Eysenbach et al., 2019; Sharma et al., 2020) address this problem by learning a set of skills without using environment rewards, typically by maximizing discriminability of the states visited by individual skills. However, since only some features of the state matter in complex environments, these methods often discover behaviours that are trivially diverse, learning skills that are not helpful for downstream tasks. To overcome this limitation, we propose a method for learning skills that only control features important to the tasks of interest. First, by training on a small set of source tasks, the agent learns which features are most relevant. Then, the discriminability objective for an unsupervised information-theoretic method is defined for this learned feature space. This allows the construction of sets of diverse and useful skills that can control the most important features. Experimental results in continuous control domains validate our method, demonstrating that it yields skills that substantially improve learning on downstream locomotion tasks with sparse rewards.

引用

页数：21

共 50 条

[41] Learning Relevant Image Features With Multiple-Kernel Classification
Tuia, Devis
Camps-Valls, Gustavo
Matasci, Giona
Kanevski, Mikhail
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2010, 48 (10): : 3780 - 3791
[42] ISOLATING RELEVANT FEATURES AND RETARDED AND NORMAL PUPILS LEARNING AFFIXES
BLAKE, K
JOURNAL OF RESEARCH AND DEVELOPMENT IN EDUCATION, 1975, 8 : 57 - 58
[43] CODING RELEVANT FEATURES AND RETARDED AND NORMAL PUPILS LEARNING AFFIXES
BLAKE, K
JOURNAL OF RESEARCH AND DEVELOPMENT IN EDUCATION, 1975, 8 : 51 - 52
[44] Learning task-relevant features from robot data
Vlassis, N
Bunschoten, R
Kröse, B
2001 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION, VOLS I-IV, PROCEEDINGS, 2001, : 499 - 504
[45] Fractional Norm Regularization: Learning With Very Few Relevant Features
Kaban, Ata
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2013, 24 (06) : 953 - 963
[46] ``The Value of Relevant, Project-Based Learning to Youth Development
Schwartz, Kerry
Tessman, Darcy
McDonald, Daniel
JOURNAL OF YOUTH DEVELOPMENT, 2013, 8 (01): : 65 - 71
[47] Attentional Selection Can Be Predicted by Reinforcement Learning of Task-relevant Stimulus Features Weighted by Value-independent Stickiness
Balcarras, Matthew
Ardid, Salva
Kaping, Daniel
Everling, Stefan
Womelsdorf, Thilo
JOURNAL OF COGNITIVE NEUROSCIENCE, 2016, 28 (02) : 333 - 349
[48] Emotional design in multimedia learning: Differentiation on relevant design features and their effects on emotions and learning
Heidig, Steffi
Muller, Julia
Reichelt, Maria
COMPUTERS IN HUMAN BEHAVIOR, 2015, 44 : 81 - 95
[49] Pedagogical features of interactive apps for effective learning of foundational skills
Huntington, Bethany
Goulding, James
Pitchford, Nicola J.
BRITISH JOURNAL OF EDUCATIONAL TECHNOLOGY, 2023, 54 (05) : 1273 - 1291
[50] Learning Diverse Skills for Local Navigation under Multi-constraint Optimality
Cheng, Jin
Vlastelica, Marin
Kolev, Pavel
Li, Chenhao
Martius, Georg
2024 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION, ICRA 2024, 2024, : 5083 - 5089

← 1 2 3 4 5 →