LEARNING SKILLS DIVERSE IN VALUE-RELEVANT FEATURES

Cited by: 0
Authors
Smith, Matthew J. A. [1]
Luketina, Jelena [1]
Hartikainen, Kristian [1]
Igl, Maximilian [1]
Whiteson, Shimon [1]
Affiliations
[1] Univ Oxford, Oxford, England
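Funding
Engineering and Physical Sciences Research Council (UK)
Keywords
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract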
Behavioural abstraction via temporally extended actions is vital to solving large-scale reinforcement learning problems. Skills structure exploration, speed up credit assignment, and can be used in transfer learning. However, such abstraction is often difficult or expensive for experts to craft by hand. Unsupervised information-theoretic methods (Gregor et al., 2016; Eysenbach et al., 2019; Sharma et al., 2020) address this problem by learning a set of skills without using environment rewards, typically by maximizing discriminability of the states visited by individual skills. However, since only some features of the state matter in complex environments, these methods often discover behaviours that are trivially diverse, learning skills that are not helpful for downstream tasks. To overcome this limitation, we propose a method for learning skills that only control features important to the tasks of interest. First, by training on a small set of source tasks, the agent learns which features are most relevant. Then, the discriminability objective for an unsupervised information-theoretic method is defined for this learned feature space. This allows the construction of sets of diverse and useful skills that can control the most important features. Experimental results in continuous control domains validate our method, demonstrating that it yields skills that substantially improve learning on downstream locomotion tasks with sparse rewards.
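The second step described in the abstract, defining the discriminability objective on the learned feature space, can be illustrated with a minimal sketch. It assumes a DIAYN-style intrinsic reward, log q(z | phi(s)) - log p(z), with a uniform prior over skills. The linear feature map `feature_weights`, the linear discriminator `disc_weights`, and the function name `skill_reward` are hypothetical stand-ins chosen for illustration; the abstract does not specify the authors' architecture or exact objective.

```python
import numpy as np


def log_softmax(logits):
    # Numerically stable log-softmax over the last axis.
    shifted = logits - logits.max(axis=-1, keepdims=True)
    return shifted - np.log(np.exp(shifted).sum(axis=-1, keepdims=True))


def skill_reward(state, skill, feature_weights, disc_weights):
    """DIAYN-style intrinsic reward: log q(z | phi(s)) - log p(z).

    feature_weights: hypothetical linear feature map phi (features x state dims),
        standing in for features learned on the source tasks.
    disc_weights: hypothetical linear skill discriminator (skills x features).
    Assumes a uniform prior p(z) over skills.
    """
    features = feature_weights @ state              # phi(s)
    log_q = log_softmax(disc_weights @ features)    # log q(. | phi(s))
    num_skills = disc_weights.shape[0]
    return log_q[skill] + np.log(num_skills)        # subtract log(1 / num_skills)


# Toy setup (assumed): only state dimension 0 is value-relevant.
phi = np.array([[1.0, 0.0, 0.0]])
disc = np.array([[2.0], [-2.0]])                    # 2 skills, 1 feature

s_a = np.array([1.0, 5.0, -3.0])
s_b = np.array([1.0, -7.0, 9.0])                    # differs only in irrelevant dims

r_a = skill_reward(s_a, 0, phi, disc)
r_b = skill_reward(s_b, 0, phi, disc)
```

In this toy setup the two states receive the same reward because they differ only in dimensions the feature map ignores, so a skill gains nothing from varying them. This is the sense in which diversity is restricted to the relevant features.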
Pages: 21