LEARNING SKILLS DIVERSE IN VALUE-RELEVANT FEATURES

被引:0
|
作者
Smith, Matthew J. A. [1 ]
Luketina, Jelena [1 ]
Hartikainen, Kristian [1 ]
Igl, Maximilian [1 ]
Whiteson, Shimon [1 ]
机构
[1] Univ Oxford, Oxford, England
基金
英国工程与自然科学研究理事会;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Behavioural abstraction via temporally extended actions is vital to solving large-scale reinforcement learning problems. Skills structure exploration, speed up credit assignment, and can be used in transfer learning. However, such abstraction is often difficult or expensive for experts to craft by hand. Unsupervised information-theoretic methods (Gregor et al., 2016; Eysenbach et al., 2019; Sharma et al., 2020) address this problem by learning a set of skills without using environment rewards, typically by maximizing discriminability of the states visited by individual skills. However, since only some features of the state matter in complex environments, these methods often discover behaviours that are trivially diverse, learning skills that are not helpful for downstream tasks. To overcome this limitation, we propose a method for learning skills that only control features important to the tasks of interest. First, by training on a small set of source tasks, the agent learns which features are most relevant. Then, the discriminability objective for an unsupervised information-theoretic method is defined for this learned feature space. This allows the construction of sets of diverse and useful skills that can control the most important features. Experimental results in continuous control domains validate our method, demonstrating that it yields skills that substantially improve learning on downstream locomotion tasks with sparse rewards.
引用
收藏
页数:21
相关论文
共 50 条
  • [31] Teaching Basic Lab Skills Using Diverse Microbial Communities in a Biologically Relevant Context
    Brooke, Joanna S.
    Funk, Phillip E.
    Silliker, Margaret E.
    Sparkes, Timothy C.
    AMERICAN BIOLOGY TEACHER, 2010, 72 (08): : 490 - 494
  • [32] Learning Force-Relevant Skills from Human Demonstration
    Gao, Xiao
    Ling, Jie
    Xiao, Xiaohui
    Li, Miao
    COMPLEXITY, 2019, 2019
  • [33] Value for Learning Communication Skills in Undergraduate Students
    Cline, Christina
    Samonds, Karen
    Bergan-Roller, Heather
    FASEB JOURNAL, 2021, 35
  • [34] Literacy skills in science learning among linguistically diverse students
    Lee, O
    Fradd, SH
    SCIENCE EDUCATION, 1996, 80 (06) : 651 - 671
  • [35] DEVELOPING ENGLISH LANGUAGE SKILLS IN A MULTICULTURAL AND DIVERSE LEARNING ENVIRONMENT
    Arias, Nemis Garcia
    Quevedo Arnaiz, Ned Vito
    REVISTA CONRADO, 2023, 19 : 431 - 438
  • [36] Learning to Discover Task-Relevant Features for Interpretable Reinforcement Learning
    Zhang, Qiyuan
    Ma, Xiaoteng
    Yang, Yiqin
    Li, Chenghao
    Yang, Jun
    Liu, Yu
    Liang, Bin
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2021, 6 (04) : 6601 - 6607
  • [37] Selection of relevant features in a fuzzy genetic learning algorithm
    González, A
    Pérez, R
    IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART B-CYBERNETICS, 2001, 31 (03): : 417 - 425
  • [38] PRIOR KNOWLEDGE AND FUNCTIONALLY RELEVANT FEATURES IN CONCEPTS LEARNING
    WISNIEWSKI, EJ
    JOURNAL OF EXPERIMENTAL PSYCHOLOGY-LEARNING MEMORY AND COGNITION, 1995, 21 (02) : 449 - 468
  • [39] Renaissance Robot: Optimal Transport Policy Fusion for Learning Diverse Skills
    Tan, Julia
    Senanayake, Ransalu
    Ramos, Fabio
    2022 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2022, : 7052 - 7059
  • [40] A hybrid relevant-diverse approach for image re-ranking with multiple features
    Cui, Chaoran
    Ma, Jun
    Zhang, Lei
    Li, Piji
    Ren, Zhaochun
    INTERNATIONAL JOURNAL OF COMPUTER MATHEMATICS, 2011, 88 (18) : 3864 - 3881