Behavioral Repertoire via Generative Adversarial Policy Networks

被引:0
|
作者
Jegorova, Marija [1 ]
Doncieux, Stephane [2 ]
Hospedales, Timothy M. [1 ]
机构
[1] Univ Edinburgh, Sch Informat, Edinburgh, Midlothian, Scotland
[2] Sorbonne Univ, CNRS, ISIR, Paris, France
基金
欧盟地平线“2020”;
关键词
OPTIMIZATION; ROBOTS;
D O I
10.1109/tcds.2020.3008574
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Learning algorithms are enabling robots to solve increasingly challenging real-world tasks. These approaches often rely on demonstrations and reproduce the behavior shown. Unexpected changes in the environment may require using different behaviors to achieve the same effect, for instance to reach and grasp an object in changing clutter. An emerging paradigm addressing this robustness issue is to learn a diverse set of successful behaviors for a given task, from which a robot can select the most suitable policy when faced with a new environment. In this paper, we explore a novel realization of this vision by learning a generative model over policies. Rather than learning a single policy, or a small fixed repertoire, our generative model for policies compactly encodes an unbounded number of policies and allows novel controller variants to be sampled. Leveraging our generative policy network, a robot can sample novel behaviors until it finds one that works for a new environment. We demonstrate this idea with an application of robust ball-throwing in the presence of obstacles. We show that this approach achieves a greater diversity of behaviors than an existing evolutionary approach, while maintaining good efficacy of sampled behaviors, allowing a Baxter robot to hit targets more often when ball throwing in the presence of obstacles.
引用
收藏
页码:320 / 326
页数:7
相关论文
共 50 条
  • [1] Behavioral Repertoire via Generative Adversarial Policy Networks
    Jegorova, Marija
    Doncieux, Stephane
    Hospedales, Timothy M.
    [J]. IEEE TRANSACTIONS ON COGNITIVE AND DEVELOPMENTAL SYSTEMS, 2022, 14 (04) : 1344 - 1355
  • [2] Invisible steganography via generative adversarial networks
    Ru Zhang
    Shiqi Dong
    Jianyi Liu
    [J]. Multimedia Tools and Applications, 2019, 78 : 8559 - 8575
  • [3] Question Generation via Generative Adversarial Networks
    Liu, Dong
    Hong, Yu
    Yao, Jianmin
    Zhou, Guodong
    [J]. 2023 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, IJCNN, 2023,
  • [4] Invisible steganography via generative adversarial networks
    Zhang, Ru
    Dong, Shiqi
    Liu, Jianyi
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2019, 78 (07) : 8559 - 8575
  • [5] Machine Learning in NextG Networks via Generative Adversarial Networks
    Ayanoglu, Ender
    Davaslioglu, Kemal
    Sagduyu, Yalin E.
    [J]. IEEE TRANSACTIONS ON COGNITIVE COMMUNICATIONS AND NETWORKING, 2022, 8 (02) : 480 - 501
  • [6] Reconstruction of Asphalt Nanostructures via Generative Adversarial Networks
    Aljarrah, Mohammad
    Karaki, Ayman
    Masad, Eyad
    [J]. JOURNAL OF MATERIALS IN CIVIL ENGINEERING, 2023, 35 (08)
  • [7] Estimation with Uncertainty via Conditional Generative Adversarial Networks
    Lee, Minhyeok
    Seok, Junhee
    [J]. SENSORS, 2021, 21 (18)
  • [8] Diabetic retinopathy classification via Generative Adversarial Networks
    Mirabedini, Shirin
    Kangavari, Mohammadreza
    Mohammadzadeh, Javad
    [J]. BIOSCIENCE RESEARCH, 2020, 17 (02): : 1329 - 1338
  • [9] RapidFuzz: Accelerating fuzzing via Generative Adversarial Networks
    Ye, Aoshuang
    Wang, Lina
    Zhao, Lei
    Ke, Jianpeng
    Wang, Wenqi
    Liu, Qinliang
    [J]. Neurocomputing, 2021, 460 : 195 - 204
  • [10] Face Inpainting via Nested Generative Adversarial Networks
    Li, Zhijiang
    Zhu, Haonan
    Cao, Liqin
    Mao, Lei
    Zhong, Yanfei
    Ma, Ailong
    [J]. IEEE ACCESS, 2019, 7 : 155462 - 155471