Learning latent representations to co-adapt to humans

Cited by: 1
Authors
Parekh, Sagar [1 ]
Losey, Dylan P. [1 ]
Affiliation
[1] Virginia Tech, Mech Engn Dept, Blacksburg, VA 24060 USA
Funding
National Institute of Food and Agriculture (USA);
Keywords
Human-robot interaction; Representation learning; Reinforcement learning; ROBOT; GAMES;
DOI
10.1007/s10514-023-10109-5
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
When robots interact with humans in homes, on roads, or in factories, the human's behavior often changes in response to the robot. Non-stationary humans are challenging for robot learners: actions the robot has learned to coordinate with the original human may fail after the human adapts to the robot. In this paper we introduce an algorithmic formalism that enables robots (i.e., ego agents) to co-adapt alongside dynamic humans (i.e., other agents) using only the robot's low-level states, actions, and rewards. A core challenge is that humans not only react to the robot's behavior, but the way in which humans react inevitably changes both over time and between users. To deal with this challenge, our insight is that, instead of building an exact model of the human, robots can learn and reason over high-level representations of the human's policy and policy dynamics. Applying this insight, we develop RILI: Robustly Influencing Latent Intent. RILI first embeds low-level robot observations into predictions of the human's latent strategy and strategy dynamics. Next, RILI harnesses these predictions to select actions that influence the adaptive human towards advantageous, high-reward behaviors over repeated interactions. We demonstrate that, given RILI's measured performance with users sampled from an underlying distribution, we can probabilistically bound RILI's expected performance across new humans sampled from the same distribution. Our simulated experiments compare RILI to state-of-the-art representation and reinforcement learning baselines, and show that RILI better learns to coordinate with imperfect, noisy, and time-varying agents. Finally, we conduct two user studies where RILI co-adapts alongside actual humans in a game of tag and a tower-building task. See videos of our user studies here: https://youtu.be/WYGO5amDXbQ
Pages: 771-796
Page count: 26
Related papers
50 in total
  • [1] Learning latent representations to co-adapt to humans
    Parekh, Sagar
    Losey, Dylan P.
    [J]. Autonomous Robots, 2023, 47 : 771 - 796
  • [2] Co-Adapt Continuously Tailored Software
    Bovard, Pooja P.
    Gao, Harry T.
    Goodman, Jason B.
    Prasov, Zahar
    [J]. ADJUNCT PUBLICATION OF THE 27TH CONFERENCE ON USER MODELING, ADAPTATION AND PERSONALIZATION (ACM UMAP '19 ADJUNCT), 2019, : 105 - 106
  • [3] Learning Representations by Humans, for Humans
    Hilgard, Sophie
    Rosenfeld, Nir
    Banaji, Mahzarin R.
    Cao, Jack
    Parkes, David C.
    [J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 139, 2021, 139
  • [4] Learning to Adapt via Latent Domains for Adaptive Semantic Segmentation
    Liu, Yunan
    Zhang, Shanshan
    Li, Yang
    Yang, Jian
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
  • [5] Learning Sparsity of Representations with Discrete Latent Variables
    Xu, Zhao
    Rubio, Daniel Onoro
    Serra, Giuseppe
    Niepert, Mathias
    [J]. 2021 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2021,
  • [6] Learning Latent Representations for Speech Generation and Transformation
    Hsu, Wei-Ning
    Zhang, Yu
    Glass, James
    [J]. 18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION, 2017, : 1273 - 1277
  • [7] LEARNING TO FUSE LATENT REPRESENTATIONS FOR MULTIMODAL DATA
    Oyedotun, Oyebade K.
    Aouada, Djamila
    Ottersten, Bjoern
    [J]. 2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 3122 - 3126
  • [8] Bayesian Learning of Latent Representations of Language Structures
    Murawaki, Yugo
    [J]. COMPUTATIONAL LINGUISTICS, 2019, 45 (02) : 199 - 228
  • [9] Mental Representations Mediate Aversive Learning in Humans
    Qiao, Xiaolin
    Wolters, Lauren
    Howard, James D.
    [J]. BEHAVIORAL NEUROSCIENCE, 2023, : 319 - 329
  • [10] Learned Disentangled Latent Representations for Scalable Image Coding for Humans and Machines
    Ozyilkan, Ezgi
    Ulhaq, Mateen
    Choi, Hyomin
    Racape, Fabien
    [J]. 2023 DATA COMPRESSION CONFERENCE, DCC, 2023, : 42 - 51