Benchmarking End-to-End Behavioural Cloning on Video Games

被引:0
|
作者
Kanervisto, Anssi [1 ]
Pussinen, Joonas [1 ]
Hautamaki, Ville [1 ]
机构
[1] Univ Eastern Finland, Sch Comp, Joensuu, Finland
基金
芬兰科学院;
关键词
video game; behavioral cloning; imitation learning; reinforcement learning; learning environment; neural networks; LEVEL;
D O I
10.1109/cog47356.2020.9231600
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Behavioural cloning, where a computer is taught to perform a task based on demonstrations, has been successfully applied to various video games and robotics tasks, with and without reinforcement learning. This also includes end-to-end approaches, where a computer plays a video game like humans do: by looking at the image displayed on the screen, and sending keystrokes to the game. As a general approach to playing video games, this has many inviting properties: no need for specialized modifications to the game, no lengthy training sessions and the ability to re-use the same tools across different games. However, related work includes game-specific engineering to achieve the results. We take a step towards a general approach and study the general applicability of behavioural cloning on twelve video games, including six modern video games (published after 2010), by using human demonstrations as training data. Our results show that these agents cannot match humans in raw performance but do learn basic dynamics and rules. We also demonstrate how the quality of the data matters, and how recording data from humans is subject to a state-action mismatch, due to human reflexes.
引用
收藏
页码:558 / 565
页数:8
相关论文
共 50 条
  • [21] End-to-End Dense Video Captioning with Masked Transformer
    Zhou, Luowei
    Zhou, Yingbo
    Corso, Jason J.
    Socher, Richard
    Xiong, Caiming
    2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 8739 - 8748
  • [22] End-to-end video compression for surveillance and conference videos
    Wang, Shenhao
    Zhao, Yu
    Gao, Han
    Ye, Mao
    Li, Shuai
    MULTIMEDIA TOOLS AND APPLICATIONS, 2022, 81 (29) : 42713 - 42730
  • [23] End-to-End Video Captioning with Multitask Reinforcement Learning
    Li, Lijun
    Gong, Boqing
    2019 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2019, : 339 - 348
  • [24] End-to-End Learning of Motion Representation for Video Understanding
    Fan, Lijie
    Huang, Wenbing
    Gan, Chuang
    Ermon, Stefano
    Gong, Boqing
    Huang, Junzhou
    2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 6016 - 6025
  • [25] End-to-End Dense Video Captioning with Parallel Decoding
    Wang, Teng
    Zhang, Ruimao
    Lu, Zhichao
    Zheng, Feng
    Cheng, Ran
    Luo, Ping
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 6827 - 6837
  • [26] An end-to-end generative framework for video segmentation and recognition
    Kuehne, Hilde
    Gall, Juergen
    Serre, Thomas
    2016 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2016), 2016,
  • [27] DVC: An End-to-end Deep Video Compression Framework
    Lu, Guo
    Ouyang, Wanli
    Xu, Dong
    Zhang, Xiaoyun
    Cai, Chunlei
    Gao, Zhiyong
    2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 10998 - 11007
  • [28] End-to-end Generative Pretraining for Multimodal Video Captioning
    Seo, Paul Hongsuck
    Nagrani, Arsha
    Arnab, Anurag
    Schmid, Cordelia
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 17938 - 17947
  • [29] Network analysis on Skype end-to-end video quality
    Exarchakos, George
    Druda, Luca
    Menkovski, Vlado
    Liotta, Antonio
    INTERNATIONAL JOURNAL OF PERVASIVE COMPUTING AND COMMUNICATIONS, 2015, 11 (01) : 17 - +
  • [30] An end-to-end delivery scheme for robust video streaming
    Ding, JW
    Huang, YM
    Chu, CC
    ADVANCES IN MUTLIMEDIA INFORMATION PROCESSING - PCM 2001, PROCEEDINGS, 2001, 2195 : 375 - 382