Benchmarking End-to-End Behavioural Cloning on Video Games

被引:0
|
作者
Kanervisto, Anssi [1 ]
Pussinen, Joonas [1 ]
Hautamaki, Ville [1 ]
机构
[1] Univ Eastern Finland, Sch Comp, Joensuu, Finland
基金
芬兰科学院;
关键词
video game; behavioral cloning; imitation learning; reinforcement learning; learning environment; neural networks; LEVEL;
D O I
10.1109/cog47356.2020.9231600
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Behavioural cloning, where a computer is taught to perform a task based on demonstrations, has been successfully applied to various video games and robotics tasks, with and without reinforcement learning. This also includes end-to-end approaches, where a computer plays a video game like humans do: by looking at the image displayed on the screen, and sending keystrokes to the game. As a general approach to playing video games, this has many inviting properties: no need for specialized modifications to the game, no lengthy training sessions and the ability to re-use the same tools across different games. However, related work includes game-specific engineering to achieve the results. We take a step towards a general approach and study the general applicability of behavioural cloning on twelve video games, including six modern video games (published after 2010), by using human demonstrations as training data. Our results show that these agents cannot match humans in raw performance but do learn basic dynamics and rules. We also demonstrate how the quality of the data matters, and how recording data from humans is subject to a state-action mismatch, due to human reflexes.
引用
收藏
页码:558 / 565
页数:8
相关论文
共 50 条
  • [1] End-to-End Video Captioning
    Olivastri, Silvio
    Singh, Gurkirt
    Cuzzolin, Fabio
    2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW), 2019, : 1474 - 1482
  • [2] OpenRTiST: End-to-End Benchmarking for Edge Computing
    George, Shilpa
    Eiszler, Thomas
    Iyengar, Roger
    Turki, Haithem
    Feng, Ziqiang
    Wang, Junjue
    Satyanarayanan, Mahadev
    Pillai, Padmanabhan
    IEEE PERVASIVE COMPUTING, 2020, 19 (04) : 10 - 18
  • [3] End-To-End Security for Video Distribution
    Boho, Andras
    Van Wallendael, Glenn
    Dooms, Ann
    De Cock, Jan
    Braeckman, Geert
    Schelkens, Peter
    Preneel, Bart
    Van de Walle, Rik
    IEEE SIGNAL PROCESSING MAGAZINE, 2013, 30 (02) : 97 - 107
  • [4] Retargeting Video With an End-to-End Framework
    Le, Thi-Ngoc-Hanh
    Huang, HuiGuang
    Chen, Yi-Ru
    Lee, Tong-Yee
    IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, 2024, 30 (09) : 6164 - 6176
  • [5] End-to-end Distributed Video Coding
    Zhou, Junwei
    Lv, Ting
    Yi, XiangBo
    DCC 2022: 2022 DATA COMPRESSION CONFERENCE (DCC), 2022, : 496 - 496
  • [6] Benchmarking in Virtual Desktops for End-to-End Performance Traceability
    Nguyen, Trung
    Calyam, Prasad
    Antequera, Ronny Bazan
    PROCEEDINGS OF THE 2015 IFIP/IEEE INTERNATIONAL SYMPOSIUM ON INTEGRATED NETWORK MANAGEMENT (IM), 2015, : 1268 - 1273
  • [7] An Open Architecture for End-to-End Document Analysis Benchmarking
    Lamiroy, Bart
    Lopresti, Daniel
    11TH INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR 2011), 2011, : 42 - 47
  • [8] End-to-End Cloud Application Cloning With Ditto
    Liang, Hmingyu
    Gan, Yu
    Li, Yueying
    Torres, Carlos
    Dhanotia, Abhishek
    Ketkar, Mahesh
    Delimitrou, Christina
    IEEE MICRO, 2024, 44 (04) : 34 - 43
  • [9] End-to-End Transport for Video QoE Fairness
    Nathan, Vikram
    Sivaraman, Vibhaalakshmi
    Addanki, Ravichandra
    Khani, Mehrdad
    Goyal, Prateesh
    Alizadeh, Mohammad
    SIGCOMM '19 - PROCEEDINGS OF THE ACM SPECIAL INTEREST GROUP ON DATA COMMUNICATION, 2019, : 408 - 423
  • [10] End-to-End Video Text Spotting with Transformer
    Wu, Weijia
    Cai, Yuanqiang
    Shen, Chunhua
    Zhang, Debing
    Fu, Ying
    Zhou, Hong
    Luo, Ping
    INTERNATIONAL JOURNAL OF COMPUTER VISION, 2024, 132 (09) : 4019 - 4035