Attentive Relation Network for Object based Video Games

被引:0
|
作者
Deng, Hangyu [1 ]
Luo, Jia [1 ]
Hu, Jinglu [1 ]
机构
[1] Waseda Univ, Grad Sch Informat Prod & Syst, 2-7 Hibikino, Kitakyushu, Fukuoka 8080135, Japan
关键词
D O I
10.1109/IJCNN52387.2021.9533369
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Deep reinforcement learning algorithms have made great progress in video games. However, there are still some problems, such as sample inefficiency and poor generalization. In this paper, we highlight that these problems are partially caused by the inability of convolutional neural networks (CNNs) to reason with the underlying relations between the objects in the image observations. Based on this point, we try to alleviate these problems in a more efficient and explainable way, including learning the representations of objects and reasoning the relations between them with a relation network (RN). Each pixel in the feature maps is treated as an object and our model explicitly learns the relations between object pairs. The relations are summarized through an attention mechanism and then fed into the downstream fully-connected layers. In the experiments, our model is compared with baseline models in three typical object based Atari games. Under the same hyperparameter settings, our model still achieves better sample efficiency and generalization capability. Further studies throw light on the impact of hyperparameters and verify the interpretability of the model.
引用
收藏
页数:6
相关论文
共 50 条
  • [1] Attentive Sequences Recurrent Network for Social Relation Recognition from Video
    Lv, Jinna
    Wu, Bin
    Zhang, Yunlei
    Xiao, Yunpeng
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2019, E102D (12) : 2568 - 2576
  • [2] MATNet: Motion-Attentive Transition Network for Zero-Shot Video Object Segmentation
    Zhou, Tianfei
    Li, Jianwu
    Wang, Shunzhou
    Tao, Ran
    Shen, Jianbing
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2020, 29 : 8326 - 8338
  • [3] Video Summarization Using Knowledge Distillation-Based Attentive Network
    Qin, Jialin
    Yu, Hui
    Liang, Wei
    Ding, Derui
    COGNITIVE COMPUTATION, 2024, 16 (03) : 1022 - 1031
  • [4] Video summarization with a convolutional attentive adversarial network
    Liang, Guoqiang
    Lv, Yanbing
    Li, Shucheng
    Zhang, Shizhou
    Zhang, Yanning
    PATTERN RECOGNITION, 2022, 131
  • [5] Learning to Play General Video-Games via an Object Embedding Network
    Woof, William
    Chen, Ke
    PROCEEDINGS OF THE 2018 IEEE CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND GAMES (CIG'18), 2018, : 285 - 292
  • [6] Object Reconstruction Based on Attentive Recurrent Network from Single and Multiple Images
    Zishu Gao
    En Li
    Zhe Wang
    Guodong Yang
    Jiwu Lu
    Bo Ouyang
    Dawei Xu
    Zize Liang
    Neural Processing Letters, 2021, 53 : 653 - 670
  • [7] Object Reconstruction Based on Attentive Recurrent Network from Single and Multiple Images
    Gao, Zishu
    Li, En
    Wang, Zhe
    Yang, Guodong
    Lu, Jiwu
    Ouyang, Bo
    Xu, Dawei
    Liang, Zize
    NEURAL PROCESSING LETTERS, 2021, 53 (01) : 653 - 670
  • [8] An architectural model for combining spatial-based and object-based information for attentive video analysis
    Boccignone, G
    Caggiano, V
    Marcelli, A
    Napoletano, P
    Di Fiore, G
    CAMP 2005: SEVENTH INTERNATIONAL WORKSHOP ON COMPUTER ARCHITECTURE FOR MACHINE PERCEPTION , PROCEEDINGS, 2005, : 116 - 121
  • [9] Multi-interaction Network with Object Relation for Video Question Answering
    Jin, Weike
    Zhao, Zhou
    Gu, Mao
    Yu, Jun
    Xiao, Jun
    Zhuang, Yueting
    PROCEEDINGS OF THE 27TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA (MM'19), 2019, : 1193 - 1201
  • [10] A Spatio-Temporal Attentive Network for Video-Based Crowd Counting
    Avvenuti, Marco
    Bongiovanni, Marco
    Ciampi, Luca
    Falchi, Fabrizio
    Gennaro, Claudio
    Messina, Nicola
    2022 27TH IEEE SYMPOSIUM ON COMPUTERS AND COMMUNICATIONS (IEEE ISCC 2022), 2022,