Deep Reinforcement Learning Agents for Decision Making for Gameplay

被引:0
|
作者
Heaton, Jacqueline [1 ]
Givigi, Sidney [1 ]
机构
[1] Queens Univ, Sch Comp, Kingston, ON, Canada
关键词
PLAY;
D O I
10.1109/SysCon61195.2024.10553598
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Robots are becoming more integrated into society as they become more advanced, and the programming behind them needs to continue to progress in order for the robots to be utilized to their fullest potential. Artificial Intelligence (AI) is one of the most versatile and quickly growing areas of robotic control, and has been used for a variety of different robots and tasks. One potential use of robotics and AI is in that of childhood development. Cooperative play has been shown to be a crucial part of childhood development, and for children with developmental disabilities, playing with other children may be difficult and frustrating, leading them to miss out on this important milestone. Cooperative play with robots has been shown to have positive educational and therapeutic effects on children with developmental disabilities, and so robots can be used as substitute players for children who have troubles playing with other children. To achieve this, AI algorithms must be developed that can make appropriate decisions or moves for a given game, to such an extent that the children would choose to play with the robot instead of alone. In this paper two AI agents are developed to play Menara, a cooperative tower building game. The two agents include a pillar placement agent and a tile placement agent. They implement algorithms including the method for selecting the pillars to have available to the agent during gameplay, and how many pillars the agent plans to place in a single turn. The tile placement agent was able to successfully balance a tile 62% of the time, while the pillar placement agent was able to succeed 88% of the time on the test dataset.
引用
收藏
页数:8
相关论文
共 50 条
  • [41] A reinforcement learning model of precommitment in decision making
    Kurth-Nelson, Zeb
    Redish, A. David
    FRONTIERS IN BEHAVIORAL NEUROSCIENCE, 2010, 4
  • [42] Reinforcement learning with hierarchical decision-making
    Cohen, Shahar
    Maimon, Oded
    Khmlenitsky, Evgeni
    ISDA 2006: SIXTH INTERNATIONAL CONFERENCE ON INTELLIGENT SYSTEMS DESIGN AND APPLICATIONS, VOL 3, 2006, : 177 - +
  • [43] Neural Basis of Reinforcement Learning and Decision Making
    Lee, Daeyeol
    Seo, Hyojung
    Jung, Min Whan
    ANNUAL REVIEW OF NEUROSCIENCE, VOL 35, 2012, 35 : 287 - 308
  • [44] Decision analysis and reinforcement learning in surgical decision-making
    Loftus, Tyler J.
    Filiberto, Amanda C.
    Li, Yanjun
    Balch, Jeremy
    Cook, Allyson C.
    Tighe, Patrick J.
    Efron, Philip A.
    Upchurch, Gilbert R., Jr.
    Rashidi, Parisa
    Li, Xiaolin
    Bihorac, Azra
    SURGERY, 2020, 168 (02) : 253 - 266
  • [45] Temporal encoding in deep reinforcement learning agents
    Dongyan Lin
    Ann Zixiang Huang
    Blake Aaron Richards
    Scientific Reports, 13
  • [46] Interval timing in deep reinforcement learning agents
    Deverett, Ben
    Faulkner, Ryan
    Fortunato, Meire
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32
  • [47] Perspective Taking in Deep Reinforcement Learning Agents
    Labash, Aqeel
    Aru, Jaan
    Matiisen, Tambet
    Tampuu, Ardi
    Vicente, Raul
    FRONTIERS IN COMPUTATIONAL NEUROSCIENCE, 2020, 14 (14)
  • [48] Temporal encoding in deep reinforcement learning agents
    Lin, Dongyan
    Huang, Ann Zixiang
    Richards, Blake Aaron
    SCIENTIFIC REPORTS, 2023, 13 (01)
  • [49] Goal Modelling for Deep Reinforcement Learning Agents
    Leung, Jonathan
    Shen, Zhiqi
    Zeng, Zhiwei
    Miao, Chunyan
    MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, 2021, 12975 : 271 - 286
  • [50] CASRL: Collision Avoidance with Spiking Reinforcement Learning Among Dynamic, Decision-Making Agents
    Zhang, Chengjun
    Yip, Ka-Wa
    Yang, Bo
    Zhang, Zhiyong
    Yuan, Mengwen
    Yan, Rui
    Tang, Huajin
    2024 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS 2024), 2024, : 8031 - 8038