Explainability of Deep Reinforcement Learning Method with Drones

Cited by: 1
Authors
Cetin, Ender [1]
Barrado, Cristina [1]
Pastor, Enric [1]
Affiliations
[1] Tech Univ Catalonia, UPC BarcelonaTech, Comp Architecture Dept, Castelldefels, Spain
Keywords
Explainable AI; Deep Reinforcement Learning; Counter-Drone; UAV; Drones; Double-DQN; Dueling Network; Prioritised Experience Replay; AirSim
DOI
10.1109/DASC58513.2023.10311156
Chinese Library Classification number
V [Aeronautics, Astronautics]
Discipline classification code
08; 0825
Abstract
Recent advances in artificial intelligence (AI) have shown that AI algorithms become very powerful as AI models grow more complex. As a result, both users and the engineers who develop these algorithms have a hard time explaining how an AI model arrives at a specific result. This phenomenon is known as the "black box" problem, and it undermines end-users' confidence in AI systems. In this research, the explainability of deep reinforcement learning is investigated for counter-drone systems. To counter a drone, a deep reinforcement learning method is proposed: a double deep Q-network with dueling architecture and prioritized experience replay. A counter-drone system is expected to catch the target as quickly as possible; otherwise, the target can be gone in a short time. To understand how the agent performs more quickly and accurately, figures representing rewards, drone locations, crash positions, and the distribution of actions are analyzed and compared. For example, the positions of the drones in a successful training episode can be analyzed in terms of the actions the agent performed and the rewards it received in that episode. In addition, the actions the agent took in individual episodes are compared with the action frequencies over the whole of training. By the end of training, the agent selects the actions that were dominant throughout training; at the beginning of training, however, the distribution of actions is not correlated with the actions selected at the end. The results show that the agent follows different flight paths, using different actions, to catch the target drone in different episodes and with different models. Finally, the generation of a saliency map is investigated to identify the critical regions of an input image that influence the predictions of the DQN agent, by evaluating the gradients of the model's output with respect to both the image and scalar inputs.
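The three components named in the abstract (dueling architecture, double DQN, and prioritized experience replay) can be illustrated with a minimal NumPy sketch based on their standard textbook formulations. This is not the authors' implementation: the function names, array shapes, and the values of gamma, alpha, and eps are assumptions chosen only for illustration.

```python
import numpy as np


def dueling_q_values(value, advantage):
    """Dueling aggregation: Q(s, a) = V(s) + A(s, a) - mean_a A(s, a).

    value has shape (batch, 1); advantage has shape (batch, n_actions).
    """
    return value + advantage - advantage.mean(axis=1, keepdims=True)


def double_dqn_targets(rewards, dones, q_next_online, q_next_target, gamma=0.99):
    """Double DQN target: the online network picks the next action,
    the target network evaluates it. gamma=0.99 is an assumed default."""
    best_actions = np.argmax(q_next_online, axis=1)
    next_values = q_next_target[np.arange(len(rewards)), best_actions]
    # Terminal transitions (dones == 1) do not bootstrap from the next state.
    return rewards + gamma * (1.0 - dones) * next_values


def per_sampling_probs(td_errors, alpha=0.6, eps=1e-6):
    """Proportional prioritized replay: P(i) = p_i^alpha / sum_k p_k^alpha,
    with p_i = |TD error_i| + eps. alpha and eps are assumed values."""
    priorities = (np.abs(td_errors) + eps) ** alpha
    return priorities / priorities.sum()


if __name__ == "__main__":
    q = dueling_q_values(np.array([[1.0]]), np.array([[1.0, 3.0]]))
    y = double_dqn_targets(
        rewards=np.array([1.0, 1.0]),
        dones=np.array([0.0, 1.0]),
        q_next_online=np.array([[0.0, 1.0], [1.0, 0.0]]),
        q_next_target=np.array([[5.0, 2.0], [3.0, 4.0]]),
        gamma=0.5,
    )
    print(q, y, per_sampling_probs(np.array([0.5, 2.0, 0.1])))
```

In a full agent, the value and advantage arrays would come from the two heads of a neural network, and the sampling probabilities would be used to draw transitions from the replay buffer; both are omitted here to keep the sketch self-contained.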
Pages: 9