Adaptive early classification of temporal sequences using deep reinforcement learning

Cited by: 22
Authors
Martinez, Coralie [1]
Ramasso, Emmanuel [2]
Perrin, Guillaume [1]
Rombaut, Michele [3]
Affiliations
[1] bioMerieux, Marcy Letoile, France
[2] Univ Bourgogne Franche Comte, FEMTO ST Inst, Besancon, France
[3] Univ Grenoble Alpes, GIPSA Lab, Grenoble Inst Engn, Grenoble, France
Keywords
Early classification; Adaptive prediction time; Deep reinforcement learning; Temporal sequences; Double DQN; Trade-off between accuracy vs. speed;
DOI
10.1016/j.knosys.2019.105290
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
In this article, we address the problem of early classification (EC) of temporal sequences with adaptive prediction times. We frame EC as a sequential decision-making problem and define a partially observable Markov decision process (POMDP) that captures the competing objectives of classification earliness and accuracy. We solve the POMDP by training an agent for EC with deep reinforcement learning (DRL). The agent learns to decide adaptively between classifying an incomplete sequence now and delaying its prediction to gather more measurements. We adapt an existing DRL algorithm for batch and online learning of the agent's action-value function with a deep neural network. We propose strategies of prioritized sampling, prioritized storing and random episode initialization to address the imbalance in the agent's memory, which arises because (1) all but one of its actions terminate the process and thus (2) classification actions are less frequent than the delay action. In experiments, we show improvements in accuracy induced by our specific adaptation of the algorithm used for online learning of the agent's action-value function. Moreover, we compare two definitions of the POMDP, one based on delay reward shaping and the other on reward discounting. Finally, we demonstrate that a static naive deep neural network, i.e. one trained to classify at static times, achieves a worse trade-off between accuracy and speed than the equivalent network trained with adaptive decision-making capabilities. (C) 2019 Elsevier B.V. All rights reserved.
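The decision process described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the function names `ec_step` and `double_dqn_targets`, the reward values (+1, -1, and a small per-step `delay_penalty`), and the handling of a delay after the last measurement are all assumptions chosen for illustration. The second function computes the standard Double DQN bootstrap target, the algorithm named in the keywords; the paper's specific adaptation of it is not reproduced here.

```python
import numpy as np

def ec_step(sequence, label, t, action, n_classes, delay_penalty=0.01):
    """One transition of a toy early-classification process (illustrative only).

    t is the number of measurements observed so far. Actions 0..n_classes-1
    classify and terminate the episode; action == n_classes delays and
    reveals one more measurement. Reward values are assumptions.
    Returns (observation, reward, done).
    """
    if action < n_classes:
        # Classification is terminal: every action except "delay" ends the episode.
        reward = 1.0 if action == label else -1.0
        return sequence[:t], reward, True
    if t >= len(sequence):
        # Assumption: delaying with nothing left to observe counts as a miss.
        return sequence[:t], -1.0, True
    # Delay: pay a small penalty and observe one more measurement.
    return sequence[:t + 1], -delay_penalty, False

def double_dqn_targets(rewards, dones, q_online_next, q_target_next, gamma=0.99):
    """Standard Double DQN targets for a batch of transitions.

    The online network selects the next action; the target network evaluates it.
    q_*_next have shape (batch, n_actions); dones holds 1.0 for terminal steps.
    """
    best = np.argmax(q_online_next, axis=1)
    bootstrap = q_target_next[np.arange(len(best)), best]
    return rewards + gamma * (1.0 - dones) * bootstrap
```

Because only the delay action continues an episode, a replay memory filled by such a process holds many more delay transitions than classification transitions, which is the imbalance the abstract's prioritized sampling and storing strategies are meant to address.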
Pages: 10