Adaptive early classification of temporal sequences using deep reinforcement learning

被引:22
|
作者
Martinez, Coralie [1 ]
Ramasso, Emmanuel [2 ]
Perrin, Guillaume [1 ]
Rombaut, Michele [3 ]
机构
[1] bioMerieux, Marcy Letoile, France
[2] Univ Bourgogne Franche Comte, FEMTO ST Inst, Besancon, France
[3] Univ Grenoble Alpes, GIPSA Lab, Grenoble Inst Engn, Grenoble, France
关键词
Early classification; Adaptive prediction time; Deep reinforcement learning; Temporal sequences; Double DQN; Trade-off between accuracy vs. speed;
D O I
10.1016/j.knosys.2019.105290
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this article, we address the problem of early classification (EC) of temporal sequences with adaptive prediction times. We frame EC as a sequential decision making problem and we define a partially observable Markov decision process (POMDP) fitting the competitive objectives of classification earliness and accuracy. We solve the POMDP by training an agent for EC with deep reinforcement learning (DRL). The agent learns to make adaptive decisions between classifying incomplete sequences now or delaying its prediction to gather more measurements. We adapt an existing DRL algorithm for batch and online learning of the agent's action value function with a deep neural network. We propose strategies of prioritized sampling, prioritized storing and random episode initialization to address the fact that the agent's memory is unbalanced due to (1): all but one of its actions terminate the process and thus (2): actions of classification are less frequent than the action of delay. In experiments, we show improvements in accuracy induced by our specific adaptation of the algorithm used for online learning of the agents action value function. Moreover, we compare two definitions of the POMDP based on delay reward shaping against reward discounting. Finally, we demonstrate that a static naive deep neural network, i.e. trained to classify at static times, is less efficient in terms of accuracy against speed than the equivalent network trained with adaptive decision making capabilities. (C) 2019 Elsevier B.V. All rights reserved.
引用
收藏
页数:10
相关论文
共 50 条
  • [1] Dementia Classification Using Deep Reinforcement Learning for Early Diagnosis
    Hashmi, Arshad
    Barukab, Omar
    APPLIED SCIENCES-BASEL, 2023, 13 (03):
  • [2] An Adaptive Intrusion Detection System for WSN using Reinforcement Learning and Deep Classification
    Hussain, Saqib
    He, Jingsha
    Zhu, Nafei
    Mughal, Fahad Razaque
    Hussain, Muhammad Iftikhar
    Algarni, Abeer D.
    Ahmad, Sadique
    Zarie, Mira M.
    Ateya, Abdelhamied A.
    ARABIAN JOURNAL FOR SCIENCE AND ENGINEERING, 2024,
  • [3] Deep Learning based Emotion Classification with Temporal Pupillometry Sequences
    Rafique, Sidra
    Kanwal, Nadia
    Ansari, Mohammad Samar
    Asghar, Mamoona
    Akhtar, Zuhair
    INTERNATIONAL CONFERENCE ON ELECTRICAL, COMPUTER AND ENERGY TECHNOLOGIES (ICECET 2021), 2021, : 493 - 498
  • [4] Generating Adaptive Attending Behaviors using User State Classification and Deep Reinforcement Learning
    Kohari, Yoshiki
    Miura, Jun
    Oishi, Shuji
    2018 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2018, : 548 - 555
  • [5] A deep reinforcement learning approach for early classification of time series
    Martinez, C.
    Perrin, G.
    Ramasso, E.
    Rombaut, M.
    2018 26TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2018, : 2030 - 2034
  • [6] Classification with Costly Features Using Deep Reinforcement Learning
    Janisch, Jaromir
    Pevny, Tomas
    Lisy, Viliam
    THIRTY-THIRD AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FIRST INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / NINTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2019, : 3959 - 3966
  • [7] Deep Reinforcement Learning with Temporal Logics
    Hasanbeig, Mohammadhosein
    Kroening, Daniel
    Abate, Alessandro
    FORMAL MODELING AND ANALYSIS OF TIMED SYSTEMS, FORMATS 2020, 2020, 12288 : 1 - 22
  • [8] Deep Reinforcement Learning for Adaptive Learning Systems
    Li, Xiao
    Xu, Hanchen
    Zhang, Jinming
    Chang, Hua-hua
    JOURNAL OF EDUCATIONAL AND BEHAVIORAL STATISTICS, 2023, 48 (02) : 220 - 243
  • [9] Deep reinforcement learning for imbalanced classification
    Enlu Lin
    Qiong Chen
    Xiaoming Qi
    Applied Intelligence, 2020, 50 : 2488 - 2502
  • [10] Deep reinforcement learning for imbalanced classification
    Lin, Enlu
    Chen, Qiong
    Qi, Xiaoming
    APPLIED INTELLIGENCE, 2020, 50 (08) : 2488 - 2502