Adaptive early classification of temporal sequences using deep reinforcement learning

Cited by: 22
Authors
Martinez, Coralie [1]
Ramasso, Emmanuel [2]
Perrin, Guillaume [1]
Rombaut, Michele [3]
Affiliations
[1] bioMerieux, Marcy Letoile, France
[2] Univ Bourgogne Franche Comte, FEMTO ST Inst, Besancon, France
[3] Univ Grenoble Alpes, GIPSA Lab, Grenoble Inst Engn, Grenoble, France
Keywords
Early classification; Adaptive prediction time; Deep reinforcement learning; Temporal sequences; Double DQN; Trade-off between accuracy and speed
DOI
10.1016/j.knosys.2019.105290
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
In this article, we address the problem of early classification (EC) of temporal sequences with adaptive prediction times. We frame EC as a sequential decision-making problem and define a partially observable Markov decision process (POMDP) that fits the competing objectives of classification earliness and accuracy. We solve the POMDP by training an agent for EC with deep reinforcement learning (DRL). The agent learns to decide adaptively between classifying an incomplete sequence now and delaying its prediction to gather more measurements. We adapt an existing DRL algorithm for batch and online learning of the agent's action-value function with a deep neural network. We propose strategies of prioritized sampling, prioritized storing and random episode initialization to address the imbalance of the agent's memory, which arises because (1) all but one of its actions terminate the process and thus (2) classification actions are less frequent than the delay action. In experiments, we show improvements in accuracy induced by our specific adaptation of the algorithm used for online learning of the agent's action-value function. Moreover, we compare two definitions of the POMDP, one based on delay reward shaping and one on reward discounting. Finally, we demonstrate that a static naive deep neural network, i.e. one trained to classify at static times, achieves a worse trade-off between accuracy and speed than the equivalent network trained with adaptive decision-making capabilities. (C) 2019 Elsevier B.V. All rights reserved.
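The classify-or-delay decision process described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation. The class name `EarlyClassificationEnv`, the reward values (+1 correct, -1 incorrect, a small `delay_penalty` per delay), the zero-padded observation, and the treatment of a delay at the final time step as a wrong classification are all assumptions made for illustration. The helper `double_dqn_target` shows the standard Double DQN target (the algorithm named in the keywords), not the paper's specific adaptation of it.

```python
import numpy as np


class EarlyClassificationEnv:
    """Toy episode: one more measurement is revealed each time the agent delays."""

    def __init__(self, sequence, label, n_classes, delay_penalty=0.01):
        self.sequence = np.asarray(sequence, dtype=float)
        self.label = int(label)
        # Actions 0..n_classes-1 classify and end the episode; the last action delays.
        self.delay_action = n_classes
        self.delay_penalty = delay_penalty  # assumed delay reward shaping
        self.t = 0

    def reset(self):
        self.t = 1  # the agent starts with a single measurement
        return self._observation()

    def _observation(self):
        # Assumption: unobserved future measurements are zero-padded.
        obs = np.zeros_like(self.sequence)
        obs[: self.t] = self.sequence[: self.t]
        return obs

    def step(self, action):
        at_end = self.t >= len(self.sequence)
        if action == self.delay_action and not at_end:
            self.t += 1
            return self._observation(), -self.delay_penalty, False
        # Any classification terminates the episode. A delay at the final
        # step cannot match the label, so it is scored as an error here.
        reward = 1.0 if action == self.label else -1.0
        return self._observation(), reward, True


def double_dqn_target(reward, done, q_online_next, q_target_next, gamma=0.99):
    """Standard Double DQN target: the online network selects the next action,
    the target network evaluates it."""
    if done:
        return reward
    best_action = int(np.argmax(q_online_next))
    return reward + gamma * q_target_next[best_action]


if __name__ == "__main__":
    env = EarlyClassificationEnv([0.1, 0.2, 0.3], label=1, n_classes=2)
    obs = env.reset()
    obs, reward, done = env.step(env.delay_action)  # wait for one more measurement
    obs, reward, done = env.step(1)                 # classify as class 1
    print(reward, done)
```

Because every action except the delay ends the episode, most stored transitions in such a setup are delays, which is the memory imbalance the abstract's prioritized sampling and storing strategies are meant to counter.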
Pages: 10