Adaptive early classification of temporal sequences using deep reinforcement learning

Cited by: 22

Authors:

Martinez, Coralie [1]

Ramasso, Emmanuel [2]

Perrin, Guillaume [1]

Rombaut, Michele [3]

Affiliations:

[1] bioMerieux, Marcy Letoile, France

[2] Univ Bourgogne Franche Comte, FEMTO ST Inst, Besancon, France

[3] Univ Grenoble Alpes, GIPSA Lab, Grenoble Inst Engn, Grenoble, France

Source:

KNOWLEDGE-BASED SYSTEMS | 2020 / Vol. 190

Keywords:

Early classification; Adaptive prediction time; Deep reinforcement learning; Temporal sequences; Double DQN; Trade-off between accuracy vs. speed;

DOI:

10.1016/j.knosys.2019.105290

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification code:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

In this article, we address the problem of early classification (EC) of temporal sequences with adaptive prediction times. We frame EC as a sequential decision-making problem and we define a partially observable Markov decision process (POMDP) fitting the competing objectives of classification earliness and accuracy. We solve the POMDP by training an agent for EC with deep reinforcement learning (DRL). The agent learns to make adaptive decisions between classifying incomplete sequences now or delaying its prediction to gather more measurements. We adapt an existing DRL algorithm for batch and online learning of the agent's action-value function with a deep neural network. We propose strategies of prioritized sampling, prioritized storing and random episode initialization to address the fact that the agent's memory is unbalanced because (1) all but one of its actions terminate the process, and thus (2) classification actions are less frequent than the delay action. In experiments, we show improvements in accuracy induced by our specific adaptation of the algorithm used for online learning of the agent's action-value function. Moreover, we compare two definitions of the POMDP, one based on delay reward shaping and one on reward discounting. Finally, we demonstrate that a static naive deep neural network, i.e. one trained to classify at static times, is less efficient in terms of accuracy against speed than the equivalent network trained with adaptive decision-making capabilities. © 2019 Elsevier B.V. All rights reserved.
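
The decision process described in the abstract (classify now, which ends the episode, or delay to see one more measurement) can be illustrated with a minimal sketch. This is not the authors' implementation: the class name `EarlyClassificationEpisode`, the `WAIT` sentinel, the reward values (+1, -1, and a small negative delay reward), and the rule that waiting on the final measurement counts as a misclassification are all assumptions made for illustration.

```python
WAIT = -1  # hypothetical sentinel for the "delay" action; class labels are 0, 1, 2, ...


class EarlyClassificationEpisode:
    """One episode over a single sequence, revealed one measurement at a time.

    Assumed reward scheme (illustrative only):
      * delay:                  small negative reward (reward-shaping variant)
      * correct classification: +1, episode ends
      * wrong classification:   -1, episode ends
    """

    def __init__(self, sequence, label, delay_reward=-0.01, start=1):
        self.sequence = list(sequence)
        self.label = label
        self.delay_reward = delay_reward
        self.t = start      # number of measurements observed so far
        self.done = False

    def observation(self):
        # The agent only sees the prefix of the sequence (partial observability).
        return self.sequence[: self.t]

    def step(self, action):
        if self.done:
            raise RuntimeError("episode already terminated")
        at_end = self.t >= len(self.sequence)
        if action == WAIT and not at_end:
            # Delay: reveal one more measurement; the episode continues.
            self.t += 1
            return self.observation(), self.delay_reward, False
        # Any classification action terminates the episode. Waiting on the
        # last measurement is treated as a misclassification (assumption).
        self.done = True
        reward = 1.0 if action == self.label else -1.0
        return self.observation(), reward, True


episode = EarlyClassificationEpisode([0.1, 0.4, 0.9], label=1)
obs, reward, done = episode.step(WAIT)  # gather one more measurement
obs, reward, done = episode.step(1)     # classify as class 1, ending the episode
```

Because every action except `WAIT` ends the episode, transitions stored from such episodes are dominated by the delay action, which is the memory imbalance the abstract says its sampling, storing, and initialization strategies are meant to address. The `start` parameter hints at how an episode could begin from a later prefix, in the spirit of random episode initialization.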


Pages: 10
