PREDICTABILITY ANALYZING: DEEP REINFORCEMENT LEARNING FOR EARLY ACTION RECOGNITION

Cited by: 2
Authors
Chen, Xiaokai [1, 2]
Gao, Ke [1]
Cao, Juan [1]
Affiliations
[1] Chinese Acad Sci, Inst Comp Technol, Beijing, Peoples R China
[2] Univ Chinese Acad Sci, Beijing, Peoples R China
Funding
National Key Research and Development Program of China
Keywords
Early action recognition; Predictability; Reinforcement learning
DOI
10.1109/ICME.2019.00169
Chinese Library Classification number
TP31 [Computer Software]
Discipline classification code
081202; 0835
Abstract
Early action recognition aims to infer ongoing activities from partial videos as early as possible, whereas conventional action recognition relies on fully observed activities. Observations show that the predictability of different activity subsequences varies widely; however, most existing work fails to fully exploit this phenomenon. We define the predictability of an activity subsequence as its capacity to support early and accurate recognition. A predictability-based early action recognition framework (PEAR) is established to exploit predictability information for early recognition. It consists of a predictability evaluator and a classifier. Because fine-grained supervision is lacking, we develop a reinforcement-learning-based strategy to optimize the evaluator, driven by a recognizability reward and an early reward. With the predictability estimated by the evaluator, the classifier learns discriminative representations of subsequences to perform early action recognition without sacrificing much accuracy. Experiments on two benchmark datasets demonstrate that the proposed approach significantly outperforms existing methods.
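The abstract names two reward terms, a recognizability reward and an early reward, but does not give their formulas. The following is a minimal sketch of how such a pair could be combined and fed into a REINFORCE-style policy update. Every name and constant here (`combined_reward`, `early_weight`, `reinforce_step`, the +1/-1 recognizability values, the single-parameter Bernoulli policy) is an assumption made for illustration and is not taken from the paper.

```python
import math


def combined_reward(correct: bool, observed_ratio: float,
                    early_weight: float = 0.5) -> float:
    """Hypothetical reward: a recognizability term plus a weighted earliness term.

    correct        -- whether the classifier labelled the subsequence correctly
    observed_ratio -- fraction of the video observed so far, in [0, 1]
    early_weight   -- assumed trade-off between accuracy and earliness
    """
    if not 0.0 <= observed_ratio <= 1.0:
        raise ValueError("observed_ratio must lie in [0, 1]")
    recognizability = 1.0 if correct else -1.0  # assumed +1 / -1 scheme
    earliness = 1.0 - observed_ratio            # larger when less video was seen
    return recognizability + early_weight * earliness


def sigmoid(x: float) -> float:
    return 1.0 / (1.0 + math.exp(-x))


def reinforce_step(theta: float, feature: float, action: int, reward: float,
                   baseline: float = 0.0, lr: float = 0.1) -> float:
    """One REINFORCE update for a toy Bernoulli policy p(select) = sigmoid(theta * feature).

    action is 1 if the subsequence was selected and 0 otherwise.
    """
    p = sigmoid(theta * feature)
    grad_log_prob = (action - p) * feature  # d/dtheta of log pi(action)
    return theta + lr * (reward - baseline) * grad_log_prob


if __name__ == "__main__":
    r = combined_reward(correct=True, observed_ratio=0.2)
    theta = reinforce_step(theta=0.0, feature=1.0, action=1, reward=r)
    print(r, theta)
```

In this toy setup a correct prediction made after seeing little of the video earns the largest reward, so the update raises the probability of selecting that kind of subsequence. A real evaluator would use a learned network over video features rather than a single scalar parameter.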
Pages: 958-963
Number of pages: 6