Fast Retinomorphic Event-Driven Representations for Video Gameplay and Action Recognition

被引:4
|
作者
Chen, Huaijin [1 ,2 ]
Liu, Wanjia [3 ,4 ]
Goel, Rishab [5 ,6 ]
Lua, Rhonald C. [7 ]
Mittal, Siddharth [8 ,9 ]
Huang, Yuzhong [10 ,11 ]
Veeraraghavan, Ashok [1 ]
Patel, Ankit B. [7 ]
机构
[1] Rice Univ, Dept Elect & Comp Engn, Houston, TX 77005 USA
[2] SenseBrain Technol LLC, San Jose, CA 95131 USA
[3] Rice Univ, Dept Comp Sci, Houston, TX 77005 USA
[4] Google Inc, Mountain View, CA 94043 USA
[5] Indian Inst Technol Delhi, New Delhi 110016, India
[6] Borealis AI, Montreal, PQ H2S 3H1, Canada
[7] Baylor Coll Med, Dept Neurosci, Houston, TX 77030 USA
[8] Indian Inst Technol Kanpur, Kanpur 208016, Uttar Pradesh, India
[9] Quadeye, Gurgaon 122009, India
[10] Olin Coll Engn, Needham, MA 02492 USA
[11] Kensho Technol, Cambridge, MA 02138 USA
基金
美国国家科学基金会;
关键词
Smart cameras; retina; real-time systems; streaming media; cells (biology); reinforcement learning; video signal processing; video; ON-CENTER; CELLS; CONTRAST;
D O I
10.1109/TCI.2019.2948755
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Good temporal representations are crucial for video understanding, and the state-of-the-art video recognition framework is based on two-stream networks. In such framework, besides the regular ConvNets responsible for RGB frame inputs, a second network is introduced to handle the temporal representation, usually the optical flow (OF). However, OF or other task-oriented flow is computationally costly, and is thus typically pre-computed. Critically, this prevents the two-stream approach from being applied to reinforcement learning (RL) applications such as video game playing, where the next state depends on current state and action choices. Inspired by the early vision systems of mammals and insects, we propose a fast event-driven representation (EDR) that models several major properties of early retinal circuits: (1) log-arithmic input response, (2) multi-timescale temporal smoothing to filter noise, and (3) bipolar (ON/OFF) pathways for primitive event detection. Trading off the directional information for fast speed (>9000 fps), EDR enables fast real-time inference/learning in video applications that require interaction between an agent and the world such as game-playing, virtual robotics, and domain adaptation. In this vein, we use EDR to demonstrate performance improvements over state-of-the-art reinforcement learning algorithms for Atari games, something that has not been possible with pre-computed OF. Moreover, with UCF-101 video action recognition experiments, we show that EDR performs near state-of-the-art in accuracy while achieving a 1,500x speedup in input representation processing, as compared to optical flow.
引用
收藏
页码:276 / 290
页数:15
相关论文
共 50 条
  • [31] FAST AC TRAIN EMERGENCY RESCHEDULING USING AN EVENT-DRIVEN APPROACH
    CHANG, CS
    HO, SL
    CHAN, TT
    LEE, KK
    IEE PROCEEDINGS-B ELECTRIC POWER APPLICATIONS, 1993, 140 (04): : 281 - 288
  • [32] High Performance Design for Redis with Fast Event-Driven RDMA RPCs
    Qi, Xuecheng
    Hu, Huiqi
    Wei, Xing
    Huang, Chengcheng
    Zhou, Xuan
    Zhou, Aoying
    DATABASE SYSTEMS FOR ADVANCED APPLICATIONS (DASFAA 2020), PT I, 2020, 12112 : 195 - 210
  • [33] Event-driven Fast Frequency Response Control Method for Generator Unit
    Cao Y.
    Zhang H.
    Zhang Y.
    Li C.
    Li W.
    Dianli Xitong Zidonghua/Automation of Electric Power Systems, 2021, 45 (19): : 148 - 154
  • [34] A Fast and Accurate True Event-driven Phase Locked Loop Model
    Beyerstedt, Christoph
    Meier, Jonas
    Speicher, Fabian
    Scholl, Markus
    Blase, Daniel
    Wunderlich, Ralf
    Heinen, Stefan
    2020 27TH IEEE INTERNATIONAL CONFERENCE ON ELECTRONICS, CIRCUITS AND SYSTEMS (ICECS), 2020,
  • [35] A top-down event-driven approach for concurrent activity recognition
    Voulodimos, Athanasios S.
    Kosmopoulos, Dimitrios I.
    Doulamis, Nikolaos D.
    Varvarigou, Theodora A.
    MULTIMEDIA TOOLS AND APPLICATIONS, 2014, 69 (02) : 293 - 311
  • [36] A homeostatic gain control mechanism to improve event-driven object recognition
    Grimaldi, Antoine
    Boutin, Victor
    Perrinet, Laurent
    Ieng, Sio-Hoi
    Benosman, Ryad
    2021 INTERNATIONAL CONFERENCE ON CONTENT-BASED MULTIMEDIA INDEXING (CBMI), 2021, : 129 - 134
  • [37] An Event-Driven Spatiotemporal Domain Adaptation Method for DVS Gesture Recognition
    Zhang, Yuhan
    Wu, Lindong
    He, Weihua
    Zhang, Ziyang
    Yang, Chen
    Wang, Yaoyuan
    Wang, Ying
    Tian, Kun
    Liao, Jianxing
    Yang, Ying
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS II-EXPRESS BRIEFS, 2022, 69 (03) : 1332 - 1336
  • [38] Event-driven Gait Recognition Method Based on Dynamic Temporal Segmentation
    Lai, Xiaochen
    Zhou, Guoqiao
    Lin, Chi
    Yim, Kangbin
    2015 10TH INTERNATIONAL CONFERENCE ON BROADBAND AND WIRELESS COMPUTING, COMMUNICATION AND APPLICATIONS (BWCCA 2015), 2015, : 522 - 527
  • [39] An event-driven plan recognition algorithm based on intuitionistic fuzzy theory
    Wang, Xiaofan
    Wang, Lei
    Li, Shengji
    Wang, Jin
    JOURNAL OF SUPERCOMPUTING, 2018, 74 (12): : 6923 - 6938
  • [40] ERAR: An event-driven approach for real-time activity recognition
    Ye, Chengpeng
    Xia, Ye
    Sun, Yunchuan
    Wang, Shenling
    Yan, Hongli
    Mehmood, Rashid
    2015 INTERNATIONAL CONFERENCE ON IDENTIFICATION, INFORMATION, AND KNOWLEDGE IN THE INTERNET OF THINGS (IIKI), 2015, : 288 - 293