Fast Retinomorphic Event-Driven Representations for Video Gameplay and Action Recognition

被引:4
|
作者
Chen, Huaijin [1 ,2 ]
Liu, Wanjia [3 ,4 ]
Goel, Rishab [5 ,6 ]
Lua, Rhonald C. [7 ]
Mittal, Siddharth [8 ,9 ]
Huang, Yuzhong [10 ,11 ]
Veeraraghavan, Ashok [1 ]
Patel, Ankit B. [7 ]
机构
[1] Rice Univ, Dept Elect & Comp Engn, Houston, TX 77005 USA
[2] SenseBrain Technol LLC, San Jose, CA 95131 USA
[3] Rice Univ, Dept Comp Sci, Houston, TX 77005 USA
[4] Google Inc, Mountain View, CA 94043 USA
[5] Indian Inst Technol Delhi, New Delhi 110016, India
[6] Borealis AI, Montreal, PQ H2S 3H1, Canada
[7] Baylor Coll Med, Dept Neurosci, Houston, TX 77030 USA
[8] Indian Inst Technol Kanpur, Kanpur 208016, Uttar Pradesh, India
[9] Quadeye, Gurgaon 122009, India
[10] Olin Coll Engn, Needham, MA 02492 USA
[11] Kensho Technol, Cambridge, MA 02138 USA
基金
美国国家科学基金会;
关键词
Smart cameras; retina; real-time systems; streaming media; cells (biology); reinforcement learning; video signal processing; video; ON-CENTER; CELLS; CONTRAST;
D O I
10.1109/TCI.2019.2948755
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Good temporal representations are crucial for video understanding, and the state-of-the-art video recognition framework is based on two-stream networks. In such framework, besides the regular ConvNets responsible for RGB frame inputs, a second network is introduced to handle the temporal representation, usually the optical flow (OF). However, OF or other task-oriented flow is computationally costly, and is thus typically pre-computed. Critically, this prevents the two-stream approach from being applied to reinforcement learning (RL) applications such as video game playing, where the next state depends on current state and action choices. Inspired by the early vision systems of mammals and insects, we propose a fast event-driven representation (EDR) that models several major properties of early retinal circuits: (1) log-arithmic input response, (2) multi-timescale temporal smoothing to filter noise, and (3) bipolar (ON/OFF) pathways for primitive event detection. Trading off the directional information for fast speed (>9000 fps), EDR enables fast real-time inference/learning in video applications that require interaction between an agent and the world such as game-playing, virtual robotics, and domain adaptation. In this vein, we use EDR to demonstrate performance improvements over state-of-the-art reinforcement learning algorithms for Atari games, something that has not been possible with pre-computed OF. Moreover, with UCF-101 video action recognition experiments, we show that EDR performs near state-of-the-art in accuracy while achieving a 1,500x speedup in input representation processing, as compared to optical flow.
引用
收藏
页码:276 / 290
页数:15
相关论文
共 50 条
  • [41] An event-driven plan recognition algorithm based on intuitionistic fuzzy theory
    Xiaofan Wang
    Lei Wang
    Shengji Li
    Jin Wang
    The Journal of Supercomputing, 2018, 74 : 6923 - 6938
  • [42] A top-down event-driven approach for concurrent activity recognition
    Athanasios S. Voulodimos
    Dimitrios I. Kosmopoulos
    Nikolaos D. Doulamis
    Theodora A. Varvarigou
    Multimedia Tools and Applications, 2014, 69 : 293 - 311
  • [43] Complete Video-Level Representations for Action Recognition
    Li, Min
    Bai, Ruwen
    Meng, Bo
    Ren, Junxing
    Jiang, Miao
    Yang, Yang
    Li, Linghan
    Du, Hong
    IEEE ACCESS, 2021, 9 : 92134 - 92142
  • [44] Sparse Representations of Clustered Video Shots for Action Recognition
    Ding, Yi
    Lu, Hongtao
    2013 3RD INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND NETWORK TECHNOLOGY (ICCSNT), 2013, : 497 - 501
  • [45] Fast Event-based Harris Corner Detection Exploiting the Advantages of Event-driven Cameras
    Vasco, Valentina
    Glover, Arren
    Bartolozzi, Chiara
    2016 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS 2016), 2016, : 4144 - 4149
  • [46] Event-driven Stitching for Tile-based Live 360 Video Streaming
    Chen, Bo
    Yan, Zhisheng
    Jin, Haiming
    Nahrstedt, Klara
    PROCEEDINGS OF THE 10TH ACM MULTIMEDIA SYSTEMS CONFERENCE (ACM MMSYS'19), 2019, : 1 - 12
  • [47] An event-driven sports video adaptation for the MPEG-21 DIA framework
    Xu, Min
    Li, Jiaming
    Hu, Yiqun
    Chia, Liang-Tien
    Lee, Bu-Sung
    Rajan, Deepu
    Cai, Jianfei
    2006 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO - ICME 2006, VOLS 1-5, PROCEEDINGS, 2006, : 1245 - +
  • [48] Fast Operating PLC Based on Event-Driven Control Program Tasks Execution
    Chmiel, Miroslaw
    Hryniewicz, Edward
    Milik, Adam
    INTERNATIONAL JOURNAL OF ELECTRONICS AND TELECOMMUNICATIONS, 2009, 55 (02) : 269 - 286
  • [49] KnightSim: A Fast Discrete Event-Driven Simulation Methodology for Computer Architectural Simulation
    Giles, Christopher E.
    Peterson, Christina L.
    Heinrich, Mark A.
    IEEE TRANSACTIONS ON COMPUTERS, 2020, 69 (01) : 65 - 71
  • [50] A Novel Event-Driven Spiking Convolutional Neural Network for Electromyography Pattern Recognition
    Xu, Mengjuan
    Chen, Xiang
    Sun, Antong
    Zhang, Xu
    Chen, Xun
    IEEE TRANSACTIONS ON BIOMEDICAL ENGINEERING, 2023, 70 (09) : 2604 - 2615