Visual Event-Based Egocentric Human Action Recognition

Cited by: 1
Authors
Moreno-Rodriguez, Francisco J. [1]
Traver, V. Javier [2]
Barranco, Francisco [3]
Dimiccoli, Mariella [4]
Pla, Filiberto [2]
Affiliations
[1] Univ Jaume I, Castellon de La Plana, Spain
[2] Univ Jaume I, Inst New Imaging Technol, Castellon de La Plana, Spain
[3] Univ Granada, CITIC, Dept Comp Architecture & Technol, Granada, Spain
[4] Inst Robot & Informat Ind CSIC UPC, Barcelona, Spain
Keywords
Egocentric view; Action recognition; Event vision
DOI
10.1007/978-3-031-04881-4_32
Chinese Library Classification (CLC)
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
This paper lies at the intersection of three research areas: human action recognition, egocentric vision, and visual event-based sensors. The main goal is to compare egocentric action recognition performance under two visual sources: conventional images and event-based visual data. In this work, the events, whether triggered by asynchronous event sensors or produced by simulating them, are spatio-temporally aggregated into event frames (a grid-like representation). This allows exactly the same neural model to be used for both visual sources, which makes a fair comparison easier. Specifically, a hybrid neural architecture combining a convolutional neural network and a recurrent network is used. It is empirically found that this general architecture works for both conventional gray-level frames and event frames. This finding is relevant because it reveals that no modification or adaptation is strictly required to deal with event data for egocentric action classification. Interestingly, action recognition is found to perform better with event frames, suggesting that these data provide discriminative information that helps the neural model learn good features.
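The abstract does not say how events are aggregated into event frames, so the following is only a minimal sketch of one common scheme, not the authors' method. It assumes a hypothetical event format `(timestamp_us, x, y, polarity)` with integer microsecond timestamps and polarity in {+1, -1}, and it accumulates signed polarity counts per pixel over fixed time windows. The function name `events_to_frames` and its parameters are illustrative.

```python
from typing import List, Tuple

# Hypothetical event format (an assumption, not taken from the paper):
# (timestamp in microseconds, x, y, polarity), with polarity +1 or -1.
Event = Tuple[int, int, int, int]
Frame = List[List[int]]  # frame[y][x] holds a signed event count


def events_to_frames(events: List[Event], width: int, height: int,
                     window_us: int) -> List[Frame]:
    """Aggregate events into frames covering consecutive fixed time windows."""
    if window_us <= 0:
        raise ValueError("window_us must be positive")
    if not events:
        return []

    # Sort by timestamp so the first and last events bound the recording.
    ordered = sorted(events, key=lambda e: e[0])
    t0 = ordered[0][0]
    n_frames = (ordered[-1][0] - t0) // window_us + 1

    # One zero-initialised height x width grid per time window.
    frames = [[[0] * width for _ in range(height)] for _ in range(n_frames)]

    for t, x, y, polarity in ordered:
        # Silently skip events that fall outside the sensor grid.
        if 0 <= x < width and 0 <= y < height:
            frames[(t - t0) // window_us][y][x] += polarity
    return frames


if __name__ == "__main__":
    demo = [(0, 0, 0, 1), (10, 0, 0, 1), (20, 1, 0, -1), (50, 1, 1, 1)]
    for i, frame in enumerate(events_to_frames(demo, 2, 2, 40)):
        print(i, frame)
```

Because the output is an ordinary grid per time step, a sequence of such frames can be fed to the same CNN-plus-recurrent model as a sequence of gray-level images, which is the property the abstract relies on for a like-for-like comparison.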
Pages: 402-414
Page count: 13