Visual Event-Based Egocentric Human Action Recognition

被引：1

作者：

Moreno-Rodriguez, Francisco J. ^{[1
]}

Javier Traver, V ^{[2
]}

Barranco, Francisco ^{[3
]}

Dimiccoli, Mariella ^{[4
]}

Pla, Filiberto ^{[2
]}

机构：

[1] Univ Jaume 1, Castellon de La Plana, Spain

[2] Univ Jaume 1, Inst New Imaging Technol, Castellon de La Plana, Spain

[3] Univ Granada, CITIC, Dept Comp Architecture & Technol, Granada, Spain

[4] Inst Robot & Informat Ind CSIC UPC, Barcelona, Spain

来源：

PATTERN RECOGNITION AND IMAGE ANALYSIS (IBPRIA 2022) | 2022年 / 13256卷

关键词：

Egocentric view; Action recognition; Event vision;

D O I：

10.1007/978-3-031-04881-4_32

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

This paper lies at the intersection of three research areas: human action recognition, egocentric vision, and visual event-based sensors. The main goal is the comparison of egocentric action recognition performance under either of two visual sources: conventional images, or event-based visual data. In this work, the events, as triggered by asynchronous event sensors or their simulation, are spatio-temporally aggregated into event frames (a grid-like representation). This allows to use exactly the same neural model for both visual sources, thus easing a fair comparison. Specifically, a hybrid neural architecture combining a convolutional neural network and a recurrent network is used. It is empirically found that this general architecture works for both, conventional gray-level frames, and event frames. This finding is relevant because it reveals that no modification or adaptation is strictly required to deal with event data for egocentric action classification. Interestingly, action recognition is found to perform better with event frames, suggesting that these data provide discriminative information that aids the neural model to learn good features.

引用

页码：402 / 414

页数：13

共 50 条

[41] Spatial and Temporal Downsampling in Event-Based Visual Classification
Cohen, Gregory
Afshar, Saeed
Orchard, Garrick
Tapson, Jonathan
Benosman, Ryad
van Schaik, Andre
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2018, 29 (10) : 5030 - 5044
[42] Stereo Event-Based Visual-Inertial Odometry
Wang, Kunfeng
Zhao, Kaichun
Lu, Wenshuai
You, Zheng
SENSORS, 2025, 25 (03)
[43] Asynchronous visual event-based time-to-contact
Clady, Xavier
Clercq, Charles
Ieng, Sio-Hoi
Houseini, Fouzhan
Randazzo, Marco
Natale, Lorenzo
Bartolozzi, Chiara
Benosman, Ryad
FRONTIERS IN NEUROSCIENCE, 2014, 8
[44] Recognition memory effects in event-based prospective memory
Tiller, SJ
Humphreys, MS
Neal, AF
AUSTRALIAN JOURNAL OF PSYCHOLOGY, 2004, 56 : 228 - 229
[45] Attention Mechanisms for Object Recognition with Event-Based Cameras
Cannici, Marco
Ciccone, Marco
Romanoni, Andrea
Matteucci, Matteo
2019 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2019, : 1127 - 1136
[46] Event-Based Recognition of Lived Experiences in User Reviews
Hassan, Ehab
Buscaldi, Davide
Gangemi, Aldo
KNOWLEDGE ENGINEERING AND KNOWLEDGE MANAGEMENT, EKAW 2016, 2016, 10024 : 320 - 336
[47] Grounding action-selection in event-based anticipation
Capdepuy, Philippe
Polani, Daniel
Nehaniv, Chrystopher L.
ADVANCES IN ARTIFICIAL LIFE, PROCEEDINGS, 2007, 4648 : 253 - +
[48] Point process models for event-based speech recognition
Jansen, Aren
Niyogi, Partha
SPEECH COMMUNICATION, 2009, 51 (12) : 1155 - 1168
[49] Event Recognition in Egocentric Videos Using a Novel Trajectory Based Feature
Buddubariki, Vinodh
Tulluri, Sunitha Gowd
Mukherjee, Snehasis
TENTH INDIAN CONFERENCE ON COMPUTER VISION, GRAPHICS AND IMAGE PROCESSING (ICVGIP 2016), 2016,
[50] Multimodal Distillation for Egocentric Action Recognition
Radevski, Gorjan
Grujicic, Dusan
Blaschko, Matthew
Moens, Marie-Francine
Tuytelaars, Tinne
2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION, ICCV, 2023, : 5190 - 5201

← 1 2 3 4 5 →