Cross Fusion for Egocentric Interactive Action Recognition

Cited by: 2

Authors:

Jiang, Haiyu [1]

Song, Yan [1]

He, Jiang [1]

Shu, Xiangbo [1]

Affiliations:

[1] Nanjing Univ Sci & Technol, Nanjing, Peoples R China

Source:

MULTIMEDIA MODELING (MMM 2020), PT I | 2020 / Vol. 11961

Keywords:

Egocentric interactive videos; Action recognition; Cross fusion;

DOI:

10.1007/978-3-030-37731-1_58

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

The characteristics of egocentric interactive videos, which include heavy ego-motion, frequent viewpoint changes and multiple types of activities, hinder third-person action recognition methods from obtaining satisfactory results. In this paper, we introduce an effective architecture with two branches and a cross fusion method for action recognition in egocentric interactive vision. The two branches are responsible for modeling the information from observers and inter-actors respectively, and each branch is designed based on multimodal multi-stream C3D networks. We leverage cross fusion to establish effective linkages between the two branches, with the aim of reducing redundant information and fusing complementary features. In addition, we propose variable sampling to obtain discriminative snippets for training. Experimental results demonstrate that the proposed architecture achieves superior performance over several state-of-the-art methods on two benchmarks.

Citation

Pages: 714 - 726

Page count: 13

50 items in total

[1] Interactive Prototype Learning for Egocentric Action Recognition
Wang, Xiaohan
Zhu, Linchao
Wang, Heng
Yang, Yi
[J]. 2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 8148 - 8157
[2] Cross-view action recognition understanding from exocentric to egocentric perspective
Truong, Thanh-Dat
Luu, Khoa
[J]. Neurocomputing, 2025, 614
[3] Multimodal Distillation for Egocentric Action Recognition
Radevski, Gorjan
Grujicic, Dusan
Blaschko, Matthew
Moens, Marie-Francine
Tuytelaars, Tinne
[J]. Proceedings of the IEEE International Conference on Computer Vision, 2023, : 5190 - 5201
[4] Multimodal Distillation for Egocentric Action Recognition
Radevski, Gorjan
Grujicic, Dusan
Blaschko, Matthew
Moens, Marie-Francine
Tuytelaars, Tinne
[J]. 2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION, ICCV, 2023, : 5190 - 5201
[5] EPIC-Fusion: Audio-Visual Temporal Binding for Egocentric Action Recognition
Kazakos, Evangelos
Nagrani, Arsha
Zisserman, Andrew
Damen, Dima
[J]. 2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 5491 - 5500
[6] Can Gaze Inform Egocentric Action Recognition?
Zhang, Zehua
Crandall, David
Proulx, Michael J.
Talathi, Sachin S.
Sharma, Abhishek
[J]. 2022 ACM SYMPOSIUM ON EYE TRACKING RESEARCH AND APPLICATIONS, ETRA 2022, 2022,
[7] Egocentric Action Recognition by Automatic Relation Modeling
Li, Haoxin
Zheng, Wei-Shi
Zhang, Jianguo
Hu, Haifeng
Lu, Jiwen
Lai, Jian-Huang
[J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (01) : 489 - 507
[8] Deep Attention Network for Egocentric Action Recognition
Lu, Minlong
Li, Ze-Nian
Wang, Yueming
Pan, Gang
[J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2019, 28 (08) : 3703 - 3713
[9] Learning Spatiotemporal Attention for Egocentric Action Recognition
Lu, Minlong
Liao, Danping
Li, Ze-Nian
[J]. 2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW), 2019, : 4425 - 4434
[10] Generic Action Recognition from Egocentric Videos
Singh, Suriya
Arora, Chetan
Jawahar, C. V.
[J]. 2015 FIFTH NATIONAL CONFERENCE ON COMPUTER VISION, PATTERN RECOGNITION, IMAGE PROCESSING AND GRAPHICS (NCVPRIPG), 2015,
