Cross Fusion for Egocentric Interactive Action Recognition

被引:2
|
作者
Jiang, Haiyu [1 ]
Song, Yan [1 ]
He, Jiang [1 ]
Shu, Xiangbo [1 ]
机构
[1] Nanjing Univ Sci & Technol, Nanjing, Peoples R China
来源
关键词
Egocentric interactive videos; Action recognition; Cross fusion;
D O I
10.1007/978-3-030-37731-1_58
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The characteristics of egocentric interactive videos, which include heavy ego-motion, frequent viewpoint changes and multiple types of activities, hinder the action recognition methods of third-person vision from obtaining satisfactory results. In this paper, we introduce an effective architecture with two branches and a cross fusion method for action recognition in egocentric interactive vision. The two branches are responsible to model the information from observers and inter-actors respectively, and each branch is designed based on the multimodal multi-stream C3D networks. We leverage cross fusion to establish effective linkages between the two branches, which aims to reduce redundant information and fuse complementary features. Besides, we propose variable sampling to obtain discriminative snippets for training. Experimental results demonstrate that the proposed architecture achieves superior performance over several state-of-the-art methods on two benchmarks.
引用
收藏
页码:714 / 726
页数:13
相关论文
共 50 条
  • [1] Interactive Prototype Learning for Egocentric Action Recognition
    Wang, Xiaohan
    Zhu, Linchao
    Wang, Heng
    Yang, Yi
    [J]. 2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 8148 - 8157
  • [2] Cross-view action recognition understanding from exocentric to egocentric perspective
    Truong, Thanh-Dat
    Luu, Khoa
    [J]. Neurocomputing, 2025, 614
  • [3] Multimodal Distillation for Egocentric Action Recognition
    Radevski, Gorjan
    Grujicic, Dusan
    Blaschko, Matthew
    Moens, Marie-Francine
    Tuytelaars, Tinne
    [J]. Proceedings of the IEEE International Conference on Computer Vision, 2023, : 5190 - 5201
  • [4] Multimodal Distillation for Egocentric Action Recognition
    Radevski, Gorjan
    Grujicic, Dusan
    Blaschko, Matthew
    Moens, Marie-Francine
    Tuytelaars, Tinne
    [J]. 2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION, ICCV, 2023, : 5190 - 5201
  • [5] EPIC-Fusion: Audio-Visual Temporal Binding for Egocentric Action Recognition
    Kazakos, Evangelos
    Nagrani, Arsha
    Zisserman, Andrew
    Damen, Dima
    [J]. 2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 5491 - 5500
  • [6] Can Gaze Inform Egocentric Action Recognition?
    Zhang, Zehua
    Crandall, David
    Proulx, Michael J.
    Talathi, Sachin S.
    Sharma, Abhishek
    [J]. 2022 ACM SYMPOSIUM ON EYE TRACKING RESEARCH AND APPLICATIONS, ETRA 2022, 2022,
  • [7] Egocentric Action Recognition by Automatic Relation Modeling
    Li, Haoxin
    Zheng, Wei-Shi
    Zhang, Jianguo
    Hu, Haifeng
    Lu, Jiwen
    Lai, Jian-Huang
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (01) : 489 - 507
  • [8] Deep Attention Network for Egocentric Action Recognition
    Lu, Minlong
    Li, Ze-Nian
    Wang, Yueming
    Pan, Gang
    [J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2019, 28 (08) : 3703 - 3713
  • [9] Learning Spatiotemporal Attention for Egocentric Action Recognition
    Lu, Minlong
    Liao, Danping
    Li, Ze-Nian
    [J]. 2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW), 2019, : 4425 - 4434
  • [10] Generic Action Recognition from Egocentric Videos
    Singh, Suriya
    Arora, Chetan
    Jawahar, C. V.
    [J]. 2015 FIFTH NATIONAL CONFERENCE ON COMPUTER VISION, PATTERN RECOGNITION, IMAGE PROCESSING AND GRAPHICS (NCVPRIPG), 2015,