Cross Fusion for Egocentric Interactive Action Recognition

Cited by: 2

Authors:

Jiang, Haiyu [1]

Song, Yan [1]

He, Jiang [1]

Shu, Xiangbo [1]

Affiliation:

[1] Nanjing Univ Sci & Technol, Nanjing, Peoples R China

Source:

MULTIMEDIA MODELING (MMM 2020), PT I | 2020 / Vol. 11961

Keywords:

Egocentric interactive videos; Action recognition; Cross fusion

DOI:

10.1007/978-3-030-37731-1_58

Chinese Library Classification:

TP18 [Artificial intelligence theory]

Discipline classification codes:

081104; 0812; 0835; 1405

Abstract:

Egocentric interactive videos are characterized by heavy ego-motion, frequent viewpoint changes, and multiple types of activities, which prevent action recognition methods designed for third-person vision from achieving satisfactory results. In this paper, we introduce an effective architecture with two branches and a cross fusion method for action recognition in egocentric interactive vision. The two branches model the information from observers and interactors respectively, and each branch is built on multimodal multi-stream C3D networks. We use cross fusion to establish effective linkages between the two branches, with the aim of reducing redundant information and fusing complementary features. In addition, we propose variable sampling to obtain discriminative snippets for training. Experimental results demonstrate that the proposed architecture outperforms several state-of-the-art methods on two benchmarks.

Citation

Pages: 714-726

Page count: 13

