Cross Fusion for Egocentric Interactive Action Recognition

Cited by: 2
Authors
Jiang, Haiyu [1]
Song, Yan [1]
He, Jiang [1]
Shu, Xiangbo [1]
Affiliations
[1] Nanjing University of Science and Technology, Nanjing, People's Republic of China
Source
Keywords
Egocentric interactive videos; Action recognition; Cross fusion;
DOI
10.1007/978-3-030-37731-1_58
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification code
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
The characteristics of egocentric interactive videos, which include heavy ego-motion, frequent viewpoint changes, and multiple types of activities, prevent third-person action recognition methods from obtaining satisfactory results. In this paper, we introduce an effective two-branch architecture with a cross fusion method for action recognition in egocentric interactive vision. The two branches model the information from observers and inter-actors, respectively, and each branch is built on multimodal multi-stream C3D networks. We leverage cross fusion to establish effective linkages between the two branches, aiming to reduce redundant information and fuse complementary features. In addition, we propose variable sampling to obtain discriminative snippets for training. Experimental results demonstrate that the proposed architecture outperforms several state-of-the-art methods on two benchmarks.
Pages: 714 - 726
Number of pages: 13
Related papers
50 in total
  • [31] Multimodal Fusion with Cross-Modal Attention for Action Recognition in Still Images
    Tsai, Jia-Hua
    Chu, Wei-Ta
    [J]. PROCEEDINGS OF THE 4TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA IN ASIA, MMASIA 2022, 2022,
  • [32] A novel two-level interactive action recognition model based on inertial data fusion
    Qiu, Sen
    Fan, Tianqi
    Jiang, Junhan
    Wang, Zhelong
    Wang, Yongzhen
    Xu, Junnan
    Sun, Tao
    Jiang, Nan
    [J]. INFORMATION SCIENCES, 2023, 633 : 264 - 279
  • [33] Egocentric Hand Track and Object-based Human Action Recognition
    Kapidis, Georgios
    Poppe, Ronald
    van Dam, Elsbeth
    Noldus, Lucas P. J. J.
    Veltkamp, Remco C.
    [J]. 2019 IEEE SMARTWORLD, UBIQUITOUS INTELLIGENCE & COMPUTING, ADVANCED & TRUSTED COMPUTING, SCALABLE COMPUTING & COMMUNICATIONS, CLOUD & BIG DATA COMPUTING, INTERNET OF PEOPLE AND SMART CITY INNOVATION (SMARTWORLD/SCALCOM/UIC/ATC/CBDCOM/IOP/SCI 2019), 2019, : 922 - 929
  • [34] ICON-Pose: Toward Egocentric Action Recognition for Intelligent Construction
    Suen, Christine Wun Ki
    Liu, Ziming
    Shi, Yangming
    Zou, Zhengbo
    [J]. COMPUTING IN CIVIL ENGINEERING 2023-DATA, SENSING, AND ANALYTICS, 2024, : 682 - 689
  • [35] Distilling interaction knowledge for semi-supervised egocentric action recognition
    Wang, Haoran
    Yang, Jiahao
    Yu, Baosheng
    Zhan, Yibing
    Tao, Dapeng
    Ling, Haibin
    [J]. Pattern Recognition, 2025, 157
  • [36] Symbiotic Attention for Egocentric Action Recognition With Object-Centric Alignment
    Wang, Xiaohan
    Zhu, Linchao
    Wu, Yu
    Yang, Yi
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (06) : 6605 - 6617
  • [37] Activity Recognition in Egocentric Videos Using Bag of Key Action Units
    Suma, K. Sai
    Aditya, G.
    Mukherjee, Snehasis
    [J]. ELEVENTH INDIAN CONFERENCE ON COMPUTER VISION, GRAPHICS AND IMAGE PROCESSING (ICVGIP 2018), 2018,
  • [38] Slowfast Diversity-aware Prototype Learning for Egocentric Action Recognition
    Dai, Guangzhao
    Shu, Xiangbo
    Yan, Rui
    Huang, Peng
    Tang, Jinhui
    [J]. PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023, 2023, : 7549 - 7558
  • [39] Gaze-Informed Egocentric Action Recognition for Memory Aid Systems
    Zuo, Zheming
    Yang, Longzhi
    Peng, Yonghong
    Chao, Fei
    Qu, Yanpeng
    [J]. IEEE ACCESS, 2018, 6 : 12894 - 12904
  • [40] LSTA: Long Short-Term Attention for Egocentric Action Recognition
    Sudhakaran, Swathikiran
    Escalera, Sergio
    Lanz, Oswald
    [J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 9946 - 9955