Dynamic video mix-up for cross-domain action recognition

被引:0
|
作者
Wu, Han [1 ,2 ]
Song, Chunfeng [2 ]
Yue, Shaolong [1 ]
Wang, Zhenyu [1 ]
Xiao, Jun [3 ]
Liu, Yanyang [4 ]
机构
[1] School of Control and Computer Engineering, North China Electric Power University, Beijing,102206, China
[2] Center for Research on Intelligent Perception and Computing (CRIPAC), National Laboratory of Pattern Recognition (NLPR), Institute of Automation, Chinese Academy of Sciences (CASIA), Beijing,100190, China
[3] School of Artificial Intelligence, University of Chinese Academy of Sciences (UCAS), Beijing,100049, China
[4] AIPARK, Beijing,100080, China
来源
Neurocomputing | 2022年 / 471卷
基金
中国国家自然科学基金;
关键词
Action recognition - Cross-domain - Domain problems - Dynamic videos - Generalisation - Performance - Recognition accuracy - Recognition models - Target domain - Video-level mix-up;
D O I
暂无
中图分类号
学科分类号
摘要
In recent years, action recognition has been extensively studied. For some general action datasets, such as UCF101 [1], the recognition accuracy in a specific domain can reach 95%. However, due to the existence of the domain-wise discrepancy, the performance of the model will be significantly reduced when deployed to realistic scenes. Therefore, to support the generalization of the action recognition model in practical scenes, the cross-domain problem should be addressed urgently. In this paper, we propose a cross-domain video data fusion mechanism to reduce the difference between domains. Our method is different from existing methods in two points: (1) Instead of performing mix-up at the feature-level, we propose to execute the mix-up directly at the input-level, which introduces more original information beyond the middle features. In addition, a progressive learning method is introduced for adaptive cross-domain fusion. (2) To make full use of the action class knowledge from the source domain, we also propose pseudo-label guided mix-up data learning. Note that only top-ranking confident pseudo labels are selected to ensure the stable similarity between the source and target domains. We evaluate the proposed method on two widely used cross-domain datasets, including the UCF101-HMDB51full and UCF-Olympic. Extensive experimental results have shown that the proposed method is effective and achieves the state-of-the-art performance. In the HMDB51(source domain)→ UCF101(target domain) direction, the accuracy of our method can reach 98.60%, which is 9.54% improvement over the existing state-of-the-art method. © 2021 Elsevier B.V.
引用
收藏
页码:358 / 368
相关论文
共 50 条
  • [21] Multimodal Cross-Domain Few-Shot Learning for Egocentric Action Recognition
    Hatano, Masashi
    Hachiuma, Ryo
    Fujii, Ryo
    Saito, Hideo
    COMPUTER VISION - ECCV 2024, PT XXXIII, 2025, 15091 : 182 - 199
  • [22] Cross-domain learned view-invariant representation for cross-view action recognition
    Li, Yandi
    Li, Mengdi
    Zhao, Zhihao
    JOURNAL OF ELECTRONIC IMAGING, 2022, 31 (06)
  • [23] Local Domain Adaptation for Cross-Domain Activity Recognition
    Zhao, Jiachen
    Deng, Fang
    He, Haibo
    Chen, Jie
    IEEE TRANSACTIONS ON HUMAN-MACHINE SYSTEMS, 2021, 51 (01) : 12 - 21
  • [24] Manifold and Transfer Subspace Learning for Cross-Domain Vehicle Recognition in Dynamic Systems
    Mendoza-Schrock, Olga
    Rizki, Mateen M.
    Velten, Vincent J.
    2015 18TH INTERNATIONAL CONFERENCE ON INFORMATION FUSION (FUSION), 2015, : 1954 - 1961
  • [25] A Unified User-Generic Framework for Myoelectric Pattern Recognition: Mix-Up and Adversarial Training for Domain Generalization and Adaptation
    Li, Xinhui
    Zhang, Xu
    Chen, Xiang
    Chen, Xun
    Zhang, Liwei
    IEEE TRANSACTIONS ON BIOMEDICAL ENGINEERING, 2023, 70 (08) : 2248 - 2257
  • [26] Cross-domain repetition priming in person recognition
    Burton, AM
    Kelly, SW
    Bruce, V
    QUARTERLY JOURNAL OF EXPERIMENTAL PSYCHOLOGY SECTION A-HUMAN EXPERIMENTAL PSYCHOLOGY, 1998, 51 (03): : 515 - 529
  • [27] Gait recognition with cross-domain transfer networks
    Tong, Suibing
    Fu, Yuzhuo
    Ling, Hefei
    JOURNAL OF SYSTEMS ARCHITECTURE, 2019, 93 : 40 - 47
  • [28] NLP Cross-Domain Recognition of Retail Products
    Petterson, Tobias
    Oucheikh, Rachid
    Lofstrom, Tuwe
    PROCEEDINGS OF 2022 7TH INTERNATIONAL CONFERENCE ON MACHINE LEARNING TECHNOLOGIES, ICMLT 2022, 2022, : 237 - 243
  • [29] Recent Advances on Cross-Domain Face Recognition
    Liu, Xiaoxiang
    Sun, Xiaobo
    He, Ran
    Tan, Tieniu
    BIOMETRIC RECOGNITION, 2016, 9967 : 147 - 157
  • [30] Mix-up Consistent Cross Representations for Data-Efficient Reinforcement Learning
    Liu, Shiyu
    Cao, Guitao
    Liu, Yong
    Li, Yan
    Wu, Chunwei
    Xi, Xidong
    2022 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2022,