Dynamic video mix-up for cross-domain action recognition

被引:0
|
作者
Wu, Han [1 ,2 ]
Song, Chunfeng [2 ]
Yue, Shaolong [1 ]
Wang, Zhenyu [1 ]
Xiao, Jun [3 ]
Liu, Yanyang [4 ]
机构
[1] School of Control and Computer Engineering, North China Electric Power University, Beijing,102206, China
[2] Center for Research on Intelligent Perception and Computing (CRIPAC), National Laboratory of Pattern Recognition (NLPR), Institute of Automation, Chinese Academy of Sciences (CASIA), Beijing,100190, China
[3] School of Artificial Intelligence, University of Chinese Academy of Sciences (UCAS), Beijing,100049, China
[4] AIPARK, Beijing,100080, China
来源
Neurocomputing | 2022年 / 471卷
基金
中国国家自然科学基金;
关键词
Action recognition - Cross-domain - Domain problems - Dynamic videos - Generalisation - Performance - Recognition accuracy - Recognition models - Target domain - Video-level mix-up;
D O I
暂无
中图分类号
学科分类号
摘要
In recent years, action recognition has been extensively studied. For some general action datasets, such as UCF101 [1], the recognition accuracy in a specific domain can reach 95%. However, due to the existence of the domain-wise discrepancy, the performance of the model will be significantly reduced when deployed to realistic scenes. Therefore, to support the generalization of the action recognition model in practical scenes, the cross-domain problem should be addressed urgently. In this paper, we propose a cross-domain video data fusion mechanism to reduce the difference between domains. Our method is different from existing methods in two points: (1) Instead of performing mix-up at the feature-level, we propose to execute the mix-up directly at the input-level, which introduces more original information beyond the middle features. In addition, a progressive learning method is introduced for adaptive cross-domain fusion. (2) To make full use of the action class knowledge from the source domain, we also propose pseudo-label guided mix-up data learning. Note that only top-ranking confident pseudo labels are selected to ensure the stable similarity between the source and target domains. We evaluate the proposed method on two widely used cross-domain datasets, including the UCF101-HMDB51full and UCF-Olympic. Extensive experimental results have shown that the proposed method is effective and achieves the state-of-the-art performance. In the HMDB51(source domain)→ UCF101(target domain) direction, the accuracy of our method can reach 98.60%, which is 9.54% improvement over the existing state-of-the-art method. © 2021 Elsevier B.V.
引用
收藏
页码:358 / 368
相关论文
共 50 条
  • [31] Cross-domain action recognition via collective matrix factorization with graph Laplacian regularization
    Tang, Jun
    Jin, Haiqun
    Tan, Shoubiao
    Liang, Dong
    IMAGE AND VISION COMPUTING, 2016, 55 : 119 - 126
  • [32] Various-Level Spatio-Temporal Alignment for Cross-Domain Action Recognition
    Kim, Hyungmin
    Kim, Dohyung
    Kim, Jaehong
    ROBOT INTELLIGENCE TECHNOLOGY AND APPLICATIONS 6, 2022, 429 : 323 - 335
  • [33] Adaptive Data Optimization for Cross-domain Action Recognition in Low-Light Environment
    Liu, Haoran
    Yang, Huan
    Wang, Danwei
    2024 IEEE INTERNATIONAL CONFERENCE ON CYBERNETICS AND INTELLIGENT SYSTEMS, CIS AND IEEE INTERNATIONAL CONFERENCE ON ROBOTICS, AUTOMATION AND MECHATRONICS, RAM, CIS-RAM 2024, 2024, : 399 - 404
  • [34] Cross-domain Knowledge Transfer Schemes for 3D Human Action Recognition
    Psaltis, Athanasios
    Papadopoulos, Georgios Th
    Daras, Petros
    2019 27TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2019,
  • [35] Pairwise Two-Stream ConvNets for Cross-Domain Action Recognition With Small Data
    Gao, Zan
    Guo, Leming
    Ren, Tongwei
    Liu, An-An
    Cheng, Zhi-Yong
    Chen, Shengyong
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2022, 33 (03) : 1147 - 1161
  • [36] Class structure-aware adversarial loss for cross-domain human action recognition
    Chen, Wanjun
    Liu, Long
    Lin, Guangfeng
    Chen, Yajun
    Wang, Jing
    IET IMAGE PROCESSING, 2021, 15 (14) : 3425 - 3432
  • [37] Domain Adaptive Sampling for Cross-Domain Point Cloud Recognition
    Wang, Zicheng
    Li, Wen
    Xu, Dong
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2023, 33 (12) : 7604 - 7615
  • [38] Cross-Domain Similarity in Domain Adaptation for Human Activity Recognition
    Kasim, Samra
    Sheppard, John W.
    2023 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, IJCNN, 2023,
  • [39] Domain Adversarial Network for Cross-Domain Emotion Recognition in Conversation
    Ma, Hongchao
    Zhang, Chunyan
    Zhou, Xiabing
    Chen, Junyi
    Zhou, Qinglei
    APPLIED SCIENCES-BASEL, 2022, 12 (11):
  • [40] Cross-Domain Recognition via Projective Cross-Reconstruction
    Fang, Xiaozhao
    Jiang, Lin
    Han, Na
    Sun, Weijun
    Xu, Yong
    Xie, Shengli
    IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2022, 52 (12): : 7366 - 7377