MMTS: Multimodal Teacher-Student learning for One-Shot Human Action Recognition

Cited by: 3
Authors
Lee, Jongwhoa [1]
Sim, Minho [1]
Choi, Ho-Jin [1]
Affiliations
[1] Korea Adv Inst Sci & Technol, Sch Comp, Daejeon, South Korea
Keywords
human action recognition; skeleton; keypoints; one-shot; metric learning; teacher-student networks; CNN
DOI
10.1109/BigComp57234.2023.00045
Chinese Library Classification number
TP39 [Computer applications]
Subject classification codes
081203; 0835
Abstract
Human action recognition (HAR) is applied to many real-world applications, such as visual surveillance, video retrieval, and autonomous driving vehicles. It can utilize various modalities such as RGB, infrared, depth, or skeleton. Among these, we selected and used a skeleton suited to real-time application because it requires less input than RGB data. Furthermore, we focused on a one-shot setting. The skeleton data tends to have a smaller dataset size than other modalities, so it is hard to expect the powerful generalization ability to make representation from unseen data (i.e. novel class). Therefore, to solve this problem, we proposed a skeleton-text multimodal learning method by borrowing a powerful pretrained text encoder that was trained using a large-scale dataset. Our method utilizes the teacher-student approach through the skeleton-text dataset and only uses the skeleton for inferences. The proposed method is more suitable for one-shot skeleton-based HAR compared to the existing multimodal learning method. Our approach outperformed the state-of-the-art methods for the one-shot action recognition protocol on the NTU RGB+D 120 dataset.
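The one-shot, metric-learning setting named in the abstract and keywords can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes a skeleton encoder has already produced fixed-length embeddings, and the function names, class labels, and 3-D vectors below are hypothetical values chosen for illustration. Each novel class is represented by a single reference embedding, and a query is assigned to the class whose reference is most similar under cosine similarity.

```python
import math


def cosine_similarity(a, b):
    """Cosine similarity between two equal-length, non-zero vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


def classify_one_shot(query, references):
    """Return the label whose single reference embedding is most similar to the query."""
    return max(references, key=lambda label: cosine_similarity(query, references[label]))


# Hypothetical embeddings: one reference sample per novel class (the "one shot").
references = {
    "wave": [1.0, 0.0, 0.0],
    "kick": [0.0, 1.0, 0.0],
    "jump": [0.0, 0.0, 1.0],
}

# Hypothetical embedding of an unseen query sequence.
query = [0.1, 0.9, 0.2]
predicted = classify_one_shot(query, references)
print(predicted)  # prints "kick"
```

In this sketch only skeleton-derived embeddings are needed at inference time, which matches the abstract's statement that the text modality is used during training only; how the encoder is trained is outside the scope of the example.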
Pages: 235-242
Page count: 8
Related papers
  • [21] One-shot Action Localization by Learning Sequence Matching Network
    Yang, Hongtao
    He, Xuming
    Porikli, Fatih
    2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 1450 - 1459
  • [22] A teacher-student deep learning strategy for extreme low resolution unsafe action recognition in construction projects
    Yang, Meng
    Wu, Chengke
    Guo, Yuanjun
    He, Yong
    Jiang, Rui
    Jiang, Junjie
    Yang, Zhile
    ADVANCED ENGINEERING INFORMATICS, 2024, 59
  • [23] One-Shot Imitation Learning
    Duan, Yan
    Andrychowicz, Marcin
    Stadie, Bradly
    Ho, Jonathan
    Schneider, Jonas
    Sutskever, Ilya
    Abbeel, Pieter
    Zaremba, Wojciech
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 30 (NIPS 2017), 2017, 30
  • [24] Robust Speech Recognition Using Teacher-Student Learning Domain Adaptation
    Ma, Han
    Zhang, Qiaoling
    Tang, Roubing
    Zhang, Lu
    Jia, Yubo
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2022, E105D (12) : 2112 - 2118
  • [25] Adaptation-Oriented Feature Projection for One-Shot Action Recognition
    Zou, Yixiong
    Shi, Yemin
    Shi, Daochen
    Wang, Yaowei
    Liang, Yongsheng
    Tian, Yonghong
    IEEE TRANSACTIONS ON MULTIMEDIA, 2020, 22 (12) : 3166 - 3179
  • [26] CLEAR: Cumulative LEARning for One-Shot One-Class Image Recognition
    Kozerawski, Jedrzej
    Turk, Matthew
    2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 3446 - 3455
  • [27] General Sequence Teacher-Student Learning
    Wong, Jeremy Heng Meng
    Gales, Mark John Francis
    Wan, Yu
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2019, 27 (11) : 1725 - 1736
  • [28] Skeleton-DML: Deep Metric Learning for Skeleton-Based One-Shot Action Recognition
    Memmesheimer, Raphael
    Haering, Simon
    Theisen, Nick
    Paulus, Dietrich
    2022 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2022), 2022, : 837 - 845
  • [29] Lifelong Teacher-Student Network Learning
    Ye, Fei
    Bors, Adrian G.
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2022, 44 (10) : 6280 - 6296
  • [30] Teacher-student negotiation in an action research project
    Tsafos, Vassilis
    EDUCATIONAL ACTION RESEARCH, 2009, 17 (02) : 197 - 211