MMTS: Multimodal Teacher-Student learning for One-Shot Human Action Recognition

被引:3
|
作者
Lee, Jongwhoa [1 ]
Sim, Minho [1 ]
Choi, Ho-Jin [1 ]
机构
[1] Korea Adv Inst Sci & Technol, Sch Comp, Daejeon, South Korea
关键词
human action recognition; skeleton; keypoints; one-shot; metric learning; teacher-student networks; CNN;
D O I
10.1109/BigComp57234.2023.00045
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Human action recognition (HAR) is applied to many real-world applications, such as visual surveillance, video retrieval, and autonomous driving vehicles. It can utilize various modalities such as RGB, infrared, depth, or skeleton. Among these, we selected and used a skeleton suited to real-time application because it requires less input than RGB data. Furthermore, we focused on a one-shot setting. The skeleton data tends to have a smaller dataset size than other modalities, so it is hard to expect the powerful generalization ability to make representation from unseen data (i.e. novel class). Therefore, to solve this problem, we proposed a skeleton-text multimodal learning method by borrowing a powerful pretrained text encoder that was trained using a large-scale dataset. Our method utilizes the teacher-student approach through the skeleton-text dataset and only uses the skeleton for inferences. The proposed method is more suitable for one-shot skeleton-based HAR compared to the existing multimodal learning method. Our approach outperformed the state-of-the-art methods for the one-shot action recognition protocol on the NTU RGB+D 120 dataset.
引用
收藏
页码:235 / 242
页数:8
相关论文
共 50 条
  • [11] Progressive Teacher-student Learning for Early Action Prediction
    Wang, Xionghui
    Hu, Jian-Fang
    Lai, Jianhuang
    Zhang, Jianguo
    Zheng, Wei-Shi
    2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 3551 - 3560
  • [12] One-Shot Face Recognition via Generative Learning
    Ding, Zhengming
    Guo, Yandong
    Zhang, Lei
    Fu, Yun
    PROCEEDINGS 2018 13TH IEEE INTERNATIONAL CONFERENCE ON AUTOMATIC FACE & GESTURE RECOGNITION (FG 2018), 2018, : 1 - 7
  • [13] Kinematic matrix: One-shot human action recognition using kinematic data structure
    Ranjbar, Mohammad Hassan
    Abdi, Ali
    Park, Ju Hong
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2025, 139
  • [14] AROS: Affordance Recognition with One-Shot Human Stances
    Pacheco-Ortega, Abel
    Mayol-Cuevas, Walterio
    FRONTIERS IN ROBOTICS AND AI, 2023, 10
  • [15] Teacher-Student Curriculum Learning
    Matiisen, Tambet
    Oliver, Avital
    Cohen, Taco
    Schulman, John
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2020, 31 (09) : 3732 - 3740
  • [16] CONDITIONAL TEACHER-STUDENT LEARNING
    Meng, Zhong
    Li, Jinyu
    Zhao, Yong
    Gong, Yifan
    2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 6445 - 6449
  • [17] One-Shot Gesture Recognition: One Step Towards Adaptive Learning
    Cabrera, Maria E.
    Sanchez-Tamayo, Natalia
    Voyles, Richard
    Wachs, Juan P.
    2017 12TH IEEE INTERNATIONAL CONFERENCE ON AUTOMATIC FACE AND GESTURE RECOGNITION (FG 2017), 2017, : 784 - 789
  • [18] Position and Orientation Aware One-Shot Learning for Medical Action Recognition From Signal Data
    Xie, Leiyu
    Yang, Yuxing
    Fu, Zeyu
    Naqvi, Syed Mohsen
    IEEE TRANSACTIONS ON MULTIMEDIA, 2025, 27 : 1860 - 1873
  • [19] Tangential Human Motion Recognition With Micro-Doppler Signatures and One-Shot Learning
    Yang, Yang
    Zhou, Zhengkang
    Li, Beichen
    Li, Junhan
    Lang, Yue
    IEEE SENSORS JOURNAL, 2023, 23 (20) : 24812 - 24821
  • [20] Optimizing One-Shot Recognition with Micro-Set Learning
    Tang, Kevin D.
    Tappen, Marshall F.
    Sukthankar, Rahul
    Lampert, Christoph H.
    2010 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2010, : 3027 - 3034