MMTS: Multimodal Teacher-Student learning for One-Shot Human Action Recognition

被引:3
|
作者
Lee, Jongwhoa [1 ]
Sim, Minho [1 ]
Choi, Ho-Jin [1 ]
机构
[1] Korea Adv Inst Sci & Technol, Sch Comp, Daejeon, South Korea
关键词
human action recognition; skeleton; keypoints; one-shot; metric learning; teacher-student networks; CNN;
D O I
10.1109/BigComp57234.2023.00045
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Human action recognition (HAR) is applied to many real-world applications, such as visual surveillance, video retrieval, and autonomous driving vehicles. It can utilize various modalities such as RGB, infrared, depth, or skeleton. Among these, we selected and used a skeleton suited to real-time application because it requires less input than RGB data. Furthermore, we focused on a one-shot setting. The skeleton data tends to have a smaller dataset size than other modalities, so it is hard to expect the powerful generalization ability to make representation from unseen data (i.e. novel class). Therefore, to solve this problem, we proposed a skeleton-text multimodal learning method by borrowing a powerful pretrained text encoder that was trained using a large-scale dataset. Our method utilizes the teacher-student approach through the skeleton-text dataset and only uses the skeleton for inferences. The proposed method is more suitable for one-shot skeleton-based HAR compared to the existing multimodal learning method. Our approach outperformed the state-of-the-art methods for the one-shot action recognition protocol on the NTU RGB+D 120 dataset.
引用
收藏
页码:235 / 242
页数:8
相关论文
共 50 条
  • [31] Fast Simplex-HMM for One-Shot Learning Activity Recognition
    Rodriguez, Mario
    Orrite, Carlos
    Medrano, Carlos
    Makris, Dimitrios
    2017 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW), 2017, : 1259 - 1266
  • [32] Facial Recognition Experiments on a Robotic System Using One-Shot Learning
    Belo, Jose Pedro R.
    Sanches, Felipe P.
    Romero, Roseli A. F.
    2019 LATIN AMERICAN ROBOTICS SYMPOSIUM, 2019 BRAZILIAN SYMPOSIUM ON ROBOTICS (SBR) AND 2019 WORKSHOP ON ROBOTICS IN EDUCATION (LARS-SBR-WRE 2019), 2019, : 67 - 73
  • [33] A Reinforcement One-Shot Active Learning Approach for Aircraft Type Recognition
    Huang, Honglan
    Feng, Yanghe
    Huang, Jincai
    Zhang, Jiarui
    Chen, Li
    IEEE ACCESS, 2019, 7 : 147204 - 147214
  • [34] Student Engagement in One-Shot Library Instruction
    Walker, Kevin W.
    Pearce, Michael
    JOURNAL OF ACADEMIC LIBRARIANSHIP, 2014, 40 (3-4): : 281 - 290
  • [35] An Unsupervised Hierarchical Feature Learning Framework for One-Shot Image Recognition
    Guo, Zhenyu
    Wang, Z. Jane
    IEEE TRANSACTIONS ON MULTIMEDIA, 2013, 15 (03) : 621 - 632
  • [36] An Analysis of One-Shot Augmented Learning: A Face Recognition Case Study
    Jimenez-Bravo, Diego M.
    Lozano Murciego, Alvaro
    Sales Mendes, Andre
    Augusto Silva, Luis
    De la Iglesia, Daniel H.
    NEW TRENDS IN DISRUPTIVE TECHNOLOGIES, TECH ETHICS AND ARTIFICIAL INTELLIGENCE: THE DITTET COLLECTION, 2022, 1410 : 55 - 65
  • [37] Edge Face Recognition System Based on One-Shot Augmented Learning
    Jimenez-Bravo, Diego M.
    Lozano Murciego, Alvaro
    Sales Mendes, Andre
    Augusto Silva, Luis
    De la Iglesia, Daniel H.
    INTERNATIONAL JOURNAL OF INTERACTIVE MULTIMEDIA AND ARTIFICIAL INTELLIGENCE, 2022, 7 (06): : 31 - 44
  • [38] HOW A ONE-TO-ONE COMPUTING LEARNING ENVIRONMENT CHALLENGES TEACHER-STUDENT RELATIONS
    Adelsten, M.
    Lauridsen, C.
    Noer, B.
    Dirckinck-Holmfeld, L.
    EDULEARN18: 10TH INTERNATIONAL CONFERENCE ON EDUCATION AND NEW LEARNING TECHNOLOGIES, 2018, : 2973 - 2982
  • [39] Fine-Grained Grocery Product Recognition by One-Shot Learning
    Geng, Weidong
    Han, Feilin
    Lin, Jiangke
    Zhu, Liuyi
    Bai, Jieming
    Wang, Suzhen
    He, Lin
    Xiao, Qiang
    Lai, Zhangjiong
    PROCEEDINGS OF THE 2018 ACM MULTIMEDIA CONFERENCE (MM'18), 2018, : 1706 - 1714
  • [40] One-Shot Face Recognition with Feature Rectification via Adversarial Learning
    Zhou, Jianli
    Chen, Jun
    Liang, Chao
    Chen, Jin
    MULTIMEDIA MODELING (MMM 2020), PT I, 2020, 11961 : 290 - 302