CROSS-MODAL KNOWLEDGE DISTILLATION FOR ACTION RECOGNITION

Cited: 0
Authors
Thoker, Fida Mohammad [1 ]
Gall, Juergen [1 ]
Affiliations
[1] Univ Bonn, Bonn, Germany
Funding
European Research Council
Keywords
Knowledge Distillation; Action Recognition; Transfer Learning; Cross-Modal Action Recognition
DOI
10.1109/icip.2019.8802909
Chinese Library Classification
TB8 [Photography]
Discipline Code
0804
Abstract
In this work, we address the problem of how a network for action recognition that has been trained on one modality, such as RGB videos, can be adapted to recognize actions in another modality, such as sequences of 3D human poses. To this end, we extract the knowledge of the trained teacher network for the source modality and transfer it to a small ensemble of student networks for the target modality. The cross-modal knowledge distillation does not require any annotated data; instead, we use pairs of sequences of both modalities as supervision, which are straightforward to acquire. In contrast to previous works on knowledge distillation that use a KL loss, we show that the cross-entropy loss, together with mutual learning of a small ensemble of student networks, performs better. In fact, the proposed approach for cross-modal knowledge distillation nearly achieves the accuracy of a student network trained with full supervision.
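The distillation objective the abstract describes can be sketched as follows. This is a minimal illustration only: the exact weighting of the teacher and mutual-learning terms, and the use of hard teacher predictions as cross-entropy targets, are assumptions not specified in the abstract.

```python
import numpy as np

def softmax(z):
    # numerically stable softmax over the last axis
    z = z - z.max(axis=-1, keepdims=True)
    e = np.exp(z)
    return e / e.sum(axis=-1, keepdims=True)

def cross_entropy(p_target, logits):
    # cross-entropy between a target distribution and predicted logits
    q = softmax(logits)
    return -(p_target * np.log(q + 1e-12)).sum(axis=-1).mean()

def distillation_loss(teacher_logits, student_logits_list):
    """Cross-entropy distillation with mutual learning for a student ensemble.

    Each student is trained against the teacher's predictions on the paired
    source-modality sequence (cross-entropy rather than KL, as the paper
    advocates) and additionally mimics the mean prediction of its peers
    (mutual learning). Equal weighting of the two terms is an assumption.
    """
    n_classes = teacher_logits.shape[-1]
    # hard pseudo-labels from the teacher on the source modality (assumption)
    hard = np.eye(n_classes)[teacher_logits.argmax(axis=-1)]
    losses = []
    for i, s in enumerate(student_logits_list):
        ce_teacher = cross_entropy(hard, s)
        # mutual learning: match the averaged prediction of the other students
        peers = [softmax(t) for j, t in enumerate(student_logits_list) if j != i]
        ce_mutual = cross_entropy(np.mean(peers, axis=0), s) if peers else 0.0
        losses.append(ce_teacher + ce_mutual)
    return float(np.mean(losses))
```

Students that agree with the teacher's prediction on the paired sequence incur a small loss, while students that disagree are penalized both by the teacher term and by their peers.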
Pages: 6-10
Page count: 5
Related Papers
50 records in total
  • [1] Progressive Cross-modal Knowledge Distillation for Human Action Recognition
    Ni, Jianyuan
    Ngu, Anne H. H.
    Yan, Yan
    [J]. PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2022, 2022, : 5903 - 5912
  • [2] CROSS-MODAL KNOWLEDGE DISTILLATION FOR VISION-TO-SENSOR ACTION RECOGNITION
    Ni, Jianyuan
    Sarbajna, Raunak
    Liu, Yang
    Ngu, Anne H. H.
    Yan, Yan
    [J]. 2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 4448 - 4452
  • [3] Cross-modal knowledge distillation for continuous sign language recognition
    Gao, Liqing
    Shi, Peng
    Hu, Lianyu
    Feng, Jichao
    Zhu, Lei
    Wan, Liang
    Feng, Wei
    [J]. NEURAL NETWORKS, 2024, 179
  • [4] DistilVPR: Cross-Modal Knowledge Distillation for Visual Place Recognition
    Wang, Sijie
    She, Rui
    Kang, Qiyu
    Jian, Xingchao
    Zhao, Kai
    Song, Yang
    Tay, Wee Peng
    [J]. THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 9, 2024, : 10377 - 10385
  • [5] Cross-Modal Knowledge Distillation Method for Automatic Cued Speech Recognition
    Wang, Jianrong
    Tang, Ziyue
    Li, Xuewei
    Yu, Mei
    Fang, Qiang
    Liu, Li
    [J]. INTERSPEECH 2021, 2021, : 2986 - 2990
  • [6] FedCMD: A Federated Cross-modal Knowledge Distillation for Drivers' Emotion Recognition
    Bano, Saira
    Tonellotto, Nicola
    Cassara, Pietro
    Gotta, Alberto
    [J]. ACM TRANSACTIONS ON INTELLIGENT SYSTEMS AND TECHNOLOGY, 2024, 15 (03)
  • [7] Visual-to-EEG cross-modal knowledge distillation for continuous emotion recognition
    Zhang, Su
    Tang, Chuangao
    Guan, Cuntai
    [J]. PATTERN RECOGNITION, 2022, 130
  • [8] Learning an Augmented RGB Representation with Cross-Modal Knowledge Distillation for Action Detection
    Dai, Rui
    Das, Srijan
    Bremond, Francois
    [J]. 2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 13033 - 13044
  • [9] Acoustic NLOS Imaging with Cross-Modal Knowledge Distillation
    Shin, Ui-Hyeon
    Jang, Seungwoo
    Kim, Kwangsu
    [J]. PROCEEDINGS OF THE THIRTY-SECOND INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2023, 2023, : 1405 - 1413