Egocentric Early Action Prediction via Adversarial Knowledge Distillation

被引:6
|
作者
Zheng, Na [1 ]
Song, Xuemeng [1 ]
Su, Tianyu [1 ]
Liu, Weifeng [2 ]
Yan, Yan [3 ]
Nie, Liqiang [1 ]
机构
[1] Shandong Univ, N3 Floor,72 Binhai Highway, Qingdao 266237, Peoples R China
[2] China Univ Petr East China, 66 West Changjiang Rd, Qingdao 266580, Peoples R China
[3] IIT, 10 West 35th St, Chicago, IL 60616 USA
关键词
Early action prediction; teacher-student knowledge distillation; egocentric video understanding; generative adversarial networks;
D O I
10.1145/3544493
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Egocentric early action prediction aims to recognize actions from the first-person view by only observing a partial video segment, which is challenging due to the limited context information of the partial video. In this article, to tackle the egocentric early action prediction problem, we propose a novel multi-modal adversarial knowledge distillation framework. In particular, our approach involves a teacher network to learn the enhanced representation of the partial video by considering the future unobserved video segment, and a student network to mimic the teacher network to produce the powerful representation of the partial video and based on that predicting the action label. To promote the knowledge distillation between the teacher and the student network, we seamlessly integrate adversarial learning with latent and discriminative knowledge regularizations encouraging the learned representations of the partial video to be more informative and discriminative toward the action prediction. Finally, we devise a multi-modal fusion module toward comprehensively predicting the action label. Extensive experiments on two public egocentric datasets validate the superiority of our method over the state-of-the-art methods. We have released the codes and involved parameters to benefit other researchers.(1)
引用
收藏
页数:21
相关论文
共 50 条
  • [1] Egocentric Action Prediction via Knowledge Distillation and Subject-Action Relevance
    Mukherjee, Snehasis
    Chopra, Bhavay
    [J]. COMPUTER VISION AND IMAGE PROCESSING, CVIP 2023, PT I, 2024, 2009 : 565 - 573
  • [2] Egocentric Early Action Prediction via Multimodal Transformer-Based Dual Action Prediction
    Guan, Weili
    Song, Xuemeng
    Wang, Kejie
    Wen, Haokun
    Ni, Hongda
    Wang, Yaowei
    Chang, Xiaojun
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2023, 33 (09) : 4472 - 4483
  • [3] Multimodal Global Relation Knowledge Distillation for Egocentric Action Anticipation
    Huang, Yi
    Yang, Xiaoshan
    Xu, Changsheng
    [J]. PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2021, 2021, : 245 - 254
  • [4] Multimodal Distillation for Egocentric Action Recognition
    Radevski, Gorjan
    Grujicic, Dusan
    Blaschko, Matthew
    Moens, Marie-Francine
    Tuytelaars, Tinne
    [J]. 2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION, ICCV, 2023, : 5190 - 5201
  • [5] Multimodal Distillation for Egocentric Action Recognition
    Radevski, Gorjan
    Grujicic, Dusan
    Blaschko, Matthew
    Moens, Marie-Francine
    Tuytelaars, Tinne
    [J]. Proceedings of the IEEE International Conference on Computer Vision, 2023, : 5190 - 5201
  • [6] PROGRESSIVE KNOWLEDGE DISTILLATION FOR EARLY ACTION RECOGNITION
    Vinh Than
    Balasubramanian, Niranjan
    Minh Hoai
    [J]. 2021 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2021, : 2583 - 2587
  • [7] Early Action Prediction With Generative Adversarial Networks
    Wang, Dong
    Yuan, Yuan
    Wang, Qi
    [J]. IEEE ACCESS, 2019, 7 : 35795 - 35804
  • [8] Ensembled CTR Prediction via Knowledge Distillation
    Zhu, Jieming
    Liu, Jinyang
    Li, Weiqi
    Lai, Jincai
    He, Xiuqiang
    Chen, Liang
    Zheng, Zibin
    [J]. CIKM '20: PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT, 2020, : 2941 - 2948
  • [9] Adversarial Variational Knowledge Distillation
    Tang, Xuan
    Lin, Tong
    [J]. ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2021, PT III, 2021, 12893 : 558 - 569
  • [10] Adversarial Metric Knowledge Distillation
    Dong, Zihe
    Sun, Xin
    Dong, Junyu
    Zhao, Haoran
    [J]. 2020 6TH INTERNATIONAL CONFERENCE ON COMMUNICATION AND INFORMATION PROCESSING, ICCIP 2020, 2020, : 159 - 164