Privacy-Safe Action Recognition via Cross-Modality Distillation

被引:0
|
作者
Kim, Yuhyun [1 ]
Jung, Jinwook [1 ]
Noh, Hyeoncheol [1 ]
Ahn, Byungtae [2 ]
Kwon, Junghye [3 ]
Choi, Dong-Geol [1 ]
机构
[1] Hanbat Natl Univ, Dept Informat & Commun Engn, Daejeon 34158, South Korea
[2] Korea Inst Machinery & Mat, Daejeon 34103, South Korea
[3] Chungnam Natl Univ, Coll Med, Dept Internal Med, Div Hematol Oncol, Daejeon 34134, South Korea
来源
IEEE ACCESS | 2024年 / 12卷
关键词
Action recognition; knowledge distillation; cross-modality distillation; deep learning; multi modal; privacy-safe;
D O I
10.1109/ACCESS.2024.3431227
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Human action recognition systems enhance public safety by detecting abnormal behavior autonomously. RGB sensors commonly used in such systems capture personal information of subjects and, as a result, run the risk of potential privacy leakage. On the other hand, privacy-safe alternatives, such as depth or thermal sensors, exhibit poorer performance because they lack the semantic context provided by RGB sensors. Moreover, the data availability of privacy-safe alternatives is significantly lower than RGB sensors. To address these problems, we explore effective cross-modality distillation methods in this paper, aiming to distill the knowledge of context-rich large-scale pre-trained RGB-based models into privacy-safe depth-based models. Based on extensive experiments on multiple architectures and benchmark datasets, we propose an effective method for training privacy-safe depth-based action recognition models via cross-modality distillation: cross-modality mixing distillation. This approach improves both the performance and efficiency by enabling interaction between depth and RGB modalities through a linear combination of their features. By utilizing the proposed cross-modal mixing distillation approach, we achieve state-of-the-art accuracy in two depth-based action recognition benchmarks. The code and the pre-trained models will be available upon publication.
引用
收藏
页码:125955 / 125965
页数:11
相关论文
共 50 条
  • [21] Exploring Cross-Modality Affective Reactions for Audiovisual Emotion Recognition
    Mariooryad, Soroosh
    Busso, Carlos
    IEEE TRANSACTIONS ON AFFECTIVE COMPUTING, 2013, 4 (02) : 183 - 196
  • [22] DLFace: Deep local descriptor for cross-modality face recognition
    Peng, Chunlei
    Wang, Nannan
    Li, Jie
    Gao, Xinbo
    PATTERN RECOGNITION, 2019, 90 : 161 - 171
  • [23] Modality Distillation with Multiple Stream Networks for Action Recognition
    Garcia, Nuno C.
    Morerio, Pietro
    Murino, Vittorio
    COMPUTER VISION - ECCV 2018, PT VIII, 2018, 11212 : 106 - 121
  • [24] Towards Cross-Modality Medical Image Segmentation with Online Mutual Knowledge Distillation
    Li, Kang
    Yu, Lequan
    Wang, Shujun
    Heng, Pheng-Ann
    THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 775 - 783
  • [25] A Structure-Aware Framework of Unsupervised Cross-Modality Domain Adaptation via Frequency and Spatial Knowledge Distillation
    Liu, Shaolei
    Yin, Siqi
    Qu, Linhao
    Wang, Manning
    Song, Zhijian
    IEEE TRANSACTIONS ON MEDICAL IMAGING, 2023, 42 (12) : 3919 - 3931
  • [26] BeSound: Bluetooth-Based Position Estimation Enhancing with Cross-Modality Distillation
    Bello, Hymalai
    Suh, Sungho
    Zhou, Bo
    Lukowicz, Paul
    2024 INTERNATIONAL CONFERENCE ON ACTIVITY AND BEHAVIOR COMPUTING, ABC 2024, 2024,
  • [27] RadOcc: Learning Cross-Modality Occupancy Knowledge through Rendering Assisted Distillation
    Zhang, Haiming
    Yan, Xu
    Bai, Dongfeng
    Gao, Jiantao
    Wang, Pan
    Liu, Bingbing
    Cui, Shuguang
    Li, Zhen
    THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 7, 2024, : 7060 - 7068
  • [28] Accurate Registration of Cross-Modality Geometry via Consistent Clustering
    Zhao, Mingyang
    Huang, Xiaoshui
    Jiang, Jingen
    Mou, Luntian
    Yan, Dong-Ming
    Ma, Lei
    IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, 2024, 30 (07) : 4055 - 4067
  • [29] Pedestrian Recognition Using Cross-Modality Learning in Convolutional Neural Networks
    Pop, Danut Ovidiu
    Rogozan, Alexandrina
    Nashashibi, Fawzi
    Bensrhair, Abdelaziz
    IEEE INTELLIGENT TRANSPORTATION SYSTEMS MAGAZINE, 2021, 13 (01) : 210 - 224
  • [30] Pedestrian Recognition through Different Cross-Modality Deep Learning Methods
    Pop, Danut Ovidiu
    Rogozan, Alexandrina
    Nashashibi, Fawzi
    Bensrhair, Abdelaziz
    2017 IEEE INTERNATIONAL CONFERENCE ON VEHICULAR ELECTRONICS AND SAFETY (ICVES), 2017, : 133 - 138