Efficient Web Video Classification via Cross-modality Knowledge Transferring

被引:1
|
作者
Xia, Shijun [1 ]
Li, Tianyu [1 ]
Ge, Shengbin [1 ]
Dong, Zhengya [2 ]
机构
[1] State Grid Shanghai Municipal Elect Power Co, Shanghai, Peoples R China
[2] Shineenergy Technol, Shanghai, Peoples R China
关键词
Video Classification; Transfer Learning;
D O I
10.1145/3007669.3007677
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper puts forward a novel method for classifying Web videos with high efficiency. Instead of analyzing the videos or extracting complicated visual features, which are both computationally expensive, we only utilize the related textual information of the to-be-classified Web videos. To address the sparsity of the textual features, we propose to exploit knowledge from auxiliary data of diverse modalilies during training, such that more informative features can be constructed. We carried out extensive experiments on MCG-WEB dataset collected from YouTube for video classification. The results demonstrate that our method can outperform several related state-of-the-art methods markedly and is quite fast, validating its effectiveness and efficiency.
引用
收藏
页码:211 / 216
页数:6
相关论文
共 50 条
  • [1] Cross-Modality Bridging and Knowledge Transferring for Image Understanding
    Yan, Chenggang
    Li, Liang
    Zhang, Chunjie
    Liu, Bingtao
    Zhang, Yongdong
    Dai, Qionghai
    IEEE TRANSACTIONS ON MULTIMEDIA, 2019, 21 (10) : 2675 - 2685
  • [2] Two-Stream Video Classification with Cross-Modality Attention
    Chi, Lu
    Tian, Guiyu
    Mu, Yadong
    Tian, Qi
    2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW), 2019, : 4511 - 4520
  • [3] Cross-Modality Knowledge Calibration Network for Video Corpus Moment Retrieval
    Chen, Tongbao
    Wang, Wenmin
    Jiang, Zhe
    Li, Ruochen
    Wang, Bingshu
    IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 26 : 3799 - 3813
  • [4] Representation Learning for Cross-Modality Classification
    van Tulder, Gijs
    de Bruijne, Marleen
    MEDICAL COMPUTER VISION AND BAYESIAN AND GRAPHICAL MODELS FOR BIOMEDICAL IMAGING, 2017, 10081 : 126 - 136
  • [5] Dynamic Knowledge Distillation with Cross-Modality Knowledge Transfer
    Wang, Guangzhi
    PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2021, 2021, : 2974 - 2978
  • [6] Efficient RGB-T Tracking via Cross-Modality Distillation
    Zhang, Tianlu
    Guo, Hongyuan
    Jiao, Qiang
    Zhang, Qiang
    Han, Jungong
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR, 2023, : 5404 - 5413
  • [7] Robust video question answering via contrastive cross-modality representation learning
    Xun YANG
    Jianming ZENG
    Dan GUO
    Shanshan WANG
    Jianfeng DONG
    Meng WANG
    Science China(Information Sciences), 2024, 67 (10) : 211 - 226
  • [8] Robust video question answering via contrastive cross-modality representation learning
    Yang, Xun
    Zeng, Jianming
    Guo, Dan
    Wang, Shanshan
    Dong, Jianfeng
    Wang, Meng
    SCIENCE CHINA-INFORMATION SCIENCES, 2024, 67 (10)
  • [9] Bridging asymmetry between image and video: Cross-modality knowledge transfer based on learning from video
    Zhou, Bingxin
    Zhou, Jianghao
    Chen, Zhongming
    Li, Ziqiang
    Deng, Long
    Ge, Yongxin
    EXPERT SYSTEMS WITH APPLICATIONS, 2025, 264
  • [10] Accurate Positioning via Cross-Modality Training
    Papaioannou, Savvas
    Wen, Hongkai
    Xiao, Zhuoling
    Markham, Andrew
    Trigoni, Niki
    SENSYS'15: PROCEEDINGS OF THE 13TH ACM CONFERENCE ON EMBEDDED NETWORKED SENSOR SYSTEMS, 2015, : 239 - 251