Efficient Web Video Classification via Cross-modality Knowledge Transferring

Cited by: 1
Authors
Xia, Shijun [1]
Li, Tianyu [1]
Ge, Shengbin [1]
Dong, Zhengya [2]
Affiliations
[1] State Grid Shanghai Municipal Elect Power Co, Shanghai, Peoples R China
[2] Shineenergy Technol, Shanghai, Peoples R China
Keywords
Video Classification; Transfer Learning
DOI
10.1145/3007669.3007677
Chinese Library Classification (CLC) number
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
This paper proposes a novel, highly efficient method for classifying Web videos. Instead of analyzing the videos or extracting complicated visual features, both of which are computationally expensive, we use only the textual information associated with the to-be-classified Web videos. To address the sparsity of the textual features, we propose to exploit knowledge from auxiliary data of diverse modalities during training, so that more informative features can be constructed. We carried out extensive experiments on the MCG-WEB dataset, collected from YouTube, for video classification. The results demonstrate that our method markedly outperforms several related state-of-the-art methods and is quite fast, validating its effectiveness and efficiency.
Pages: 211-216
Number of pages: 6