Efficient Web Video Classification via Cross-modality Knowledge Transferring

Cited by: 1
Authors
Xia, Shijun [1]
Li, Tianyu [1]
Ge, Shengbin [1]
Dong, Zhengya [2]
Affiliations
[1] State Grid Shanghai Municipal Electric Power Company, Shanghai, People's Republic of China
[2] Shineenergy Technol, Shanghai, People's Republic of China
Keywords
Video Classification; Transfer Learning
DOI
10.1145/3007669.3007677
Chinese Library Classification (CLC) number
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
This paper proposes a novel, highly efficient method for classifying Web videos. Instead of analyzing the videos themselves or extracting complicated visual features, both of which are computationally expensive, we use only the textual information associated with the Web videos to be classified. To address the sparsity of the textual features, we propose exploiting knowledge from auxiliary data of diverse modalities during training, so that more informative features can be constructed. We carried out extensive experiments on video classification using the MCG-WEB dataset collected from YouTube. The results demonstrate that our method markedly outperforms several related state-of-the-art methods and is fast, validating its effectiveness and efficiency.
Pages: 211-216
Page count: 6