Multi-Modal Learning: Study on A Large-Scale Micro-Video Data Collection

被引:12
|
作者
Chen, Jingyuan [1 ]
机构
[1] Natl Univ Singapore, Sch Comp, Singapore, Singapore
关键词
Micro-Videos; Popularity Prediction; Venue Estimation; Multi-Modal Learning;
D O I
10.1145/2964284.2971477
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Micro-video sharing social services, as a new phenomenon in social media, enable users to share micro-videos and thus gain increasing enthusiasm among people. One distinct characteristic of micro-videos is the multi-modality, as these videos always have visual signals, audio tracks, textual descriptions as well as social clues. Such multi-modality data makes it possible to obtain a comprehensive understanding of videos and hence provides new opportunities for researchers. However, limited efforts thus far have been dedicated to this new emerging user-generated contents (UGCs) due to the lack of large-scale benchmark dataset. Towards this end, in this paper, we construct a large-scale micro-video dataset, which can support many research domains, such as popularity prediction and venue estimation. Based upon this dataset, we conduct an initial study in popularity prediction of micro-videos. Finally, we identify our future work.
引用
收藏
页码:1454 / 1458
页数:5
相关论文
共 50 条
  • [31] Application of smart card data in validating a large-scale multi-modal transit assignment model
    Tavassoli A.
    Mesbah M.
    Hickman M.
    Tavassoli, Ahmad (a.tavassoli@uq.edu.au), 2018, Springer Verlag (10) : 1 - 21
  • [32] Flexible Online Multi-modal Hashing for Large-scale Multimedia Retrieval
    Lu, Xu
    Zhu, Lei
    Cheng, Zhiyong
    Li, Jingjing
    Nie, Xiushan
    Zhang, Huaxiang
    PROCEEDINGS OF THE 27TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA (MM'19), 2019, : 1129 - 1137
  • [33] Multi-Modal Low-Data-Based Learning for Video Classification
    Citak, Erol
    Karsligil, Mine Elif
    APPLIED SCIENCES-BASEL, 2024, 14 (10):
  • [34] Multi-modal and multi-scale photo collection summarization
    Shen, Xu
    Tian, Xinmei
    MULTIMEDIA TOOLS AND APPLICATIONS, 2016, 75 (05) : 2527 - 2541
  • [35] Multi-modal and multi-scale photo collection summarization
    Xu Shen
    Xinmei Tian
    Multimedia Tools and Applications, 2016, 75 : 2527 - 2541
  • [36] Micro-Urban Heatmapping: A Multi-Modal and Multi-Temporal Data Collection Framework
    Hu, Ming
    Ghorbany, Siavash
    Yao, Siyuan
    Wang, Chaoli
    BUILDINGS, 2024, 14 (09)
  • [37] Practical Membership Inference Attacks Against Large-Scale Multi-Modal Models: A Pilot Study
    Ko, Myeongseob
    Jin, Ming
    Wang, Chenguang
    Jia, Ruoxi
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION, ICCV, 2023, : 4848 - 4858
  • [38] Multi-modal and multi-model interrogation of large-scale functional brain networks
    Castaldo, Francesca
    dos Santos, Francisco Pascoa
    Timms, Ryan C.
    Cabral, Joana
    Vohryzek, Jakub
    Deco, Gustavo
    Woolrich, Mark
    Friston, Karl
    Verschure, Paul
    Litvak, Vladimir
    NEUROIMAGE, 2023, 277
  • [39] Learning Instance-Level Representation for Large-Scale Multi-Modal Pretraining in E-commerce
    Jin, Yang
    Li, Yongzhi
    Yuan, Zehuan
    Mu, Yadong
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 11060 - 11069
  • [40] A Discrete-Time Model for Large-Scale Multi-Modal Transport Networks
    Pasquale, C.
    Siri, E.
    Sacone, S.
    Siri, S.
    IFAC PAPERSONLINE, 2021, 54 (02): : 7 - 12