Multi-Modal Learning: Study on A Large-Scale Micro-Video Data Collection

被引:12
|
作者
Chen, Jingyuan [1 ]
机构
[1] Natl Univ Singapore, Sch Comp, Singapore, Singapore
关键词
Micro-Videos; Popularity Prediction; Venue Estimation; Multi-Modal Learning;
D O I
10.1145/2964284.2971477
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Micro-video sharing social services, as a new phenomenon in social media, enable users to share micro-videos and thus gain increasing enthusiasm among people. One distinct characteristic of micro-videos is the multi-modality, as these videos always have visual signals, audio tracks, textual descriptions as well as social clues. Such multi-modality data makes it possible to obtain a comprehensive understanding of videos and hence provides new opportunities for researchers. However, limited efforts thus far have been dedicated to this new emerging user-generated contents (UGCs) due to the lack of large-scale benchmark dataset. Towards this end, in this paper, we construct a large-scale micro-video dataset, which can support many research domains, such as popularity prediction and venue estimation. Based upon this dataset, we conduct an initial study in popularity prediction of micro-videos. Finally, we identify our future work.
引用
收藏
页码:1454 / 1458
页数:5
相关论文
共 50 条
  • [1] Multi-modal Graph Contrastive Learning for Micro-video Recommendation
    Yi, Zixuan
    Wang, Xi
    Ounis, Iadh
    Macdonald, Craig
    PROCEEDINGS OF THE 45TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL (SIGIR '22), 2022, : 1807 - 1811
  • [2] Multi-modal information augmented model for micro-video recommendation
    Huo Y.
    Jin B.
    Liao Z.
    Zhejiang Daxue Xuebao (Gongxue Ban)/Journal of Zhejiang University (Engineering Science), 2024, 58 (06): : 1142 - 1152
  • [3] Mutual Complementarity: Multi-Modal Enhancement Semantic Learning for Micro-Video Scene Recognition
    Guo, Jie
    Nie, Xiushan
    Yin, Yilong
    IEEE ACCESS, 2020, 8 : 29518 - 29524
  • [4] Predicting Micro-video Popularity via Multi-modal Retrieval Augmentation
    Zhong, Ting
    Lang, Jian
    Zhang, Yifan
    Cheng, Zhangtao
    Zhang, Kunpeng
    Zhou, Fan
    PROCEEDINGS OF THE 47TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, SIGIR 2024, 2024, : 2579 - 2583
  • [5] MMGCN: Multi-modal Graph Convolution Network for Personalized Recommendation of Micro-video
    Wei, Yinwei
    Wang, Xiang
    Nie, Liqiang
    He, Xiangnan
    Hong, Richang
    Chua, Tat-Seng
    PROCEEDINGS OF THE 27TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA (MM'19), 2019, : 1437 - 1445
  • [6] Leveraging Multi-modal Prior Knowledge for Large-scale Concept Learning in NoisyWeb Data
    Liang, Junwei
    Jiang, Lu
    Meng, Deyu
    Hauptmann, Alexander
    PROCEEDINGS OF THE 2017 ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA RETRIEVAL (ICMR'17), 2017, : 32 - 40
  • [7] Adaptive Anti-Bottleneck Multi-Modal Graph Learning Network for Personalized Micro-video Recommendation
    Cai, Desheng
    Qian, Shengsheng
    Fang, Quan
    Hu, Jun
    Xu, Changsheng
    PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2022, 2022,
  • [8] Multi-Modal Multi-Scale Deep Learning for Large-Scale Image Annotation
    Niu, Yulei
    Lu, Zhiwu
    Wen, Ji-Rong
    Xiang, Tao
    Chang, Shih-Fu
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2019, 28 (04) : 1720 - 1731
  • [9] Deep Multi-Modal Hashing With Semantic Enhancement for Multi-Label Micro-Video Retrieval
    Jing, Peiguang
    Sun, Haoyi
    Nie, Liqiang
    Li, Yun
    Su, Yuting
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2024, 36 (10) : 5080 - 5091
  • [10] Micro-video multi-label classification method based on multi-modal feature encoding
    Jing P.
    Li Y.
    Su Y.
    Xi'an Dianzi Keji Daxue Xuebao/Journal of Xidian University, 2022, 49 (04): : 109 - 117