Multi-Modal Learning: Study on A Large-Scale Micro-Video Data Collection

被引：12

作者：

Chen, Jingyuan ^{[1
]}

机构：

[1] Natl Univ Singapore, Sch Comp, Singapore, Singapore

来源：

MM'16: PROCEEDINGS OF THE 2016 ACM MULTIMEDIA CONFERENCE | 2016年

关键词：

Micro-Videos; Popularity Prediction; Venue Estimation; Multi-Modal Learning;

D O I：

10.1145/2964284.2971477

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Micro-video sharing social services, as a new phenomenon in social media, enable users to share micro-videos and thus gain increasing enthusiasm among people. One distinct characteristic of micro-videos is the multi-modality, as these videos always have visual signals, audio tracks, textual descriptions as well as social clues. Such multi-modality data makes it possible to obtain a comprehensive understanding of videos and hence provides new opportunities for researchers. However, limited efforts thus far have been dedicated to this new emerging user-generated contents (UGCs) due to the lack of large-scale benchmark dataset. Towards this end, in this paper, we construct a large-scale micro-video dataset, which can support many research domains, such as popularity prediction and venue estimation. Based upon this dataset, we conduct an initial study in popularity prediction of micro-videos. Finally, we identify our future work.

引用

页码：1454 / 1458

页数：5

共 50 条

[1] Multi-modal Graph Contrastive Learning for Micro-video Recommendation
Yi, Zixuan
Wang, Xi
Ounis, Iadh
Macdonald, Craig
PROCEEDINGS OF THE 45TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL (SIGIR '22), 2022, : 1807 - 1811
[2] Multi-modal information augmented model for micro-video recommendation
Huo Y.
Jin B.
Liao Z.
Zhejiang Daxue Xuebao (Gongxue Ban)/Journal of Zhejiang University (Engineering Science), 2024, 58 (06): : 1142 - 1152
[3] Mutual Complementarity: Multi-Modal Enhancement Semantic Learning for Micro-Video Scene Recognition
Guo, Jie
Nie, Xiushan
Yin, Yilong
IEEE ACCESS, 2020, 8 : 29518 - 29524
[4] Predicting Micro-video Popularity via Multi-modal Retrieval Augmentation
Zhong, Ting
Lang, Jian
Zhang, Yifan
Cheng, Zhangtao
Zhang, Kunpeng
Zhou, Fan
PROCEEDINGS OF THE 47TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, SIGIR 2024, 2024, : 2579 - 2583
[5] MMGCN: Multi-modal Graph Convolution Network for Personalized Recommendation of Micro-video
Wei, Yinwei
Wang, Xiang
Nie, Liqiang
He, Xiangnan
Hong, Richang
Chua, Tat-Seng
PROCEEDINGS OF THE 27TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA (MM'19), 2019, : 1437 - 1445
[6] Leveraging Multi-modal Prior Knowledge for Large-scale Concept Learning in NoisyWeb Data
Liang, Junwei
Jiang, Lu
Meng, Deyu
Hauptmann, Alexander
PROCEEDINGS OF THE 2017 ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA RETRIEVAL (ICMR'17), 2017, : 32 - 40
[7] Adaptive Anti-Bottleneck Multi-Modal Graph Learning Network for Personalized Micro-video Recommendation
Cai, Desheng
Qian, Shengsheng
Fang, Quan
Hu, Jun
Xu, Changsheng
PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2022, 2022,
[8] Multi-Modal Multi-Scale Deep Learning for Large-Scale Image Annotation
Niu, Yulei
Lu, Zhiwu
Wen, Ji-Rong
Xiang, Tao
Chang, Shih-Fu
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2019, 28 (04) : 1720 - 1731
[9] Deep Multi-Modal Hashing With Semantic Enhancement for Multi-Label Micro-Video Retrieval
Jing, Peiguang
Sun, Haoyi
Nie, Liqiang
Li, Yun
Su, Yuting
IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2024, 36 (10) : 5080 - 5091
[10] Micro-video multi-label classification method based on multi-modal feature encoding
Jing P.
Li Y.
Su Y.
Xi'an Dianzi Keji Daxue Xuebao/Journal of Xidian University, 2022, 49 (04): : 109 - 117

← 1 2 3 4 5 →