Multimodal Deep Learning Framework for Image Popularity Prediction on Social Media

被引:18
|
作者
Abousaleh, Fatma S. [1 ,2 ]
Cheng, Wen-Huang [3 ]
Yu, Neng-Hao [4 ]
Tsao, Yu [2 ]
机构
[1] Acad Sinica, Social Networks & Human Ctr Comp Program, Taiwan Int Grad Program, Inst Informat Sci, Taipei 11529, Taiwan
[2] Acad Sinica, Res Ctr Informat Technol Innovat, Taipei 11529, Taiwan
[3] Natl Chiao Tung Univ, Dept Elect Engn, Inst Elect, Hsinchu 30010, Taiwan
[4] Natl Taiwan Univ Sci & Technol, Dept Design, Coll Design, Taipei 10607, Taiwan
关键词
Visualization; Social networking (online); Predictive models; Deep learning; Feature extraction; Multimedia Web sites; Image color analysis; Convolutional neural networks (CNNs); multimodal learning; popularity prediction; social media; PHOTO;
D O I
10.1109/TCDS.2020.3036690
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Billions of photos are uploaded to the Web daily through various types of social networks. Some of these images receive millions of views and become popular, whereas others remain completely unnoticed. This raises the problem of predicting image popularity on social media. The popularity of an image can be affected by several factors, such as visual content, aesthetic quality, user, post metadata, and time. Thus, considering all these factors is essential for accurately predicting image popularity. In addition, the efficiency of the predictive model also plays a crucial role. In this study, motivated by multimodal learning, which uses information from various modalities, and the current success of convolutional neural networks (CNNs) in various fields, we propose a deep learning model, called visual-social CNN (VSCNN), which predicts the popularity of a posted image by incorporating various types of visual and social features into a unified network model. VSCNN first learns to extract high-level representations from the input visual and social features by utilizing two individual CNNs. The outputs of these two networks are then fused into a joint network to estimate the popularity score in the output layer. We assess the performance of the proposed method by conducting extensive experiments on a data set of approximately 432K images posted on Flickr. The simulation results demonstrate that the proposed VSCNN model significantly outperforms state-of-the-art models, with a relative improvement of greater than 2.33%, 7.59%, and 14.16% in terms of Spearman's Rho, mean absolute error, and mean-squared error, respectively.
引用
收藏
页码:679 / 692
页数:14
相关论文
共 50 条
  • [21] A layer-wise deep stacking model for social image popularity prediction
    Zehang Lin
    Feitao Huang
    Yukun Li
    Zhenguo Yang
    Wenyin Liu
    [J]. World Wide Web, 2019, 22 : 1639 - 1655
  • [22] Learning Social Image Embedding with Deep Multimodal Attention Networks
    Huang, Feiran
    Zhang, Xiaoming
    Li, Zhoujun
    Mei, Tao
    He, Yueying
    Zhao, Zhonghua
    [J]. PROCEEDINGS OF THE THEMATIC WORKSHOPS OF ACM MULTIMEDIA 2017 (THEMATIC WORKSHOPS'17), 2017, : 460 - 468
  • [23] Bridging Models for Popularity Prediction on Social Media
    Mishra, Swapnil
    [J]. PROCEEDINGS OF THE TWELFTH ACM INTERNATIONAL CONFERENCE ON WEB SEARCH AND DATA MINING (WSDM'19), 2019, : 810 - 811
  • [24] Catboost-based Framework with Additional User Information for Social Media Popularity Prediction
    Kang, Peipei
    Lin, Zehang
    Teng, Shaohua
    Zhang, Guipeng
    Guo, Lingni
    Zhang, Wei
    [J]. PROCEEDINGS OF THE 27TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA (MM'19), 2019, : 2677 - 2681
  • [25] Deep fusion of multimodal features for social media retweet time prediction
    Yin, Hui
    Yang, Shuiqiao
    Song, Xiangyu
    Liu, Wei
    Li, Jianxin
    [J]. WORLD WIDE WEB-INTERNET AND WEB INFORMATION SYSTEMS, 2021, 24 (04): : 1027 - 1044
  • [26] Deep fusion of multimodal features for social media retweet time prediction
    Hui Yin
    Shuiqiao Yang
    Xiangyu Song
    Wei Liu
    Jianxin Li
    [J]. World Wide Web, 2021, 24 : 1027 - 1044
  • [27] A Deep Multimodal Representation Learning Framework for Accurate Molecular Properties Prediction
    Yang, Yuxin
    Wang, Zixu
    Ahadian, Pegah
    Jerger, Abby
    Zucker, Jeremy
    Feng, Song
    Cheng, Feixiong
    Guan, Qiang
    [J]. PROCEEDING OF THE GREAT LAKES SYMPOSIUM ON VLSI 2024, GLSVLSI 2024, 2024, : 760 - 765
  • [28] Social Media Popularity Prediction: A Multiple Feature Fusion Approach with Deep Neural Networks
    Ding, Keyan
    Wang, Ronggang
    Wang, Shiqi
    [J]. PROCEEDINGS OF THE 27TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA (MM'19), 2019, : 2682 - 2686
  • [29] DPSP: a multimodal deep learning framework for polypharmacy side effects prediction
    Masumshah, Raziyeh
    Eslahchi, Changiz
    [J]. BIOINFORMATICS ADVANCES, 2023, 3 (01):
  • [30] A Framework for Policy Information Popularity Prediction in New Media
    Luo, Yin
    Wang, Fangfang
    Zhao, Feifei
    Guo, Jianbin
    Wang, Lei
    Hao, Yanni
    Zeng, Daniel Dajun
    [J]. 2019 IEEE INTERNATIONAL CONFERENCE ON INTELLIGENCE AND SECURITY INFORMATICS (ISI), 2019, : 209 - 211