Multimodal Deep Learning Framework for Image Popularity Prediction on Social Media

被引:18
|
作者
Abousaleh, Fatma S. [1 ,2 ]
Cheng, Wen-Huang [3 ]
Yu, Neng-Hao [4 ]
Tsao, Yu [2 ]
机构
[1] Acad Sinica, Social Networks & Human Ctr Comp Program, Taiwan Int Grad Program, Inst Informat Sci, Taipei 11529, Taiwan
[2] Acad Sinica, Res Ctr Informat Technol Innovat, Taipei 11529, Taiwan
[3] Natl Chiao Tung Univ, Dept Elect Engn, Inst Elect, Hsinchu 30010, Taiwan
[4] Natl Taiwan Univ Sci & Technol, Dept Design, Coll Design, Taipei 10607, Taiwan
关键词
Visualization; Social networking (online); Predictive models; Deep learning; Feature extraction; Multimedia Web sites; Image color analysis; Convolutional neural networks (CNNs); multimodal learning; popularity prediction; social media; PHOTO;
D O I
10.1109/TCDS.2020.3036690
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Billions of photos are uploaded to the Web daily through various types of social networks. Some of these images receive millions of views and become popular, whereas others remain completely unnoticed. This raises the problem of predicting image popularity on social media. The popularity of an image can be affected by several factors, such as visual content, aesthetic quality, user, post metadata, and time. Thus, considering all these factors is essential for accurately predicting image popularity. In addition, the efficiency of the predictive model also plays a crucial role. In this study, motivated by multimodal learning, which uses information from various modalities, and the current success of convolutional neural networks (CNNs) in various fields, we propose a deep learning model, called visual-social CNN (VSCNN), which predicts the popularity of a posted image by incorporating various types of visual and social features into a unified network model. VSCNN first learns to extract high-level representations from the input visual and social features by utilizing two individual CNNs. The outputs of these two networks are then fused into a joint network to estimate the popularity score in the output layer. We assess the performance of the proposed method by conducting extensive experiments on a data set of approximately 432K images posted on Flickr. The simulation results demonstrate that the proposed VSCNN model significantly outperforms state-of-the-art models, with a relative improvement of greater than 2.33%, 7.59%, and 14.16% in terms of Spearman's Rho, mean absolute error, and mean-squared error, respectively.
引用
收藏
页码:679 / 692
页数:14
相关论文
共 50 条
  • [1] Multimodal Learning for Image Popularity Prediction on Social Media
    Hu, Jiani
    Yamasaki, Toshihiko
    Aizawa, Kiyoharu
    [J]. 2016 IEEE INTERNATIONAL CONFERENCE ON CONSUMER ELECTRONICS-TAIWAN (ICCE-TW), 2016, : 259 - 260
  • [2] Multimodal Deep Learning for Social Media Popularity Prediction With Attention Mechanism
    Xu, Kele
    Lin, Zhimin
    Zhao, Jianqiao
    Shi, Peicang
    Deng, Wei
    Wang, Huaimin
    [J]. MM '20: PROCEEDINGS OF THE 28TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, 2020, : 4580 - 4584
  • [3] MMSPP: Multimodal Social Media Popularity Prediction
    Gu, Jianyang
    [J]. 2020 FOURTH INTERNATIONAL CONFERENCE ON INTELLIGENT COMPUTING IN DATA SCIENCES (ICDS), 2020,
  • [4] An Efficient Multi-View Multimodal Data Processing Framework for Social Media Popularity Prediction
    Tan, YunPeng
    Liu, Fangyu
    Li, BoWei
    Zhang, Zheng
    Zhang, Bo
    [J]. PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2022, 2022, : 7200 - 7204
  • [5] Social media popularity prediction with multimodal hierarchical fusion model
    Wang, Jing
    Yang, Shuo
    Zhao, Hui
    Yang, Yue
    [J]. COMPUTER SPEECH AND LANGUAGE, 2023, 80
  • [6] A Feature Generalization Framework for Social Media Popularity Prediction
    Wang, Kai
    Wang, Penghui
    Chen, Xin
    Huang, Qiushi
    Mao, Zhendong
    Zhang, Yongdong
    [J]. MM '20: PROCEEDINGS OF THE 28TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, 2020, : 4570 - 4574
  • [7] THGFormer: Time-Aware Hypergraph Learning for Multimodal Social Media Popularity Prediction (Student Abstract)
    Zhang, Jienan
    Liu, Jie
    Cheng, Zhangtao
    Xu, Xovee
    Liu, Fang
    Zhong, Ting
    Zhang, Kunpeng
    [J]. THIRTY-EIGTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 21, 2024, : 23705 - 23706
  • [8] Combining Multiple Features for Image Popularity Prediction in Social Media
    Wang, Wen
    Zhang, Wei
    [J]. PROCEEDINGS OF THE 2017 ACM MULTIMEDIA CONFERENCE (MM'17), 2017, : 1901 - 1905
  • [9] Prediction of Image Popularity Over Time on Social Media Networks
    Almgren, Khaled
    Lee, Jeongkyu
    Kim, Minkyu
    [J]. 2016 ANNUAL CONNECTICUT CONFERENCE ON INDUSTRIAL ELECTRONICS, TECHNOLOGY AND AUTOMATION (CT-IETA), 2016,
  • [10] Multimodal Context-Aware Recommender for Post Popularity Prediction in Social Media
    Mazloom, Masoud
    Hendriks, Bouke
    Worring, Marcel
    [J]. PROCEEDINGS OF THE THEMATIC WORKSHOPS OF ACM MULTIMEDIA 2017 (THEMATIC WORKSHOPS'17), 2017, : 236 - 244