Wb-MSF: A Large-scale Multi-source Information Diffusion Dataset for Social Information Diffusion Prediction

被引:0
|
作者
Wu, Zhen [1 ]
Zhou, Jingya [1 ]
Wang, Jie [1 ]
Sun, Xigang [1 ]
机构
[1] Soochow Univ, Sch Comp Sci & Technol, Suzhou, Peoples R China
基金
中国国家自然科学基金;
关键词
datasets; social networks; multi-source; information diffusion;
D O I
10.1109/CBD58033.2022.00023
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Recently, a large number of social network studies focus on the diffusion of information posted by individual users, which consequently brings in a strong demand for social network datasets. Nevertheless, most of the available datasets have been published for nearly a decade, and their scale is not large enough. Moreover, they ignore the multiple posts originated by different users spontaneously under the same topic, these posts form a kind of multi-source information. This paper presents Wb-MSF, a large-scale dataset that contains multi-source information cascades and user followership. Different from existing datasets used in information diffusion tasks, Wb-MSF is the first multi-source information dataset, and further provides a followership network. Wb-MSF is crawled from a famous social platform Sina-Weibo and contains tens of millions of followership edges and tens of thousands of information cascades formed by millions of users. It can support information diffusion prediction problem. In this paper, our discussions and experiments including carrying out a statistical analysis of the dataset, and examining the difference between single-source and multi-source information and the effect of the followership network are based on this problem.
引用
收藏
页码:79 / 84
页数:6
相关论文
共 50 条
  • [31] Information diffusion prediction based on cascade sequences and social topology
    Zhao, Jinghua
    Zhao, Jiale
    Feng, Juan
    COMPUTERS & ELECTRICAL ENGINEERING, 2023, 109
  • [32] Information Diffusion Prediction in Mobile Social Networks with Hydrodynamic Model
    Hu, Ying
    Chen, Min
    2016 IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS (ICC), 2016, : 286 - 290
  • [33] Information Diffusion Enhanced by Multi-Task Peer Prediction
    Ito, Kensuke
    Ohsawa, Shohei
    Tanaka, Hideyuki
    IIWAS2018: THE 20TH INTERNATIONAL CONFERENCE ON INFORMATION INTEGRATION AND WEB-BASED APPLICATIONS & SERVICES, 2014, : 94 - 102
  • [34] Large-Scale Network Imputation and Prediction of Traffic Volume Based on Multi-Source Data Collection System
    Kwon, Donghyun
    Lee, Changhee
    Kang, Heechan
    Kim, Inhi
    TRANSPORTATION RESEARCH RECORD, 2023, 2677 (09) : 30 - 42
  • [35] Prediction of groundwater pollution diffusion path based on multi-source data fusion
    Zhang, Yanhong
    Huo, Xiaofeng
    Luo, Yue
    FRONTIERS IN ENVIRONMENTAL SCIENCE, 2023, 10
  • [36] Friend recommendation in social networks based on multi-source information fusion
    Shulin Cheng
    Bofeng Zhang
    Guobing Zou
    Mingqing Huang
    Zhu Zhang
    International Journal of Machine Learning and Cybernetics, 2019, 10 : 1003 - 1024
  • [37] How to Measure the Information Diffusion Process in Large Social Networks?
    Krol, Dariusz
    Intelligent Information and Database Systems, Pt I, 2015, 9011 : 66 - 74
  • [38] Friend recommendation in social networks based on multi-source information fusion
    Cheng, Shulin
    Zhang, Bofeng
    Zou, Guobing
    Huang, Mingqing
    Zhang, Zhu
    INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2019, 10 (05) : 1003 - 1024
  • [39] A fast algorithm for diffusion source localization in large-scale complex networks
    Pan, Chunyu
    Wang, Jie
    Yan, Di
    Zhang, Changsheng
    Zhang, Xizhe
    JOURNAL OF COMPLEX NETWORKS, 2024, 12 (02)
  • [40] Full-Scale Information Diffusion Prediction With Reinforced Recurrent Networks
    Yang, Cheng
    Wang, Hao
    Tang, Jian
    Shi, Chuan
    Sun, Maosong
    Cui, Ganqu
    Liu, Zhiyuan
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, 34 (05) : 2271 - 2283