Wb-MSF: A Large-scale Multi-source Information Diffusion Dataset for Social Information Diffusion Prediction

Authors
Wu, Zhen [1]
Zhou, Jingya [1]
Wang, Jie [1]
Sun, Xigang [1]
Affiliations
[1] Soochow Univ, Sch Comp Sci & Technol, Suzhou, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
datasets; social networks; multi-source; information diffusion
DOI
10.1109/CBD58033.2022.00023
Chinese Library Classification
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Recently, many social network studies have focused on the diffusion of information posted by individual users, which has created strong demand for social network datasets. However, most available datasets were published nearly a decade ago, and their scale is limited. Moreover, they ignore the multiple posts that different users originate spontaneously under the same topic; together, these posts form a kind of multi-source information. This paper presents Wb-MSF, a large-scale dataset that contains multi-source information cascades and user followership. Unlike existing datasets used in information diffusion tasks, Wb-MSF is the first multi-source information dataset and additionally provides a followership network. Wb-MSF was crawled from the well-known social platform Sina-Weibo and contains tens of millions of followership edges and tens of thousands of information cascades formed by millions of users. It can support the information diffusion prediction problem. Our discussions and experiments are based on this problem: we carry out a statistical analysis of the dataset and examine the difference between single-source and multi-source information as well as the effect of the followership network.
Pages: 79-84
Number of pages: 6