Web-scale provenance reconstruction of implicit information diffusion on social media

Authors
Io Taxidou
Sven Lieber
Peter M. Fischer
Tom De Nies
Ruben Verborgh
Affiliations
[1] University of Freiburg
[2] IDLab, Ghent University – IMEC
Source
Distributed and Parallel Databases, 2018, 36 (01)
Keywords
Provenance; Information diffusion; Incremental clustering; Social media; Influence
DOI
Not available
Abstract
Fast, massive, and viral data diffused on social media affects a large share of the online population, and thus the (prospective) information diffusion mechanisms behind it are of great interest to researchers. The (retrospective) provenance of such data is equally important because it contributes to the understanding of the relevance and trustworthiness of the information. Furthermore, computing provenance in a timely way is crucial for particular use cases and practitioners, such as online journalists who need to assess specific pieces of information promptly. Social media currently provide insufficient mechanisms for provenance tracking, publication, and generation, while state-of-the-art social media research focuses mainly on explicit diffusion mechanisms (like retweets on Twitter or reshares on Facebook). The implicit diffusion mechanisms remain understudied because they are difficult to capture and properly understand. On the technical side, the state of the art for provenance reconstruction evaluates small datasets after the fact, sidestepping the scale and speed requirements of current social media data. In this paper, we investigate the mechanisms of implicit information diffusion by computing its fine-grained provenance. We prove that explicit mechanisms are insufficient to capture influence, and our analysis unravels a significant part of implicit interactions and influence in social media. Our approach works incrementally and can be scaled up to cover a truly Web-scale scenario like major events. We can process datasets consisting of up to several million messages on a single machine at rates that cover bursty behaviour, without compromising result quality. In doing so, we provide online journalists, and social media users in general, with fine-grained provenance reconstruction that sheds light on implicit interactions not captured by social media providers. These results are provided in an online fashion, which also allows for fast relevance and trustworthiness assessment.
Pages: 47 - 79
Page count: 32
Related papers
  • [1] Web-scale provenance reconstruction of implicit information diffusion on social media
    Taxidou, Io
    Lieber, Sven
    Fischer, Peter M.
    De Nies, Tom
    Verborgh, Ruben
    [J]. DISTRIBUTED AND PARALLEL DATABASES, 2018, 36 (01) : 47 - 79
  • [2] Social Web-Scale Provenance in the Cloud
    Simmhan, Yogesh
    Gomadam, Karthik
    [J]. PROVENANCE AND ANNOTATION OF DATA AND PROCESSES, 2010, 6378 : 298 - 300
  • [3] Towards Web-Scale How-Provenance
    Deutch, Daniel
    Gilad, Amir
    Moskovitch, Yuval
    [J]. 2015 13TH IEEE INTERNATIONAL CONFERENCE ON DATA ENGINEERING WORKSHOPS (ICDEW), 2015, : 68 - 70
  • [4] Web-Scale Media Recommendation Systems
    Dror, Gideon
    Koenigstein, Noam
    Koren, Yehuda
    [J]. PROCEEDINGS OF THE IEEE, 2012, 100 (09) : 2722 - 2736
  • [5] Web-Scale Multimedia Information Networks
    Qi, Guo-Jun
    Tsai, Min-Hsuan
    Tsai, Shen-Fu
    Cao, Liangliang
    Huang, Thomas S.
    [J]. PROCEEDINGS OF THE IEEE, 2012, 100 (09) : 2688 - 2704
  • [6] Browse by Chunks: Topic Mining and Organizing on Web-Scale Social Media
    Sang, Jitao
    Xu, Changsheng
    [J]. ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2011, 7 (01)
  • [7] Web-scale semantic information processing
    Heflin, Jeff
    Stuckenschmidt, Heiner
    [J]. JOURNAL OF WEB SEMANTICS, 2012, 10 : 1 - 2
  • [8] Web-Scale Information Extraction with Vertex
    Gulhane, Pankaj
    Madaan, Amit
    Mehta, Rupesh
    Ramamirtham, Jeyashankher
    Rastogi, Rajeev
    Satpal, Sandeep
    Sengamedu, Srinivasan H.
    Tengli, Ashwin
    Tiwari, Charu
    [J]. IEEE 27TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE 2011), 2011, : 1209 - 1220
  • [9] Information Provenance in Social Media
    Barbier, Geoffrey
    Liu, Huan
    [J]. SOCIAL COMPUTING, BEHAVIORAL-CULTURAL MODELING AND PREDICTION, 2011, 6589 : 276 - 283
  • [10] Early Steps Toward Web-Scale Information Extraction with LODIE
    Gentile, Anna Lisa
    Zhang, Ziqi
    Ciravegna, Fabio
    [J]. AI MAGAZINE, 2015, 36 (01) : 55 - 64