Prediction of User Retweets Based on Social Neighborhood Information and Topic Modelling

被引:1
|
作者
Gabriel Celayes, Pablo [1 ]
Ariel Dominguez, Martin [1 ]
机构
[1] Univ Nacl Cordoba, Fac Matemat Astron Fis & Comp, Cordoba, Argentina
关键词
Retweet prediction; Social model; Social network analysis; Machine learning; LDA; SVM;
D O I
10.1007/978-3-030-02840-4_12
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Twitter and other social networks have become a fundamental source of information and a powerful tool to spread ideas and opinions. A crucial step in understanding the mechanisms that drive information diffusion in Twitter, is to study the influence of the social neighborhood of a user in the construction of her retweeting preferences. In particular, to what extent can the preferences of a user be predicted given the preferences of her neighborhood. We build our own sample graph of Twitter users and study the problem of predicting retweets from a given user based on the retweeting behavior occurring in her second-degree social neighborhood (followed and followed-by-followed). We manage to train and evaluate user-centered binary classification models that predict retweets with an average F1 score of 87.6%, based purely on social information, that is, without analyzing the content of the tweets. For users getting low scores with such models (on a tuning dataset), we improve the results by adding features extracted from the content of tweets. To do so, we apply a Natural Language Processing (NLP) pipeline including a Twitter-specific adaptation of the Latent Dirichlet Allocation (LDA) probabilistic topic model.
引用
收藏
页码:146 / 157
页数:12
相关论文
共 50 条
  • [1] User-Agnostic Model for Retweets Prediction Based on Graph-Embedding Representation of Social Neighborhood Information
    Gabriel Celayes, Pablo
    Ariel Dominguez, Martin
    Barsotti, Damian
    INFORMATION MANAGEMENT AND BIG DATA, SIMBIG 2023, 2024, 2142 : 107 - 120
  • [2] The Diffusion of Information: The Impact of Sentiment and Topic on Retweets
    Berry, Colin
    2020 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2020, : 5636 - 5638
  • [3] Examining Information on Social Media: Topic Modelling, Trend Prediction and Community Classification
    Fang, Anjie
    SIGIR'17: PROCEEDINGS OF THE 40TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 2017, : 1377 - 1377
  • [4] Exploiting user-to-user topic inclusion degree for link prediction in social-information networks
    Wang, Zhiqiang
    Liang, Jiye
    Li, Ru
    EXPERT SYSTEMS WITH APPLICATIONS, 2018, 108 : 143 - 158
  • [5] Novel Web community recommendation based on user neighborhood and topic
    Yu Q.
    Peng Z.-Y.
    Hong L.
    Wan Y.-L.
    Ruan Jian Xue Bao/Journal of Software, 2016, 27 (05): : 1266 - 1284
  • [6] A user-based topic model with topical word embeddings for semantic modelling in social network
    Jin, Xin
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2022, 43 (01) : 1467 - 1480
  • [7] Prediction of Likes and Retweets Using Text Information Retrieval
    Daga, Ishita
    Gupta, Anchal
    Vardhan, Raj
    Mukherjee, Partha
    COMPLEX ADAPTIVE SYSTEMS, 2020, 168 : 123 - 128
  • [8] User Context Information Prediction Based on the Mobile Internet Social Pictures
    Wang, Yilei
    Ren, Huaigui
    Qin, Zhen
    Zheng, Wentao
    Yu, Linfang
    Geng, Zhenwei
    2016 2ND IEEE INTERNATIONAL CONFERENCE ON COMPUTER AND COMMUNICATIONS (ICCC), 2016, : 2397 - 2401
  • [9] Personality Prediction Based on All Characters of User Social Media Information
    Wan, Danlin
    Zhang, Chuang
    Wu, Ming
    An, Zhixiang
    SOCIAL MEDIA PROCESSING, 2014, 489 : 220 - 230
  • [10] Topic Sensitive Information Diffusion Modelling in Online Social Networks
    Michelle, Gracia G.
    Kumaran, P.
    Chitrakala, S.
    PROCEEDINGS OF THE 2016 IEEE 2ND INTERNATIONAL CONFERENCE ON ADVANCES IN ELECTRICAL & ELECTRONICS, INFORMATION, COMMUNICATION & BIO INFORMATICS (IEEE AEEICB-2016), 2016, : 152 - 156