Role of twitter user profile features in retweet prediction for big data streams

被引:0
|
作者
Saurabh Sharma
Vishal Gupta
机构
[1] University Institute of Engineering and Technology,
[2] Panjab University,undefined
来源
关键词
Twitter; Social media analysis; Retweet prediction; User behavior; User profiling; Big data analysis;
D O I
暂无
中图分类号
学科分类号
摘要
To study the various factors influencing the process of information sharing on Twitter is a very active research area. This paper aims to explore the impact of numerical features extracted from user profiles in retweet prediction from the real-time raw feed of tweets. The originality of this work comes from the fact that the proposed model is based on simple numerical features with the least computational complexity, which is a scalable solution for big data analysis. This research work proposes three new features from the tweet author profile to capture the unique behavioral pattern of the user, namely “Author total activity”, “Author total activity per year”, and “Author tweets per year”. The features set is tested on a dataset of 100 million random tweets collected through Twitter API. The binary labels regression gave an accuracy of 0.98 for user-profile features and gave an accuracy of 0.99 when combined with tweet content features. The regression analysis to predict the retweet count gave an R-squared value of 0.98 with combined features. The multi-label classification gave an accuracy of 0.9 for combined features and 0.89 for user-profile features. The user profile features performed better than tweet content features and performed even better when combined. This model is suitable for near real-time analysis of live streaming data coming through Twitter API and provides a baseline pattern of user behavior based on numerical features available from user profiles only.
引用
收藏
页码:27309 / 27338
页数:29
相关论文
共 46 条
  • [21] Prediction on critically ill patients: The role of "big data"
    Bulgarelli, Lucas
    Deliberato, Rodrigo Octavio
    Johnson, Alistair E. W.
    JOURNAL OF CRITICAL CARE, 2020, 60 : 64 - 68
  • [22] Demographical gender prediction of Twitter users using big data analytics: An application of decision marketing
    Roy S.
    Patel B.
    Bhattacharyya D.
    Dhayal K.
    Kim T.-H.
    Mittal M.
    International Journal of Reasoning-based Intelligent Systems, 2021, 13 (02) : 41 - 49
  • [23] A Simple Framework of Smart Geriatric Nursing considering Health Big Data and User Profile
    Li, Shijie
    Tang, Yongchuan
    COMPUTATIONAL AND MATHEMATICAL METHODS IN MEDICINE, 2020, 2020 (2020)
  • [24] Big Data Feature Selection And Projection For Gender Prediction Based On User Web Behaviour
    Gulsen, Esra
    Gunduz, Hakan
    Cataltepe, Zehra
    Serinol, Levent
    2015 23RD SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE (SIU), 2015, : 1545 - 1548
  • [25] Design and Application of a Prediction Model for User Purchase Intention Based on Big Data Analysis
    Zhang R.
    Zhang, Ruixue (zrx@dlnu.edu.cn); Zhang, Ruixue (zrx@dlnu.edu.cn), 1600, International Information and Engineering Technology Association (25): : 311 - 317
  • [26] User Persona in Personalized Wireless Networks: A Big Data-Driven Prediction Framework
    Alkurd, Rawan
    AbuAlhaol, Ibrahim
    Yanikomeroglu, Halim
    2020 IEEE 92ND VEHICULAR TECHNOLOGY CONFERENCE (VTC2020-FALL), 2020,
  • [27] User-Specific Loyalty Measure and Prediction Using Deep Neural Network From Twitter Data
    Urolagin, Siddhaling
    Patel, Saifali
    IEEE TRANSACTIONS ON COMPUTATIONAL SOCIAL SYSTEMS, 2024, 11 (01) : 1046 - 1061
  • [28] Employing traditional machine learning algorithms for big data streams analysis: The case of object trajectory prediction
    Valsamis, Angelos
    Tserpes, Konstantinos
    Zissis, Dimitrios
    Anagnostopoulos, Dimosthenis
    Varvarigou, Theodora
    JOURNAL OF SYSTEMS AND SOFTWARE, 2017, 127 : 249 - 257
  • [29] Big Data Analysis and User Behavior Prediction of Social Networks Based on Artificial Neural Network
    Liu Z.
    Song T.
    Journal of Computing and Information Technology, 2023, 31 (03) : 185 - 201
  • [30] Extracting Integrated Features of Electronic Medical Records Big Data for Mortality and Phenotype Prediction
    Li, Fei
    Chen, Yiqiang
    Gu, Yang
    Wang, Yaowei
    Pan, Yi
    CHINESE JOURNAL OF ELECTRONICS, 2024, 33 (03) : 776 - 792