Role of twitter user profile features in retweet prediction for big data streams

被引:9
|
作者
Sharma, Saurabh [1 ]
Gupta, Vishal [1 ]
机构
[1] Panjab Univ, Univ Inst Engn & Technol, Chandigarh, India
关键词
Twitter; Social media analysis; Retweet prediction; User behavior; User profiling; Big data analysis; MODEL; INFORMATION; ACCOUNTS; WEB;
D O I
10.1007/s11042-022-12815-1
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
To study the various factors influencing the process of information sharing on Twitter is a very active research area. This paper aims to explore the impact of numerical features extracted from user profiles in retweet prediction from the real-time raw feed of tweets. The originality of this work comes from the fact that the proposed model is based on simple numerical features with the least computational complexity, which is a scalable solution for big data analysis. This research work proposes three new features from the tweet author profile to capture the unique behavioral pattern of the user, namely "Author total activity", "Author total activity per year", and "Author tweets per year". The features set is tested on a dataset of 100 million random tweets collected through Twitter API. The binary labels regression gave an accuracy of 0.98 for user-profile features and gave an accuracy of 0.99 when combined with tweet content features. The regression analysis to predict the retweet count gave an R-squared value of 0.98 with combined features. The multi-label classification gave an accuracy of 0.9 for combined features and 0.89 for user-profile features. The user profile features performed better than tweet content features and performed even better when combined. This model is suitable for near real-time analysis of live streaming data coming through Twitter API and provides a baseline pattern of user behavior based on numerical features available from user profiles only.
引用
收藏
页码:27309 / 27338
页数:30
相关论文
共 46 条
  • [31] Research on FCM-LR cross electricity theft detection based on big data user profile
    Hu, Ronghui
    Zhen, Tong
    INTERNATIONAL JOURNAL OF SYSTEM ASSURANCE ENGINEERING AND MANAGEMENT, 2024, 15 (07) : 3251 - 3265
  • [32] Role of machine learning and big data in healthcare for the prediction of epidemic diseases: a survey
    Sharma, S.
    Gupta, Yogesh Kumar
    INTERNATIONAL JOURNAL OF ENGINEERING SYSTEMS MODELLING AND SIMULATION, 2021, 12 (2-3) : 148 - 155
  • [33] TCB: A feature transformation method based central behavior for user interest prediction on mobile big data
    Zhou, Chen
    Jiang, Hao
    Chen, Yanqiu
    Wu, Jing
    Zhou, Jianguo
    Wu, Yuanshan
    INTERNATIONAL JOURNAL OF DISTRIBUTED SENSOR NETWORKS, 2016, 12 (10):
  • [34] Constructing Technology Commercialization Capability: The Critical Role of User Engagement and Big Data Analytics Capability
    Cai, Li
    Lu, Shan
    Chen, Biao
    JOURNAL OF ORGANIZATIONAL AND END USER COMPUTING, 2022, 34 (09)
  • [35] Automatic ontology-based User Profile Learning from heterogeneous Web Resources in a Big Data Context
    Hoppe, Anett
    PROCEEDINGS OF THE VLDB ENDOWMENT, 2013, 6 (12): : 1428 - 1433
  • [36] The Role of Teamwork in the Analysis of Big Data: A Study of Visual Analytics and Box Office Prediction
    Buchanan, Verica
    Lu, Yafeng
    McNeese, Nathan
    Steptoe, Michael
    Maciejewski, Ross
    Cooke, Nancy
    BIG DATA, 2017, 5 (01) : 53 - 66
  • [37] Motivations for customer revisit behavior in online review comments: Analyzing the role of user experience using big data approaches
    Park, Eunil
    JOURNAL OF RETAILING AND CONSUMER SERVICES, 2019, 51 : 14 - 18
  • [38] Analysis of big data for prediction of provider-initiated preterm birth and spontaneous premature deliveries and ranking the predictive features
    Khatibi, Toktam
    Kheyrikoochaksarayee, Naghme
    Sepehri, Mohammad Mehdi
    ARCHIVES OF GYNECOLOGY AND OBSTETRICS, 2019, 300 (06) : 1565 - 1582
  • [39] The prediction of crystal densities of a big data set using 1D and 2D structure features
    Henan Engineering Research Center of Industrial Circulating Water Treatment, Henan Joint International Research Laboratory of Environmental Pollution Control Materials, Henan University, Kaifeng
    475004, China
    不详
    441003, China
    不详
    150001, China
    Struct Chem, 5 (1375-1385):
  • [40] The prediction of crystal densities of a big data set using 1D and 2D structure features
    Li, Xianlan
    Kong, Dingling
    Luan, Yue
    Guo, Lili
    Lu, Yanhua
    Li, Wei
    Tang, Meng
    Zhang, Qingyou
    Pang, Aimin
    STRUCTURAL CHEMISTRY, 2024, 35 (05) : 1375 - 1385