On Predicting the Popularity of Newly Emerging Hashtags in Twitter

被引:158
|
作者
Ma, Zongyang [1 ]
Sun, Aixin [1 ]
Cong, Gao [1 ]
机构
[1] Nanyang Technol Univ, Sch Comp Engn, Singapore 639798, Singapore
关键词
text mining; content filtering; automatic classification;
D O I
10.1002/asi.22844
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Because of Twitter's popularity and the viral nature of information dissemination on Twitter, predicting which Twitter topics will become popular in the near future becomes a task of considerable economic importance. Many Twitter topics are annotated by hashtags. In this article, we propose methods to predict the popularity of new hashtags on Twitter by formulating the problem as a classification task. We use five standard classification models (i. e., Naive bayes, k-nearest neighbors, decision trees, support vector machines, and logistic regression) for prediction. The main challenge is the identification of effective features for describing new hashtags. We extract 7 content features from a hashtag string and the collection of tweets containing the hashtag and 11 contextual features from the social graph formed by users who have adopted the hashtag. We conducted experiments on a Twitter data set consisting of 31 million tweets from 2 million Singapore-based users. The experimental results show that the standard classifiers using the extracted features significantly outperform the baseline methods that do not use these features. Among the five classifiers, the logistic regression model performs the best in terms of the Micro-F1 measure. We also observe that contextual features are more effective than content features.
引用
收藏
页码:1399 / 1410
页数:12
相关论文
共 50 条
  • [21] Real-Time Anomaly Detection and Popularity Prediction for Emerging Events on Twitter
    Steuber, Florian
    Schneider, Sinclair
    Schneider, Joao A. G.
    Rodosek, Gabi Dreo
    PROCEEDINGS OF THE 2023 IEEE/ACM INTERNATIONAL CONFERENCE ON ADVANCES IN SOCIAL NETWORKS ANALYSIS AND MINING, ASONAM 2023, 2023, : 300 - 304
  • [22] Analyzing and predicting news popularity on Twitter (vol 35, pg 702, 2015)
    Wu, Bo
    Shen, Haiying
    INTERNATIONAL JOURNAL OF INFORMATION MANAGEMENT, 2016, 36 (01) : 180 - 180
  • [23] Gatekeeping Twitter: message diffusion in political hashtags
    Bastos, Marco Toledo
    Galdini Raimundo, Rafael Luis
    Travitzki, Rodrigo
    MEDIA CULTURE & SOCIETY, 2013, 35 (02) : 260 - 270
  • [24] The pragmatics of hashtags: Inference and conversational style on Twitter
    Scott, Kate
    JOURNAL OF PRAGMATICS, 2015, 81 : 8 - 20
  • [25] Twitter Hashtags for Anesthesiologists: Building Global Communities
    Gai, Nan
    Matava, Clyde
    A & A PRACTICE, 2019, 12 (02): : 59 - 62
  • [26] Analysis of Twitter Hashtags: Fuzzy Clustering Approach
    Zadeh, Lotfi A.
    Abbasov, Ali M.
    Shahbazova, Shahnaz N.
    2015 Annual Meeting of the North American Fuzzy Information Processing Society DigiPen NAFIPS 2015, 2015,
  • [27] Towards the prediction problems of bursting hashtags on Twitter
    Kong, Shoubin
    Ye, Fei
    Feng, Ling
    Zhao, Zhe
    JOURNAL OF THE ASSOCIATION FOR INFORMATION SCIENCE AND TECHNOLOGY, 2015, 66 (12) : 2566 - 2579
  • [28] Identifying topic relevant hashtags in Twitter streams
    Figueiredo, Filipe
    Jorge, Alipio
    INFORMATION SCIENCES, 2019, 505 : 65 - 83
  • [29] Predicting Popularity of Twitter Accounts through the Discovery of Link-Propagating Early Adopters
    Imamori, Daichi
    Tajima, Keishi
    CIKM'16: PROCEEDINGS OF THE 2016 ACM CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, 2016, : 639 - 648
  • [30] Twitter Issue Response Hashtags as Affordances for Momentary Connectedness
    Rathnayake, Chamil
    Suthers, Daniel D.
    SOCIAL MEDIA + SOCIETY, 2018, 4 (03):