On Predicting the Popularity of Newly Emerging Hashtags in Twitter

被引:158
|
作者
Ma, Zongyang [1 ]
Sun, Aixin [1 ]
Cong, Gao [1 ]
机构
[1] Nanyang Technol Univ, Sch Comp Engn, Singapore 639798, Singapore
关键词
text mining; content filtering; automatic classification;
D O I
10.1002/asi.22844
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Because of Twitter's popularity and the viral nature of information dissemination on Twitter, predicting which Twitter topics will become popular in the near future becomes a task of considerable economic importance. Many Twitter topics are annotated by hashtags. In this article, we propose methods to predict the popularity of new hashtags on Twitter by formulating the problem as a classification task. We use five standard classification models (i. e., Naive bayes, k-nearest neighbors, decision trees, support vector machines, and logistic regression) for prediction. The main challenge is the identification of effective features for describing new hashtags. We extract 7 content features from a hashtag string and the collection of tweets containing the hashtag and 11 contextual features from the social graph formed by users who have adopted the hashtag. We conducted experiments on a Twitter data set consisting of 31 million tweets from 2 million Singapore-based users. The experimental results show that the standard classifiers using the extracted features significantly outperform the baseline methods that do not use these features. Among the five classifiers, the logistic regression model performs the best in terms of the Micro-F1 measure. We also observe that contextual features are more effective than content features.
引用
收藏
页码:1399 / 1410
页数:12
相关论文
共 50 条
  • [41] Integrated & Alone: The Use of Hashtags in Twitter Social Activism
    Simpson, Ellen
    COMPANION OF THE 2018 ACM CONFERENCE ON COMPUTER SUPPORTED COOPERATIVE WORK AND SOCIAL COMPUTING (CSCW'18), 2018, : 237 - 240
  • [42] Nursing and Twitter: Creating an online community using hashtags
    Moorley, Calvin R.
    Chinn, Teresa
    COLLEGIAN, 2014, 21 (02) : 103 - 109
  • [43] Detecting political biases of named entities and hashtags on Twitter
    Zhiping Xiao
    Jeffrey Zhu
    Yining Wang
    Pei Zhou
    Wen Hong Lam
    Mason A. Porter
    Yizhou Sun
    EPJ Data Science, 12
  • [44] What affects publications’ popularity on Twitter?
    Liwei Zhang
    Jue Wang
    Scientometrics, 2021, 126 : 9185 - 9198
  • [45] What affects publications' popularity on Twitter?
    Zhang, Liwei
    Wang, Jue
    SCIENTOMETRICS, 2021, 126 (11) : 9185 - 9198
  • [46] Understanding Popularity of Social Media Entities: From Hashtags to Question Topics
    Maity, Suman Kalyan
    CSCW'17: COMPANION OF THE 2017 ACM CONFERENCE ON COMPUTER SUPPORTED COOPERATIVE WORK AND SOCIAL COMPUTING, 2017, : 77 - 80
  • [47] Educational influencers on Twitter. Analysis of hashtags and relationship structure
    Marcelo, Carlos
    Marcelo, Paula
    COMUNICAR, 2021, 29 (68) : 73 - 83
  • [48] A Distributed Approach for Mining Moroccan Hashtags using Twitter Platform
    El Abdouli, Abdeljalil
    Hassouni, Larbi
    Anoun, Houda
    PROCEEDINGS OF THE 2ND INTERNATIONAL CONFERENCE ON NETWORKING, INFORMATION SYSTEMS & SECURITY (NISS19), 2019,
  • [49] Socio-semantic query expansion using Twitter hashtags
    Anagnostopoulos, Ioannis
    Kolias, Vasileios
    Mylonas, Phivos
    2012 SEVENTH INTERNATIONAL WORKSHOP ON SEMANTIC AND SOCIAL MEDIA ADAPTATION AND PERSONALIZATION (SMAP 2012), 2012, : 29 - 34
  • [50] Tweeting for social justice in #Ferguson: Affective discourse in Twitter hashtags
    Blevins, Jeffrey Layne
    Lee, James Jaehoon
    McCabe, Erin E.
    Edgerton, Ezra
    NEW MEDIA & SOCIETY, 2019, 21 (07) : 1636 - 1653