On Predicting the Popularity of Newly Emerging Hashtags in Twitter

被引:158
|
作者
Ma, Zongyang [1 ]
Sun, Aixin [1 ]
Cong, Gao [1 ]
机构
[1] Nanyang Technol Univ, Sch Comp Engn, Singapore 639798, Singapore
关键词
text mining; content filtering; automatic classification;
D O I
10.1002/asi.22844
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Because of Twitter's popularity and the viral nature of information dissemination on Twitter, predicting which Twitter topics will become popular in the near future becomes a task of considerable economic importance. Many Twitter topics are annotated by hashtags. In this article, we propose methods to predict the popularity of new hashtags on Twitter by formulating the problem as a classification task. We use five standard classification models (i. e., Naive bayes, k-nearest neighbors, decision trees, support vector machines, and logistic regression) for prediction. The main challenge is the identification of effective features for describing new hashtags. We extract 7 content features from a hashtag string and the collection of tweets containing the hashtag and 11 contextual features from the social graph formed by users who have adopted the hashtag. We conducted experiments on a Twitter data set consisting of 31 million tweets from 2 million Singapore-based users. The experimental results show that the standard classifiers using the extracted features significantly outperform the baseline methods that do not use these features. Among the five classifiers, the logistic regression model performs the best in terms of the Micro-F1 measure. We also observe that contextual features are more effective than content features.
引用
收藏
页码:1399 / 1410
页数:12
相关论文
共 50 条
  • [1] Predicting Twitter Hashtags Popularity Level
    Doong, Shing H.
    PROCEEDINGS OF THE 49TH ANNUAL HAWAII INTERNATIONAL CONFERENCE ON SYSTEM SCIENCES (HICSS 2016), 2016, : 1959 - 1968
  • [2] Predicting the active period of popularity evolution: A case study on Twitter hashtags
    Huang, Jianyi
    Tang, Yuyuan
    Hu, Ying
    Li, Jianjiang
    Hu, Changjun
    INFORMATION SCIENCES, 2020, 512 : 315 - 326
  • [3] Modeling the popularity of twitter hashtags with master equations
    Oscar Fontanelli
    Demian Hernández
    Ricardo Mansilla
    Social Network Analysis and Mining, 2022, 12
  • [4] Modeling the popularity of twitter hashtags with master equations
    Fontanelli, Oscar
    Hernandez, Demian
    Mansilla, Ricardo
    SOCIAL NETWORK ANALYSIS AND MINING, 2022, 12 (01)
  • [5] Understanding and predicting the peak popularity of bursting hashtags
    Xu, Wenwen
    Shi, Peng
    Huang, Jianyi
    Liu, Feng
    JOURNAL OF COMPUTATIONAL SCIENCE, 2018, 28 : 328 - 335
  • [6] A Prediction Method of Peak Time Popularity Based on Twitter Hashtags
    Yu, Hai
    Hu, Ying
    Shi, Peng
    IEEE ACCESS, 2020, 8 : 61453 - 61461
  • [7] Predicting Bursts and Popularity of Hashtags in Real-Time
    Kong, Shoubin
    Mei, Qiaozhu
    Feng, Ling
    Ye, Fei
    Zhao, Zhe
    SIGIR'14: PROCEEDINGS OF THE 37TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 2014, : 927 - 930
  • [8] Hashtag Popularity on Twitter: Analyzing Co-occurrence of Multiple Hashtags
    Pervin, Nargis
    Phan, Tuan Quang
    Datta, Anindya
    Takeda, Hideaki
    Toriumi, Fujio
    SOCIAL COMPUTING AND SOCIAL MEDIA, SCSM 2015, 2015, 9182 : 169 - 182
  • [9] Analyzing and predicting news popularity on Twitter
    Wu, Bo
    Shen, Haiying
    INTERNATIONAL JOURNAL OF INFORMATION MANAGEMENT, 2015, 35 (06) : 702 - 711
  • [10] Real-Time Predicting Bursting Hashtags on Twitter
    Kong, Shoubin
    Mei, Qiaozhu
    Feng, Ling
    Zhao, Zhe
    WEB-AGE INFORMATION MANAGEMENT, WAIM 2014, 2014, 8485 : 268 - 271