On Predicting the Popularity of Newly Emerging Hashtags in Twitter

Cited by: 158
Authors
Ma, Zongyang [1]
Sun, Aixin [1]
Cong, Gao [1]
Affiliations
[1] Nanyang Technological University, School of Computer Engineering, Singapore 639798, Singapore
Keywords
text mining; content filtering; automatic classification
DOI
10.1002/asi.22844
Chinese Library Classification
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
Because of Twitter's popularity and the viral nature of information dissemination on Twitter, predicting which Twitter topics will become popular in the near future has become a task of considerable economic importance. Many Twitter topics are annotated by hashtags. In this article, we propose methods to predict the popularity of new hashtags on Twitter by formulating the problem as a classification task. We use five standard classification models (i.e., naive Bayes, k-nearest neighbors, decision trees, support vector machines, and logistic regression) for prediction. The main challenge is the identification of effective features for describing new hashtags. We extract 7 content features from a hashtag string and the collection of tweets containing the hashtag, and 11 contextual features from the social graph formed by users who have adopted the hashtag. We conducted experiments on a Twitter data set consisting of 31 million tweets from 2 million Singapore-based users. The experimental results show that the standard classifiers using the extracted features significantly outperform the baseline methods that do not use these features. Among the five classifiers, the logistic regression model performs the best in terms of the Micro-F1 measure. We also observe that contextual features are more effective than content features.
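The abstract ranks classifiers by the Micro-F1 measure. As a minimal sketch of how that measure is generally computed (not the authors' evaluation code), the Python below pools true positives, false positives, and false negatives over all classes before computing F1. The function name micro_f1 and the three popularity labels are hypothetical, chosen only for illustration; the record does not state the paper's actual class definitions.

```python
from collections import Counter


def micro_f1(y_true, y_pred):
    """Micro-averaged F1: pool TP/FP/FN over all classes, then compute F1."""
    tp, fp, fn = Counter(), Counter(), Counter()
    for truth, pred in zip(y_true, y_pred):
        if truth == pred:
            tp[truth] += 1   # correct prediction for this class
        else:
            fp[pred] += 1    # predicted class gains a false positive
            fn[truth] += 1   # true class gains a false negative
    total_tp = sum(tp.values())
    total_fp = sum(fp.values())
    total_fn = sum(fn.values())
    denom = 2 * total_tp + total_fp + total_fn
    return 2 * total_tp / denom if denom else 0.0


# Hypothetical labels for five hashtags (assumed classes, not from the paper).
y_true = ["not_popular", "popular", "not_popular", "very_popular", "popular"]
y_pred = ["not_popular", "popular", "popular", "very_popular", "not_popular"]

print(f"Micro-F1: {micro_f1(y_true, y_pred):.2f}")  # prints "Micro-F1: 0.60"
```

When every hashtag receives exactly one label, as in this sketch, Micro-F1 coincides with plain accuracy, because each misclassification counts once as a false positive and once as a false negative.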
Pages: 1399-1410
Number of pages: 12