Temporal Semantics: Time-Varying Hashtag Sense Clustering

Authors
Stilo, Giovanni [1]
Velardi, Paola [1]
Affiliations
[1] Dipartimento Informat, Rome, Italy
Keywords
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
Hashtags are creative labels used in micro-blogs to characterize the topic of a message or discussion. However, because users create hashtags spontaneously, in a highly dynamic way, and in multiple languages, the same topic can be associated with different hashtags and, conversely, the same hashtag may refer to different topics in different time spans. Unlike common words, hashtags have no available sense catalogues (such as Wikipedia or WordNet), which complicates sense clustering; furthermore, hashtag labels are often obscure. In this paper we propose a sense clustering algorithm based on temporal mining. First, hashtag time series are converted into strings of symbols using Symbolic Aggregate ApproXimation (SAX); then, hashtags are clustered based on string similarity and temporal co-occurrence. Evaluation is performed on two reference datasets of semantically tagged hashtags. We also evaluate the complexity of our algorithm, since efficiency is a crucial performance factor when processing large-scale data streams such as Twitter.
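The SAX step named in the abstract can be illustrated with a minimal sketch. It follows the standard SAX recipe (z-normalization, piecewise aggregate approximation, discretization against Gaussian breakpoints) and is not the authors' implementation. The function names, the alphabet, and the toy hourly counts are hypothetical, and `match_ratio` is only an illustrative stand-in: the abstract does not specify the string similarity measure the paper actually uses.

```python
from bisect import bisect_right
from statistics import NormalDist, fmean, pstdev


def sax(series, n_segments, alphabet="abc"):
    """Convert a numeric time series into a SAX string (standard recipe)."""
    if len(series) % n_segments != 0:
        # Simplifying assumption for this sketch: equal-width segments only.
        raise ValueError("series length must be a multiple of n_segments")

    # 1. Z-normalize so that series with different volumes become comparable.
    mu, sigma = fmean(series), pstdev(series)
    z = [0.0 if sigma == 0 else (x - mu) / sigma for x in series]

    # 2. Piecewise aggregate approximation: one mean per segment.
    width = len(z) // n_segments
    paa = [fmean(z[i * width:(i + 1) * width]) for i in range(n_segments)]

    # 3. Discretize using breakpoints that split N(0, 1) into
    #    len(alphabet) equiprobable regions.
    k = len(alphabet)
    breakpoints = [NormalDist().inv_cdf(i / k) for i in range(1, k)]
    return "".join(alphabet[bisect_right(breakpoints, v)] for v in paa)


def match_ratio(s, t):
    """Hypothetical similarity: fraction of positions with equal symbols."""
    if len(s) != len(t):
        raise ValueError("strings must have equal length")
    return sum(a == b for a, b in zip(s, t)) / len(s)


# Toy counts for two hypothetical hashtags that peak in the same window.
tag_a = sax([0, 0, 0, 0, 10, 10, 0, 0], n_segments=4, alphabet="ab")
tag_b = sax([1, 1, 1, 1, 9, 11, 1, 1], n_segments=4, alphabet="ab")
similarity = match_ratio(tag_a, tag_b)
```

Two hashtags whose activity peaks in the same time window map to the same symbol string under this sketch, which is the intuition behind clustering on string similarity and temporal co-occurrence.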
Pages: 563-578
Page count: 16