An algorithmic approach to rank the disambiguous entities in Twitter streams for effective semantic search operations

被引:0
|
作者
N Senthil Kumar
M Dinakaran
机构
[1] VIT University,School of Information Technology and Engineering
来源
Sādhanā | 2020年 / 45卷
关键词
Entity linking; entity disambiguation; Latent Dirichlet Allocation; semantic similarity; DBpedia ontology;
D O I
暂无
中图分类号
学科分类号
摘要
The most challenging task in any modern reasoning system is that it has been completely relied on automatic knowledge acquisition from the unstructured text and filtering out the structured information from it has turned out to be the most crucial task of Information Retrieval systems. In this paper, we have proposed a system that can recognize the potential named entities from the Twitter streams and link them to the appropriate real world knowledge entities. Besides, it has performed many semantic functions such as entity disambiguation, contextual similarity, type induction, and semantic labeling, to augment the semantic score of the entity and provide the rich entity feature space to quantitatively enhance entity retrieval accuracy. Nevertheless, we have leveraged a model to alleviate the entity imbalance present over the collected Twitter Streams and effectively utilized the contextual relatedness between the candidate entity sets. Eventually, we have proposed a probabilistic approach to deal with topic modeling and effectively disambiguate the entities by clustering the entities into its appropriate entity domain. The proposed Latent Dirichlet Allocation (LDA) model has been categorically distinguished the topics for clustering between the candidate entities and fix the exact true mentions occurred in the Knowledge Base such as DBpedia. We have also demonstrated the performance and accuracy rate of the proposed system and evaluated the results with the collected Twitter Streams for the month of August, 2016. The empirical results have shown that it has outperformed the existing state-of-the-art systems and proved that the proposed system given here has gradual accuracy rate against the conventional systems.
引用
收藏
相关论文
共 7 条