Graph node rank based important keyword detection from Twitter

被引:2
|
作者
Kumar, Mukesh [1 ]
Rehan, Palak [1 ]
机构
[1] Panjab Univ, Univ Inst Engn & Technol, Dept Comp Sci & Engn, Chandigarh, India
关键词
Graphs; Normalization; Social media; Spanning trees; CENTRALITY; EXTRACTION;
D O I
10.1016/j.aci.2018.08.002
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Social media networks like Twitter, Facebook, WhatsApp etc. are most commonly used medium for sharing news, opinions and to stay in touch with peers. Messages on twitter are limited to 140 characters. This led users to create their own novel syntax in tweets to express more in lesser words. Free writing style, use of URLs, markup syntax, inappropriate punctuations, ungrammatical structures, abbreviations etc. makes it harder to mine useful information from them. For each tweet, we can get an explicit time stamp, the name of the user, the social network the user belongs to, or even the GPS coordinates if the tweet is created with a GPS-enabled mobile device. With these features, Twitter is, in nature, a good resource for detecting and analyzing the real time events happening around the world. By using the speed and coverage of Twitter, we can detect events, a sequence of important keywords being talked, in a timely manner which can be used in different applications like natural calamity relief support, earthquake relief support, product launches, suspicious activity detection etc. The keyword detection process from Twitter can be seen as a two step process: detection of keyword in the raw text form (words as posted by the users) and keyword normalization process (reforming the users' unstructured words in the complete meaningful English language words). In this paper a keyword detection technique based upon the graph, spanning tree and Page Rank algorithm is proposed. A text normalization technique based upon hybrid approach using Levenshtein distance, demetaphone algorithm and dictionary mapping is proposed to work upon the unstructured keywords as produced by the proposed keyword detector. The proposed normalization technique is validated using the standard lexnorm 1.2 dataset. The proposed system is used to detect the keywords from Twiter text being posted at real time. The detected and normalized keywords are further validated from the search engine results at later time for detection of events.
引用
收藏
页码:194 / 209
页数:16
相关论文
共 50 条
  • [41] Detection of rumor conversations in Twitter using graph convolutional networks
    Serveh Lotfi
    Mitra Mirzarezaee
    Mehdi Hosseinzadeh
    Vahid Seydi
    Applied Intelligence, 2021, 51 : 4774 - 4787
  • [42] Detection of hateful twitter users with graph convolutional network model
    Anıl Utku
    Umit Can
    Serpil Aslan
    Earth Science Informatics, 2023, 16 : 329 - 343
  • [43] Detection of Fake Twitter Followers using Graph Centrality Measures
    Mehrotra, Ashish
    Sarreddy, Mallidi
    Singh, Sanjay
    PROCEEDINGS OF THE 2016 2ND INTERNATIONAL CONFERENCE ON CONTEMPORARY COMPUTING AND INFORMATICS (IC3I), 2016, : 499 - 504
  • [44] Detection of hateful twitter users with graph convolutional network model
    Utku, Anil
    Can, Umit
    Aslan, Serpil
    EARTH SCIENCE INFORMATICS, 2023, 16 (01) : 329 - 343
  • [45] BotRGCN: Twitter Bot Detection with Relational Graph Convolutional Networks
    Feng, Shangbin
    Wan, Herun
    Wang, Ningnan
    Luo, Minnan
    PROCEEDINGS OF THE 2021 IEEE/ACM INTERNATIONAL CONFERENCE ON ADVANCES IN SOCIAL NETWORKS ANALYSIS AND MINING, ASONAM 2021, 2021, : 236 - 239
  • [46] Out-of-Distribution Node Detection Based on Graph Heat Kernel Diffusion
    Li, Fangfang
    Wang, Yangshuai
    Du, Xinyu
    Li, Xiaohua
    Yu, Ge
    MATHEMATICS, 2024, 12 (18)
  • [47] Detection of rumor conversations in Twitter using graph convolutional networks
    Lotfi, Serveh
    Mirzarezaee, Mitra
    Hosseinzadeh, Mehdi
    Seydi, Vahid
    APPLIED INTELLIGENCE, 2021, 51 (07) : 4774 - 4787
  • [48] Influential Node Detection on Graph on Event Sequence
    Lu, Zehao
    Wang, Shihan
    Ren, Xiao-Long
    Costas, Rodrigo
    Metze, Tamara
    COMPLEX NETWORKS & THEIR APPLICATIONS XII, VOL 3, COMPLEX NETWORKS 2023, 2024, 1143 : 147 - 158
  • [49] Graph Anomaly Detection with Adaptive Node Mixup
    Zhou, Qinghai
    Chen, Yuzhong
    Xu, Zhe
    Wu, Yuhang
    Pan, Menghai
    Das, Mahashweta
    Yang, Hao
    Tong, Hanghang
    PROCEEDINGS OF THE 33RD ACM INTERNATIONAL CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, CIKM 2024, 2024, : 3494 - 3504
  • [50] From Automatic Keyword Detection to Ontology-Based Topic Modeling
    Beck, Marc
    Rizvi, Syed Tahseen Raza
    Dengel, Andreas
    Ahmed, Sheraz
    DOCUMENT ANALYSIS SYSTEMS, 2020, 12116 : 451 - 465