Graph node rank based important keyword detection from Twitter

Cited by: 2
Authors
Kumar, Mukesh [1]
Rehan, Palak [1]
Affiliations
[1] Panjab Univ, Univ Inst Engn & Technol, Dept Comp Sci & Engn, Chandigarh, India
Keywords
Graphs; Normalization; Social media; Spanning trees; Centrality; Extraction
DOI
10.1016/j.aci.2018.08.002
Chinese Library Classification (CLC) number
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
Social media networks such as Twitter, Facebook and WhatsApp are among the most commonly used media for sharing news and opinions and for staying in touch with peers. Messages on Twitter are limited to 140 characters, which has led users to create their own novel syntax in tweets to express more in fewer words. The free writing style, URLs, markup syntax, inappropriate punctuation, ungrammatical structures and abbreviations make it harder to mine useful information from tweets. For each tweet, we can get an explicit time stamp, the name of the user, the social network the user belongs to, and even the GPS coordinates if the tweet is created with a GPS-enabled mobile device. With these features, Twitter is by nature a good resource for detecting and analyzing real-time events happening around the world. By using the speed and coverage of Twitter, we can detect events, each a sequence of important keywords being talked about, in a timely manner; this can be used in applications such as natural calamity relief support, earthquake relief support, product launches and suspicious activity detection. Keyword detection from Twitter can be seen as a two-step process: detection of keywords in raw text form (words as posted by the users) and keyword normalization (reforming the users' unstructured words into complete, meaningful English words). In this paper, a keyword detection technique based on a graph, a spanning tree and the PageRank algorithm is proposed. A text normalization technique based on a hybrid approach using Levenshtein distance, the demetaphone algorithm and dictionary mapping is proposed to work on the unstructured keywords produced by the proposed keyword detector. The proposed normalization technique is validated using the standard lexnorm 1.2 dataset. The proposed system is used to detect keywords from Twitter text posted in real time. The detected and normalized keywords are further validated against search engine results at a later time for detection of events.
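The two-step process described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the co-occurrence graph construction, the damping factor, the `max_distance` threshold and all function names are assumptions made for illustration, and the spanning tree and phonetic (demetaphone) steps mentioned in the abstract are omitted. Words that share a tweet are linked in a graph and ranked with PageRank; out-of-vocabulary words are then normalized by dictionary mapping with a Levenshtein-distance fallback.

```python
from collections import defaultdict
from itertools import combinations


def build_cooccurrence_graph(tweets):
    """Assumed graph model: words are nodes, an edge links words that share a tweet."""
    graph = defaultdict(lambda: defaultdict(float))
    for tweet in tweets:
        words = sorted(set(tweet.lower().split()))
        for a, b in combinations(words, 2):
            graph[a][b] += 1.0
            graph[b][a] += 1.0
    return graph


def pagerank(graph, damping=0.85, iterations=50):
    """Weighted PageRank by power iteration on an undirected graph."""
    nodes = list(graph)
    rank = {n: 1.0 / len(nodes) for n in nodes}
    for _ in range(iterations):
        new_rank = {}
        for n in nodes:
            # Each neighbour m passes on a share of its rank proportional
            # to the weight of the edge (m, n).
            incoming = sum(
                rank[m] * w / sum(graph[m].values())
                for m, w in graph[n].items()
            )
            new_rank[n] = (1 - damping) / len(nodes) + damping * incoming
        rank = new_rank
    return rank


def levenshtein(a, b):
    """Edit distance (insertions, deletions, substitutions) between two strings."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, 1):
        cur = [i]
        for j, cb in enumerate(b, 1):
            cur.append(min(prev[j] + 1, cur[j - 1] + 1, prev[j - 1] + (ca != cb)))
        prev = cur
    return prev[-1]


def normalize(word, slang_map, vocabulary, max_distance=2):
    """Dictionary mapping first, then the closest vocabulary word by edit distance."""
    if word in slang_map:
        return slang_map[word]
    if word in vocabulary or not vocabulary:
        return word
    best = min(sorted(vocabulary), key=lambda v: levenshtein(word, v))
    return best if levenshtein(word, best) <= max_distance else word


# Hypothetical sample data, for illustration only.
tweets = ["earthquake relief", "earthquake support", "earthquake news"]
ranks = pagerank(build_cooccurrence_graph(tweets))
top_keyword = max(ranks, key=ranks.get)
```

In this toy input every tweet mentions "earthquake", so that word is the hub of the graph and receives the highest rank. A call such as `normalize("earthquak", {}, ["earthquake", "relief"])` would then map the misspelled keyword to its dictionary form.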
Pages: 194-209
Number of pages: 16
Related papers
50 in total
  • [1] Twitter User Rank Using Keyword Search
    Noro, Tomoya
    Ru, Fei
    Xiao, Feng
    Tokuda, Takehiro
    INFORMATION MODELLING AND KNOWLEDGE BASES XXIV, 2013, 251 : 31 - 48
  • [2] Graph based sentiment analysis using keyword rank based polarity assignment
    Monali Bordoloi
    Saroj Kr. Biswas
    Multimedia Tools and Applications, 2020, 79 : 36033 - 36062
  • [3] Graph based sentiment analysis using keyword rank based polarity assignment
    Bordoloi, Monali
    Biswas, Saroj Kr.
    MULTIMEDIA TOOLS AND APPLICATIONS, 2020, 79 (47-48) : 36033 - 36062
  • [4] Implementing Graph Based Rank on Online News Media Keyword Extraction
    Syafiandini, Arida Ferti
    Mustika, Hani Febri
    Manik, Lindung Parningotan
    Rianto, Yan
    Akbar, Zaenal
    2019 INTERNATIONAL CONFERENCE ON COMPUTER, CONTROL, INFORMATICS AND ITS APPLICATIONS (IC3INA), 2019, : 108 - 113
  • [5] A graph based keyword extraction model using collective node weight
    Biswas, Saroj Kr.
    Bordoloi, Monali
    Shreya, Jacob
    EXPERT SYSTEMS WITH APPLICATIONS, 2018, 97 : 51 - 59
  • [6] Event Detection in Twitter: A Keyword Volume Approach
    Hossny, Ahmad Hany
    Mitchell, Lewis
    2018 18TH IEEE INTERNATIONAL CONFERENCE ON DATA MINING WORKSHOPS (ICDMW), 2018, : 1200 - 1208
  • [7] NE-Rank: A Novel Graph-based Keyphrase Extraction in Twitter
    Bellaachia, Abdelghani
    Al-Dhelaan, Mohammed
    2012 IEEE/WIC/ACM INTERNATIONAL CONFERENCE ON WEB INTELLIGENCE AND INTELLIGENT AGENT TECHNOLOGY (WI-IAT 2012), VOL 1, 2012, : 372 - 379
  • [8] Expert Profile Identification From Community Detection on Author-Publication-Keyword Graph With Keyword Extraction
    Fu, William
    Akbar, Saiful
    IEEE ACCESS, 2024, 12 : 27918 - 27930
  • [9] A Keyword Extraction Scheme from CQI Based on Graph Centrality
    Pheaktra, They
    Lim, JongBeom
    Lee, JongHyuk
    Gil, Joon-Min
    ADVANCED MULTIMEDIA AND UBIQUITOUS ENGINEERING, 2020, 590 : 158 - 163
  • [10] Graph based Ranked Answers for Keyword Graph Structure
    Nidhi R. Arora
    Wookey Lee
    New Generation Computing, 2013, 31 : 115 - 134