Discovering Hot Topics using Twitter Streaming Data Social Topic Detection and Geographic Clustering

被引:0
|
作者
Kim, Hwi-Gang [1 ]
Lee, Seongjoo [2 ]
Kyeong, Sunghyon [1 ]
机构
[1] Natl Inst Math Sci, Math Analyt Team, Taejon, South Korea
[2] Yonsei Univ, Dept Comp Sci, Seoul 120749, South Korea
关键词
social network analysis; Twitter streaming data; social topic detection; geographic clustering;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
There has been an increasing interest in analyzing social network services data. However, detecting social topics in the era of information explosion requires state-of-the-art analytics techniques. The geographic clustering analysis based on social topics across provinces, i.e., states, has rarely been studied. Using the Twitter data collected in the United States (US), we detected the social hot topic by using the ratio of word frequency. Also, we found geographic communities by correlating the time series for a set of topic words across US states. The result of the geographic clustering was visualized using the Google Fusion Table. In conclusion, the ratio of word frequency properly detects social topics or breaking news while suppressing daily tweeted small talks or emotional words such as lol, like, and love. We have also demonstrated that a clustering algorithm based on a social topic can be useful in classifying social communities.
引用
收藏
页码:1215 / 1220
页数:6
相关论文
共 50 条
  • [1] Hot Topic Detection Using Twitter Streaming Data
    Jagic, Teodor
    Brkic, Ljiljana
    [J]. 2020 43RD INTERNATIONAL CONVENTION ON INFORMATION, COMMUNICATION AND ELECTRONIC TECHNOLOGY (MIPRO 2020), 2020, : 1730 - 1735
  • [2] Hot Topic Detection on Twitter Data Streams with Incremental Clustering Using Named Entities and Central Centroids
    Son Nguyen
    Bao Ngo
    Chau Vo
    Tru Cao
    [J]. 2019 IEEE - RIVF INTERNATIONAL CONFERENCE ON COMPUTING AND COMMUNICATION TECHNOLOGIES (RIVF), 2019, : 88 - 93
  • [3] Discovering Health Topics in Social Media Using Topic Models
    Paul, Michael J.
    Dredze, Mark
    [J]. PLOS ONE, 2014, 9 (08):
  • [4] Visualization and performance measure to determine number of topics in twitter data clustering using hybrid topic modeling
    Noorullah, R. M.
    Mohammed, Moulana
    [J]. JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2021, 41 (01) : 803 - 817
  • [5] Corporate Social Responsibility (CSR): A Survey of Topics and Trends Using Twitter Data and Topic Modeling
    Chae, Bongsug
    Park, Eunhye
    [J]. SUSTAINABILITY, 2018, 10 (07)
  • [6] Online Clustering for Topic Detection in Social Data Streams
    Comito, Carmela
    Pizzuti, Clara
    Procopio, Nicola
    [J]. 2016 IEEE 28TH INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE (ICTAI 2016), 2016, : 362 - 369
  • [7] Twitter spammer detection using data stream clustering
    Miller, Zachary
    Dickinson, Brian
    Deitrick, William
    Hu, Wei
    Wang, Alex Hai
    [J]. INFORMATION SCIENCES, 2014, 260 : 64 - 73
  • [8] Detection of Hot Topics Using Multi-view Text Clustering
    Fraj, Maha
    Ben Hajkacem, Mohamed Aymen
    Essoussi, Nadia
    [J]. INFORMATION INTEGRATION AND WEB INTELLIGENCE, IIWAS 2022, 2022, 13635 : 548 - 558
  • [9] SRTM: A Sparse RNN-Topic Model for Discovering Bursty Topics in Big Data of Social Networks
    Shi, Lei
    Du, Jun-Ping
    Liang, Mei-Yu
    Kou, Fei-Fei
    [J]. JOURNAL OF INFORMATION SCIENCE AND ENGINEERING, 2019, 35 (04) : 749 - 767
  • [10] A Study on Topics Identification on Twitter using Clustering Algorithms
    Klinczak, Marjori N. M.
    Kaestner, Celso A. A.
    [J]. 2015 LATIN AMERICA CONGRESS ON COMPUTATIONAL INTELLIGENCE (LA-CCI), 2015,