Towards Social Data Platform: Automatic Topic-focused Monitor for Twitter Stream

被引:14
|
作者
Li, Rui [1 ]
Wang, Shengjie [1 ]
Chang, Kevin Chen-Chuan [1 ,2 ]
机构
[1] Univ Illinois, Dept Comp Sci, Urbana, IL 60680 USA
[2] Adv Digital Sci Ctr, Singapore, Singapore
来源
PROCEEDINGS OF THE VLDB ENDOWMENT | 2013年 / 6卷 / 14期
基金
美国国家科学基金会;
关键词
D O I
10.14778/2556549.2556577
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Many novel applications have been built based on analyzing tweets about specific topics. While these applications provide different kinds of analysis, they share a common task of monitoring "target" tweets from the Twitter stream for a topic. The current solution for this task tracks a set of manually selected keywords with Twitter APIs. Obviously, this manual approach has many limitations. In this paper, we propose a data platform to automatically monitor target tweets from the Twitter stream for any given topic. To monitor target tweets in an optimal and continuous way, we design Automatic Topic-focused Monitor (ATM), which iteratively 1) samples tweets from the stream and 2) selects keywords to track based on the samples. To realize ATM, we develop a tweet sampling algorithm to sample sufficient unbiased tweets with available Twitter APIs, and a keyword selection algorithm to efficiently select keywords that have a near-optimal coverage of target tweets under cost constraints. We conduct extensive experiments to show the effectiveness of ATM. E.g., ATM covers 90% of target tweets for a topic and improves the manual approach by 49%.
引用
收藏
页码:1966 / 1977
页数:12
相关论文
共 23 条
  • [1] A topic-focused trust model for Twitter
    Zhao, Liang
    Hua, Ting
    Lu, Chang-Tien
    Chen, Ing-Ray
    [J]. COMPUTER COMMUNICATIONS, 2016, 76 : 1 - 11
  • [2] Automatic Unsupervised Polarity Detection on a Twitter Data Stream
    Terrana, Diego
    Augello, Agnese
    Pilato, Giovanni
    [J]. 2014 IEEE INTERNATIONAL CONFERENCE ON SEMANTIC COMPUTING (ICSC), 2014, : 128 - 134
  • [3] Automatic Detection of Cyber Security Events from Turkish Twitter Stream and Newspaper Data
    Ural, Ozgur
    Acarturk, Cengiz
    [J]. ICISSP: PROCEEDINGS OF THE 7TH INTERNATIONAL CONFERENCE ON INFORMATION SYSTEMS SECURITY AND PRIVACY, 2021, : 66 - 76
  • [4] Automatic Classification of Sexism in Social Networks: An Empirical Study on Twitter Data
    Rodriguez-Sanchez, Francisco
    Carrillo-de-Albornoz, Jorge
    Plaza, Laura
    [J]. IEEE ACCESS, 2020, 8 : 219563 - 219576
  • [5] Towards an Efficient Platform for Social Big Data Analytics
    Wang, Jenq-Haur
    Chen, Kuan-Ting
    [J]. 2015 24TH WIRELESS AND OPTICAL COMMUNICATION CONFERENCE (WOCC), 2015, : 175 - 179
  • [6] Exploring Food Waste Conversations on Social Media: A Sentiment, Emotion, and Topic Analysis of Twitter Data
    Jenkins, Eva L.
    Lukose, Dickson
    Brennan, Linda
    Molenaar, Annika
    McCaffrey, Tracy A.
    [J]. SUSTAINABILITY, 2023, 15 (18)
  • [7] Discovering Hot Topics using Twitter Streaming Data Social Topic Detection and Geographic Clustering
    Kim, Hwi-Gang
    Lee, Seongjoo
    Kyeong, Sunghyon
    [J]. 2013 IEEE/ACM INTERNATIONAL CONFERENCE ON ADVANCES IN SOCIAL NETWORKS ANALYSIS AND MINING (ASONAM), 2013, : 1215 - 1220
  • [8] Corporate Social Responsibility (CSR): A Survey of Topics and Trends Using Twitter Data and Topic Modeling
    Chae, Bongsug
    Park, Eunhye
    [J]. SUSTAINABILITY, 2018, 10 (07)
  • [9] Social Media Data Processing Infrastructure by Using Apache Spark Big Data Platform: Twitter Data Analysis
    Podhoranyi, Michal
    Vojacek, Lukas
    [J]. 2019 4TH INTERNATIONAL CONFERENCE ON CLOUD COMPUTING AND INTERNET OF THINGS (CCIOT 2019), 2019, : 1 - 6
  • [10] Poster: Sentiment Analysis of Twitter Data: Towards Filtering, Analyzing and Interpreting Social Network Data
    Branz, Lisa
    Brockmann, Patricia
    [J]. DEBS'18: PROCEEDINGS OF THE 12TH ACM INTERNATIONAL CONFERENCE ON DISTRIBUTED AND EVENT-BASED SYSTEMS, 2018, : 238 - 241