STREAM TEXT DATA ANALYSIS ON TWITTER USING APACHE SPARK STREAMING

被引:0
|
作者
Hakdagli, Ozlem [1 ]
Ozcan, Caner [2 ]
Ogul, Iskender Ulgen [3 ]
机构
[1] Karabuk Univ, Bilgisayar Muhendisligi, Karabuk, Turkey
[2] Purdue Univ, Elekt & Bilgisayar Muhendisligi, W Lafayette, IN 47907 USA
[3] Izmir Yuksek Teknol Enstitusu, Bilgisayar Muhendisligi, Izmir, Turkey
关键词
Apache Spark; Spark Streaming; Twitter; Machine Learning; Text Mining;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
With today's developing technology, people's access to information and its production have reached a very fast level. These generated and obtained information are instantly created, entered into data systems and updated. Sources of streaming data can be transformed into valuable analysis results when they are handled with targeted methods. In this study, a text data field is determined to perform analysis on instantaneous generated data and Twitter, the richest platform for instant text data, is used. Twitter instantly generates a variety of data in large quantities and it presents it as open source using an API. A machine learning framework Apache Spark's stream analysis environment is used to analyze these resources. Situation analysis was performed using Support Vector Machine, Decision Trees and Logistic Regression algorithms presented under this environment. The results are presented in tables.
引用
收藏
页数:4
相关论文
共 50 条
  • [1] Sentiment Analysis on Twitter Data using Apache Spark Framework
    Elzayady, Hossam
    Badran, Khaled M.
    Salama, Gouda I.
    [J]. PROCEEDINGS OF 2018 13TH INTERNATIONAL CONFERENCE ON COMPUTER ENGINEERING AND SYSTEMS (ICCES), 2018, : 171 - 176
  • [2] An Apache Spark Implementation for Sentiment Analysis on Twitter Data
    Baltas, Alexandros
    Kanavos, Andreas
    Tsakalidis, Athanasios K.
    [J]. ALGORITHMIC ASPECTS OF CLOUD COMPUTING, ALGOCLOUD 2016, 2017, 10230 : 15 - 25
  • [3] Trending Pattern Analysis of Twitter Using Spark Streaming
    Garg, Prachi
    Johari, Rahul
    Kumar, Hemang
    Bhatia, Riya
    [J]. APPLICATIONS OF COMPUTING AND COMMUNICATION TECHNOLOGIES, ICACCT 2018, 2018, 899 : 3 - 13
  • [4] Social Media Data Processing Infrastructure by Using Apache Spark Big Data Platform: Twitter Data Analysis
    Podhoranyi, Michal
    Vojacek, Lukas
    [J]. 2019 4TH INTERNATIONAL CONFERENCE ON CLOUD COMPUTING AND INTERNET OF THINGS (CCIOT 2019), 2019, : 1 - 6
  • [5] Framework for Error Detection & its Localization in Sensor Data Stream for reliable big sensor data analytics using Apache Spark Streaming
    Gupta, Govind P.
    Khedwal, Jahanvi
    [J]. INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND DATA SCIENCE, 2020, 167 : 2337 - 2342
  • [6] Road Traffic Event Detection Using Twitter Data, Machine Learning, and Apache Spark
    Alomari, Ebtesam
    Mehmood, Rashid
    Katib, Iyad
    [J]. 2019 IEEE SMARTWORLD, UBIQUITOUS INTELLIGENCE & COMPUTING, ADVANCED & TRUSTED COMPUTING, SCALABLE COMPUTING & COMMUNICATIONS, CLOUD & BIG DATA COMPUTING, INTERNET OF PEOPLE AND SMART CITY INNOVATION (SMARTWORLD/SCALCOM/UIC/ATC/CBDCOM/IOP/SCI 2019), 2019, : 1888 - 1895
  • [7] Low latency analytics for streaming traffic data with Apache Spark
    Maarala, Altti Ilari
    Rautiainen, Mika
    Salmi, Miikka
    Pirttikangas, Susanna
    Riekki, Jukka
    [J]. PROCEEDINGS 2015 IEEE INTERNATIONAL CONFERENCE ON BIG DATA, 2015, : 2855 - 2858
  • [8] Using Apache Spark to Collect Analytic from the Streaming Data Processing Application Logs
    Evgenyevich, Golovanov Mikhail
    Valerievich, Bakulev Aleksandr
    Alekseevna, Bakuleva Marina
    [J]. 2018 7TH MEDITERRANEAN CONFERENCE ON EMBEDDED COMPUTING (MECO), 2018, : 238 - 241
  • [9] Streaming Distributed DNA Sequence Alignment Using Apache Spark
    Mushtaq, Hamid
    Ahmed, Nauman
    Al-Ars, Zaid
    [J]. 2017 IEEE 17TH INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOENGINEERING (BIBE), 2017, : 188 - 193
  • [10] Real-time Data Streaming using Apache Spark on Fully Configured Hadoop Cluster
    Prasad, Kashi Sai
    Pasupathy, S.
    [J]. JOURNAL OF MECHANICS OF CONTINUA AND MATHEMATICAL SCIENCES, 2018, 13 (05): : 164 - 176