pytwanalysis: Twitter Data Management And Analysis at Scale

Cited by: 0
Authors
Nogueira, Lia [1]
Tesic, Jelena [1]
Affiliations
[1] Texas State Univ, Dept Comp Sci, San Marcos, TX 78666 USA
Keywords
Graph Construction; Social Network Management; Graph Analysis; Community Discovery
DOI
10.1109/SNAMS53716.2021.9732079
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Trends and communities in social media networks shape news cycles, politics, public governance, and the economy. The way users interact in large social media networks holds a wealth of information, yet state-of-the-art mining of network data from platforms such as Twitter is limited by a narrow research focus or by computing power. In this paper, we describe a new end-to-end Twitter network data management pipeline. We propose a scalable way to gather, store, and model rich relationships from Twitter networks. We also propose to analyze Twitter data at scale by combining graph-clustering and topic modeling techniques, using multiple data science methods for graph construction and tweet data processing. We evaluate the proposed system on over 9 million tweets across five different Twitter datasets. We invite the community to add more features, as this end-to-end pipeline is released as an open-source GitHub repository, pytwanalysis [1], and as a Python pip package, pytwanalysis [2].
Pages: 101-108
Page count: 8