pytwanalysis: Twitter Data Management And Analysis at Scale

被引:0
|
作者
Nogueira, Lia [1 ]
Tesic, Jelena [1 ]
机构
[1] Texas State Univ, Dept Comp Sci, San Marcos, TX 78666 USA
关键词
Graph Construction; Social Network Management; Graph Analysis; Community Discovery;
D O I
10.1109/SNAMS53716.2021.9732079
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Trends and communities in social media networks shape news cycles, politics, public governing, and economy these days. There is a wealth of information in the way users interact in the large social media networks, and state-of-the-art of mining network data from e.g. Twitter platform is limited by the narrow field of research or computing power. In this paper, we describe the new end-to-end Twitter network data management pipeline. We propose a scalable way to gather, store, and model rich relationships from Twitter networks. We also propose to analyze Twitter data using a combination of graph-clustering and topic modeling techniques at scale using multiple data science methods for graph construction and tweet data processing. We evaluate the proposed system on over 9 million tweets over five different Twitter datasets. We invite the community to add more features, as this end to end pipeline is released as an open source gitHub repository pytwanalysis [1], and as a python pip package pytwanalysis [2].
引用
收藏
页码:101 / 108
页数:8
相关论文
共 50 条
  • [1] Extreme Event Detection and Management using Twitter Data Analysis
    Girish, K. K.
    Moni, Jeni
    Roy, Joel Gee
    Afreed, C. P.
    Harikrishnan, S.
    Kumar, Gokul G.
    [J]. 2022 INTERNATIONAL CONFERENCE ON DECISION AID SCIENCES AND APPLICATIONS (DASA), 2022, : 917 - 921
  • [2] DATA ANALYSIS OF TWITTER DATA
    Kasprikova, Nikola
    [J]. 13TH INTERNATIONAL DAYS OF STATISTICS AND ECONOMICS, 2019, : 678 - 686
  • [3] Twitter Sentiment Analysis for Large-Scale Data: An Unsupervised Approach
    Rafeeque Pandarachalil
    Selvaraju Sendhilkumar
    G. S. Mahalakshmi
    [J]. Cognitive Computation, 2015, 7 : 254 - 262
  • [4] Twitter Sentiment Analysis for Large-Scale Data: An Unsupervised Approach
    Pandarachalil, Rafeeque
    Sendhilkumar, Selvaraju
    Mahalakshmi, G. S.
    [J]. COGNITIVE COMPUTATION, 2015, 7 (02) : 254 - 262
  • [5] Large scale and parallel sentiment analysis based on Label Propagation in Twitter Data
    Yang, Yibing
    Shafiq, M. Omair
    [J]. 2018 17TH IEEE INTERNATIONAL CONFERENCE ON TRUST, SECURITY AND PRIVACY IN COMPUTING AND COMMUNICATIONS (IEEE TRUSTCOM) / 12TH IEEE INTERNATIONAL CONFERENCE ON BIG DATA SCIENCE AND ENGINEERING (IEEE BIGDATASE), 2018, : 1791 - 1798
  • [6] Twitter Analysis for Disaster Management
    Shah, Bhavya
    Agarwal, Varun
    Dubey, Utkarsh
    Correia, Stevina
    [J]. 2018 FOURTH INTERNATIONAL CONFERENCE ON COMPUTING COMMUNICATION CONTROL AND AUTOMATION (ICCUBEA), 2018,
  • [7] Sentiment Analysis of Twitter Data
    Desai, Radhi D.
    [J]. PROCEEDINGS OF THE 2018 SECOND INTERNATIONAL CONFERENCE ON INTELLIGENT COMPUTING AND CONTROL SYSTEMS (ICICCS), 2018, : 114 - 117
  • [8] On Analysis and Visualization of Twitter Data
    Ali, Fathelalem
    Shima, Yasuki
    [J]. HYBRID INTELLIGENT SYSTEMS, HIS 2015, 2016, 420 : 271 - 280
  • [9] Sentiment Analysis of Twitter Data
    Wang, Yili
    Guo, Jiaxuan
    Yuan, Chengsheng
    Li, Baozhu
    [J]. APPLIED SCIENCES-BASEL, 2022, 12 (22):
  • [10] Sentiment Analysis of Twitter Data
    El Rahman, Sahar A.
    AlOtaibi, Feddah Alhumaidi
    AlShehri, Wejdan Abdullah
    [J]. 2019 INTERNATIONAL CONFERENCE ON COMPUTER AND INFORMATION SCIENCES (ICCIS), 2019, : 336 - 339