Tweets opinion analysis integration: ETL modeling with MapReduce

Authors
Afef Walha [1]
Hana Mallek [2]
Faiza Ghozzi [1]
Faiez Gargouri [1]
Affiliations
[1] University of Sfax, Multimedia, InfoRmation systems and Advanced Computing Laboratory (MIRACL)
[2] University of Gabes, Higher Institute of Information Science and Multimedia of Gabes (ISIMG)
[3] University of Sfax, Higher Institute of Information Science and Multimedia of Sfax (ISIMS)
Keywords
Social media analytics; Twitter text analysis; Opinion mining; Big data integration; ETL modeling; MapReduce; Distributed computing
DOI
10.1007/s10586-024-04983-6
Abstract
The advent of social media has revolutionized the way people communicate and share information, creating new business opportunities and challenges. Social media platforms offer a valuable resource in user-generated content (UGC), which is widely used for opinion analysis and business intelligence. However, traditional integration methods, particularly Extract-Transform-Load (ETL) tasks, struggle to keep up with the volume, variety, and velocity of the big data generated on these platforms. This paper proposes MR_ETLSent, an ETL process model that adopts MapReduce to integrate opinions from a large volume of UGC data. It highlights the time, cost, and complexity involved in semantically analyzing informal, unstructured UGC texts and transforming them into sentiments. The approach provides reusable models for the complex process that extracts UGC text, cleans it, semantically analyzes it to detect sentiment, and loads the result into the data warehouse. Experiments with the MR_ETLSent components in the Hadoop framework indicate that the proposed MapReduce-based sentiment analysis method performs well on large sets of UGC while minimizing time and computing resources. Overall, the approach is scalable, efficient, and cost-effective, and it can be integrated into decision-making systems that analyze opinions and handle large volumes of data.
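
The extract, clean, analyze, and aggregate steps described in the abstract can be illustrated with a minimal in-memory sketch of the map and reduce pattern. This is not the MR_ETLSent implementation, whose details are not given in this record. The toy lexicon, the `topic` and `text` field names, and every function name below are assumptions made purely for illustration, and a lexicon lookup stands in for whatever semantic analysis the paper actually uses.

```python
import re
from collections import defaultdict

# Hypothetical toy lexicon (assumption); not taken from the paper.
LEXICON = {"good": 1, "great": 1, "love": 1, "bad": -1, "awful": -1, "hate": -1}


def clean(text):
    """Cleaning step: lowercase, drop URLs and @mentions, keep letter tokens."""
    text = re.sub(r"https?://\S+|@\w+", " ", text.lower())
    return re.sub(r"[^a-z\s]", " ", text).split()


def map_tweet(tweet):
    """Map step: emit one (topic, sentiment label) pair per tweet."""
    score = sum(LEXICON.get(token, 0) for token in clean(tweet["text"]))
    label = "positive" if score > 0 else "negative" if score < 0 else "neutral"
    yield tweet["topic"], label


def reduce_labels(topic, labels):
    """Reduce step: count the sentiment labels emitted for one topic."""
    counts = defaultdict(int)
    for label in labels:
        counts[label] += 1
    return topic, dict(counts)


def run(tweets):
    """Simulate the shuffle phase: group map output by key, then reduce."""
    groups = defaultdict(list)
    for tweet in tweets:
        for key, value in map_tweet(tweet):
            groups[key].append(value)
    return dict(reduce_labels(key, values) for key, values in groups.items())
```

In a real Hadoop deployment the grouping done in `run` would be handled by the framework's shuffle phase, and the per-topic counts would be loaded into the data warehouse rather than returned as a dictionary.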