Tweets opinion analysis integration: ETL modeling with MapReduce

被引:0
|
作者
Afef Walha [1 ]
Hana Mallek [2 ]
Faiza Ghozzi [1 ]
Faiez Gargouri [1 ]
机构
[1] University of Sfax,Multimedia, InfoRmation systems and Advanced Computing Laboratory (MIRACL)
[2] University of Gabes,Higher Institute of Information Science and Multimedia of Gabes (ISIMG)
[3] University of Sfax,Higher Institute of Information Science and Multimedia of Sfax (ISIMS)
关键词
Social media analytics; Twitter text analysis; Opinion mining; Big data integration; ETL modeling; MapReduce; Distributed computing;
D O I
10.1007/s10586-024-04983-6
中图分类号
学科分类号
摘要
The advent of social media has revolutionized the way people communicate and share information, leading to new business opportunities and challenges. Social media platforms offer a valuable resource in user-generated content (UGC), widely used for opinion analysis and business intelligence. However, traditional integration methods, particularly Extract-Transform-Load (ETL) tasks, need help to keep up with the vast volume, variety, and speed of big data generated on these platforms. This paper proposes MR_ETLSent, an ETL process model adopting MapReduce to perform opinion integration of a large volume of UGC data. It underlines the problem of covering time, cost, and complexity to semantically analyze informal and unstructured UGC texts and transform them into sentiments. This approach provides reusable models for the complex process that extracts UGC text, cleans it, semantically analyzes its data to detect sentiment, and transforms it into the data warehouse. The experimentation results of MR_ETLSent components in the Hadoop framework indicate that the proposed sentiment analysis method based on MapReduce performs well with large sets of UGC while minimizing time and computing resources. Overall, our approach is scalable, efficient, and cost-effective and can be integrated into decision-making systems that analyze opinions and handle large volumes of data.
引用
收藏
相关论文
共 50 条
  • [1] P-ETL: Parallel-ETL based on the MapReduce paradigm
    Bala, Mahfoud
    Boussaid, Omar
    Alimazighi, Zaia
    2014 IEEE/ACS 11TH INTERNATIONAL CONFERENCE ON COMPUTER SYSTEMS AND APPLICATIONS (AICCSA), 2014, : 42 - 49
  • [2] ETL Design Toward Social Network Opinion Analysis
    Walha, Afef
    Ghozzi, Faiza
    Gargouri, Faiez
    COMPUTER AND INFORMATION SCIENCE 2015, 2016, 614 : 235 - 249
  • [3] A Lexicon Approach to Multidimensional Analysis of Tweets Opinion
    Walha, Afef
    Ghozzi, Faiza
    Gargouri, Faiez
    2016 IEEE/ACS 13TH INTERNATIONAL CONFERENCE OF COMPUTER SYSTEMS AND APPLICATIONS (AICCSA), 2016,
  • [4] Sentiment Analysis of Arabic Tweets: Opinion Target Extraction
    Salima, Behdenna
    Fatiha, Barigou
    Ghalem, Belalem
    MODELLING AND IMPLEMENTATION OF COMPLEX SYSTEMS, 2019, 64 : 158 - 167
  • [5] MapReduce-based Dimensional ETL Made Easy
    Liu, Xiufeng
    Thomsen, Christian
    Pedersen, Torben Bach
    PROCEEDINGS OF THE VLDB ENDOWMENT, 2012, 5 (12): : 1882 - 1885
  • [6] Public opinion monitoring through collective semantic analysis of tweets
    Dionysios Karamouzas
    Ioannis Mademlis
    Ioannis Pitas
    Social Network Analysis and Mining, 2022, 12
  • [7] Monitoring Negative Opinion about Vaccines from Tweets Analysis
    D'Andrea, Eleonora
    Ducange, Pietro
    Marcelloni, Francesco
    2017 THIRD IEEE INTERNATIONAL CONFERENCE ON RESEARCH IN COMPUTATIONAL INTELLIGENCE AND COMMUNICATION NETWORKS (ICRCICN), 2017, : 186 - 191
  • [8] Public opinion monitoring through collective semantic analysis of tweets
    Karamouzas, Dionysios
    Mademlis, Ioannis
    Pitas, Ioannis
    SOCIAL NETWORK ANALYSIS AND MINING, 2022, 12 (01)
  • [9] Opinion Mining in MapReduce Framework
    Cho, Kyung Soo
    Lim, Ji Yeon
    Yoon, Jae Yeol
    Kim, Young Hee
    Kim, Seung Kwan
    Kim, Ung Mo
    SECURE AND TRUST COMPUTING, DATA MANAGEMENT, AND APPLICATIONS, 2011, 187 : 50 - 55
  • [10] Monitoring the public opinion about the vaccination topic from tweets analysis
    D'Andrea, Eleonora
    Ducange, Pietro
    Bechini, Alessio
    Renda, Alessandro
    Marcelloni, Francesco
    EXPERT SYSTEMS WITH APPLICATIONS, 2019, 116 : 209 - 226