Tweets opinion analysis integration: ETL modeling with MapReduce

被引:0
|
作者
Afef Walha [1 ]
Hana Mallek [2 ]
Faiza Ghozzi [1 ]
Faiez Gargouri [1 ]
机构
[1] University of Sfax,Multimedia, InfoRmation systems and Advanced Computing Laboratory (MIRACL)
[2] University of Gabes,Higher Institute of Information Science and Multimedia of Gabes (ISIMG)
[3] University of Sfax,Higher Institute of Information Science and Multimedia of Sfax (ISIMS)
关键词
Social media analytics; Twitter text analysis; Opinion mining; Big data integration; ETL modeling; MapReduce; Distributed computing;
D O I
10.1007/s10586-024-04983-6
中图分类号
学科分类号
摘要
The advent of social media has revolutionized the way people communicate and share information, leading to new business opportunities and challenges. Social media platforms offer a valuable resource in user-generated content (UGC), widely used for opinion analysis and business intelligence. However, traditional integration methods, particularly Extract-Transform-Load (ETL) tasks, need help to keep up with the vast volume, variety, and speed of big data generated on these platforms. This paper proposes MR_ETLSent, an ETL process model adopting MapReduce to perform opinion integration of a large volume of UGC data. It underlines the problem of covering time, cost, and complexity to semantically analyze informal and unstructured UGC texts and transform them into sentiments. This approach provides reusable models for the complex process that extracts UGC text, cleans it, semantically analyzes its data to detect sentiment, and transforms it into the data warehouse. The experimentation results of MR_ETLSent components in the Hadoop framework indicate that the proposed sentiment analysis method based on MapReduce performs well with large sets of UGC while minimizing time and computing resources. Overall, our approach is scalable, efficient, and cost-effective and can be integrated into decision-making systems that analyze opinions and handle large volumes of data.
引用
收藏
相关论文
共 50 条
  • [41] Advanced ETL (AETL) by integration of PERL and scripting method
    Tiwari, Prayag
    2016 INTERNATIONAL CONFERENCE ON INVENTIVE COMPUTATION TECHNOLOGIES (ICICT), VOL 3, 2015, : 256 - 260
  • [42] Analyzing Tweets to Understand Factors Affecting Opinion on Climate Change
    Mohith, S.
    Jose, Jackson I.
    Khetarpaul, Sonia
    Sharma, Dolly
    DATABASES THEORY AND APPLICATIONS (ADC 2021), 2021, 12610 : 99 - 110
  • [43] From Unlabelled Tweets to Twitter-specific Opinion Words
    Bravo-Marquez, Felipe
    Frank, Eibe
    Pfahringer, Bernhard
    SIGIR 2015: PROCEEDINGS OF THE 38TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 2015, : 743 - 746
  • [44] Opinion Mining about a Product by Analyzing Public Tweets in Twitter
    Das, T. K.
    Acharjya, D. P.
    Patra, M. R.
    2014 INTERNATIONAL CONFERENCE ON COMPUTER COMMUNICATION AND INFORMATICS (ICCCI), 2014,
  • [45] Using short URLs in tweets to improve Twitter opinion mining
    Pavel, Andrei
    Palade, Vasile
    Iqbal, Rahat
    Hintea, Diana
    2017 16TH IEEE INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS (ICMLA), 2017, : 965 - 970
  • [46] Entity-based Opinion Mining from Spanish Tweets
    Paniagua-Reyes, Fabian
    Reyes-Ortiz, Jose A.
    Bravo, Maricela
    PROCEEDINGS OF THE 6TH INTERNATIONAL CONFERENCE ON DATA SCIENCE, TECHNOLOGY AND APPLICATIONS (DATA), 2017, : 400 - 407
  • [47] DEMONETIZATION: A VISUAL EXPLORATION AND PATTERN IDENTIFICATION OF PEOPLE OPINION ON TWEETS
    BalaAnand, M.
    Karthikeyan, N.
    Karthick, S.
    Sivaparthipan, C. B.
    IEEE INTERNATIONAL CONFERENCE ON SOFT-COMPUTING AND NETWORK SECURITY (ICSNS 2018), 2018, : 92 - 98
  • [48] SVM based approach for opinion classification in Arabic written tweets
    Bouchlaghem, Rihab
    Elkhelifi, Aymen
    Faiz, Rim
    2015 IEEE/ACS 12TH INTERNATIONAL CONFERENCE OF COMPUTER SYSTEMS AND APPLICATIONS (AICCSA), 2015,
  • [49] UKRAINIAN LANGUAGE TWEETS ANALYSIS TECHNOLOGY FOR PUBLIC OPINION DYNAMICS CHANGE PREDICTION BASED ON MACHINE LEARNING
    Prokipchuk, O.
    Vysotska, V.
    RADIO ELECTRONICS COMPUTER SCIENCE CONTROL, 2023, (02) : 103 - 116
  • [50] Towards a Framework for Conceptual Modeling of ETL Processes
    Kabiri, Ahmed
    Wadjinny, Faouzia
    Chiadmi, Dalila
    INNOVATIVE COMPUTING TECHNOLOGY, 2011, 241 : 146 - 160