A Feasibility Study of Open-Source Sentiment Analysis and Text Classification Systems on Disaster-Specific Social Media Data

被引:4
|
作者
Kejriwal, Mayank [1 ]
Fang, Ge [1 ]
Zhou, Ying [1 ]
机构
[1] Univ Southern Calif, Dept Ind & Syst Engn, Los Angeles, CA 90007 USA
关键词
Crisis informatics; natural language processing; social media; sentiment analysis; text classification; TWITTER; DESIGN;
D O I
10.1109/SSCI50451.2021.9660089
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Crisis informatics is a multi-disciplinary area of research that has taken on renewed urgency due to the COVID-19 pandemic and the runaway effects of climate change. Due to scarce resources, technology, especially augmented artificial intelligence (AI), has the potential to play a meaningful role by using information management for facilitating better crisis response. In part, this is both due to improvements in the underlying technology, as well as an increasing willingness by stakeholders to release data and systems as open-source. Yet, it is still not clear from published literature if such established systems are truly useful on real-world crisis datasets (such as acquired from Twitter) that often contain noise and inconsistencies. In this paper, we explore this agenda by conducting a set of case studies, using real social media data collected during six disasters (including Hurricane Sandy and the Boston Marathon Bombings) and made publicly available on a crisis informatics platform. We apply established, independently developed AI tools, including a resource specifically designed for the crisis domain, to explore whether they yield useful insights that could be helpful to first-responders. Our results reveal that, while such insights can be obtained with relatively low effort, some caveats and best practices do apply, and sentiment analysis results (in particular) are not always consistent.
引用
收藏
页数:8
相关论文
共 47 条
  • [1] A review of open-source machine learning algorithms for twitter text sentiment analysis and image classification
    Lynch, Conor
    Kehoe, Jacqueline
    O'leary, Christian
    Vakaloudis, Alex
    Smith, Gary
    Linger, Richrd
    2020 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2020,
  • [2] Statistical Text Analysis and Sentiment Classification in Social Media
    Cho, Sang-Hyun
    Kang, Hang-Bong
    PROCEEDINGS 2012 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC), 2012, : 1112 - 1117
  • [3] Multilingual Sentiment Analysis on Social Media Disaster Data
    Fuadvy, Muhammad Jauharul
    Ibrahim, Roliana
    2019 INTERNATIONAL CONFERENCE ON ELECTRICAL, ELECTRONICS AND INFORMATION ENGINEERING (ICEEIE), 2019, : 269 - 272
  • [4] SentiVerb system: classification of social media text using sentiment analysis
    Shailendra Kumar Singh
    Manoj Kumar Sachan
    Multimedia Tools and Applications, 2019, 78 : 32109 - 32136
  • [5] SentiVerb system: classification of social media text using sentiment analysis
    Singh, Shailendra Kumar
    Sachan, Manoj Kumar
    MULTIMEDIA TOOLS AND APPLICATIONS, 2019, 78 (22) : 32109 - 32136
  • [6] Analysis of the Functionalities of Open-Source Social Media for Academic Use
    Mejia-Lobo, Mauricio
    Gil-Fernandez, Raquel
    Calderon-Garrido, Diego
    INTERNATIONAL JOURNAL OF INSTRUCTION, 2024, 17 (03) : 337 - 354
  • [7] A Study on Sentiment Analysis on Social Media Data
    Manasa, K. N.
    Padma, M. C.
    EMERGING RESEARCH IN ELECTRONICS, COMPUTER SCIENCE AND TECHNOLOGY, ICERECT 2018, 2019, 545 : 661 - 667
  • [8] An Unsupervised Approach for Sentiment Analysis on Social Media Short Text Classification in Roman Urdu Sentiment analysis on short text classification in Roman Urdu
    Rana, Toqir A.
    Shahzadi, Kiran
    Rana, Tauseef
    Arshad, Ahsan
    Tubishat, Mohammad
    ACM TRANSACTIONS ON ASIAN AND LOW-RESOURCE LANGUAGE INFORMATION PROCESSING, 2022, 21 (02)
  • [9] Effective Text Data Preprocessing Technique for Sentiment Analysis in Social Media Data
    Pradha, Saurav
    Halgamuge, Malka N.
    Nguyen Tran Quoc Vinh
    PROCEEDINGS OF 2019 11TH INTERNATIONAL CONFERENCE ON KNOWLEDGE AND SYSTEMS ENGINEERING (KSE 2019), 2019, : 108 - 115
  • [10] Comparative Analysis of Open-Source Language Models in Summarizing Medical Text Data
    Chen, Yuhao
    Wang, Zhimu
    Zulkernine, Farhana
    2024 IEEE INTERNATIONAL CONFERENCE ON DIGITAL HEALTH, ICDH 2024, 2024, : 126 - 128