A Feasibility Study of Open-Source Sentiment Analysis and Text Classification Systems on Disaster-Specific Social Media Data

被引:4
|
作者
Kejriwal, Mayank [1 ]
Fang, Ge [1 ]
Zhou, Ying [1 ]
机构
[1] Univ Southern Calif, Dept Ind & Syst Engn, Los Angeles, CA 90007 USA
关键词
Crisis informatics; natural language processing; social media; sentiment analysis; text classification; TWITTER; DESIGN;
D O I
10.1109/SSCI50451.2021.9660089
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Crisis informatics is a multi-disciplinary area of research that has taken on renewed urgency due to the COVID-19 pandemic and the runaway effects of climate change. Due to scarce resources, technology, especially augmented artificial intelligence (AI), has the potential to play a meaningful role by using information management for facilitating better crisis response. In part, this is both due to improvements in the underlying technology, as well as an increasing willingness by stakeholders to release data and systems as open-source. Yet, it is still not clear from published literature if such established systems are truly useful on real-world crisis datasets (such as acquired from Twitter) that often contain noise and inconsistencies. In this paper, we explore this agenda by conducting a set of case studies, using real social media data collected during six disasters (including Hurricane Sandy and the Boston Marathon Bombings) and made publicly available on a crisis informatics platform. We apply established, independently developed AI tools, including a resource specifically designed for the crisis domain, to explore whether they yield useful insights that could be helpful to first-responders. Our results reveal that, while such insights can be obtained with relatively low effort, some caveats and best practices do apply, and sentiment analysis results (in particular) are not always consistent.
引用
收藏
页数:8
相关论文
共 47 条
  • [41] The Complexities of Metal Detecting Policy and Practice: A Response to Samuel Hardy, 'Quantitative Analysis of Open-Source Data on Metal Detecting for Cultural Property' (Cogent Social Sciences 3, 2017)
    Deckers, Pieterjan
    Dobat, Andres
    Ferguson, Natasha
    Heeren, Stijn
    Lewis, Michael
    Thomas, Suzie
    OPEN ARCHAEOLOGY, 2018, 4 (01): : 322 - 333
  • [42] Data and systems for medication-related text classification and concept normalization from Twitter: insights from the Social Media Mining for Health (SMM4H)-2017 shared task
    Sarker, Abeed
    Belousov, Maksim
    Friedrichs, Jasper
    Hakala, Kai
    Kiritchenko, Svetlana
    Mehryary, Farrokh
    Han, Sifei
    Tung Tran
    Rios, Anthony
    Kavuluru, Ramakanth
    de Bruijn, Berry
    Ginter, Filip
    Mahata, Debanjan
    Mohammad, Saif M.
    Nenadic, Goran
    Gonzalez-Hernandez, Graciela
    JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2018, 25 (10) : 1274 - 1283
  • [43] Data Mining and Spatial Analysis of Social Media Text Based on the BERT-CNN Model to Achieve Situational Awareness: a Case Study of COVID-19
    Jiawei ZHANG
    Hua QI
    JournalofGeodesyandGeoinformationScience, 2022, 5 (02) : 38 - 48
  • [44] Spatial-Temporal Pattern Evolution of Public Sentiment Responses to the COVID-19 Pandemic in Small Cities of China: A Case Study Based on Social Media Data Analysis
    Zhou, Yuye
    Xu, Jiangang
    Yin, Maosen
    Zeng, Jun
    Ming, Haolin
    Wang, Yiwen
    INTERNATIONAL JOURNAL OF ENVIRONMENTAL RESEARCH AND PUBLIC HEALTH, 2022, 19 (18)
  • [45] Comparing location-specific and location-open social media data: methodological lessons from a study of blaming of minorities on Twitter during the COVID-19 pandemic
    Zhang, Shiyi
    Tsatsou, Panayiota
    McLaren, Lauren
    Zhu, Yimei
    JOURNAL OF COMPUTATIONAL SOCIAL SCIENCE, 2024, 7 (03): : 2457 - 2479
  • [46] Effizienter Datenmanagement-Workflow zur Analyse eines PtGtX-Systems basierend auf Open-Source-Software und Low-Cost-HardwareEfficient Data Management Workflow for Analysis of a PtGtX System Based on Open-Source Software and Low-cost Hardware
    Tanja Clees
    Michael Bareev-Rudy
    Malte Pfennig
    HMD Praxis der Wirtschaftsinformatik, 2024, 61 (4) : 891 - 910
  • [47] Comparison of data processing strategies using commercial vs. open-source software in GC-Orbitrap-HRMS untargeted metabolomics analysis for food authentication: thyme geographical differentiation and marker identification as a case study
    Rivera-Perez, Araceli
    Garrido Frenich, Antonia
    ANALYTICAL AND BIOANALYTICAL CHEMISTRY, 2024, 416 (18) : 4039 - 4055