A Feasibility Study of Open-Source Sentiment Analysis and Text Classification Systems on Disaster-Specific Social Media Data

Cited by: 4
Authors
Kejriwal, Mayank [1]
Fang, Ge [1]
Zhou, Ying [1]
Affiliations
[1] Univ Southern Calif, Dept Ind & Syst Engn, Los Angeles, CA 90007 USA
Keywords
Crisis informatics; natural language processing; social media; sentiment analysis; text classification; TWITTER; DESIGN
DOI
10.1109/SSCI50451.2021.9660089
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Crisis informatics is a multi-disciplinary area of research that has taken on renewed urgency due to the COVID-19 pandemic and the runaway effects of climate change. Because resources are scarce, technology, especially augmented artificial intelligence (AI), has the potential to play a meaningful role by using information management to facilitate better crisis response. In part, this is due both to improvements in the underlying technology and to an increasing willingness by stakeholders to release data and systems as open source. Yet it is still not clear from the published literature whether such established systems are truly useful on real-world crisis datasets (such as those acquired from Twitter), which often contain noise and inconsistencies. In this paper, we explore this question by conducting a set of case studies using real social media data collected during six disasters (including Hurricane Sandy and the Boston Marathon bombings) and made publicly available on a crisis informatics platform. We apply established, independently developed AI tools, including a resource specifically designed for the crisis domain, to explore whether they yield insights that could be helpful to first responders. Our results reveal that, while such insights can be obtained with relatively low effort, some caveats and best practices apply, and sentiment analysis results in particular are not always consistent.
Pages: 8
Related papers
47 in total
  • [31] Fine-tuned Sentiment Analysis of COVID-19 Vaccine–Related Social Media Data: Comparative Study
    Melton, Chad A.
    White, Brianna M.
    Davis, Robert L.
    Bednarczyk, Robert A.
    Shaban-Nejad, Arash
    arXiv, 2022
  • [32] Bias Aware Lexicon-Based Sentiment Analysis of Malay Dialect on Social Media Data: A Study on The Sabah Language
    Hijazi, Mohd Hanafi Ahmad
    Libin, Lyndia
    Alfred, Rayner
    Coenen, Frans
    PROCEEDINGS OF 2016 2ND INTERNATIONAL CONFERENCE ON SCIENCE IN INFORMATION TECHNOLOGY (ICSITECH) - INFORMATION SCIENCE FOR GREEN SOCIETY AND ENVIRONMENT, 2016, : 356 - 361
  • [33] Evaluation of the Optimal Topic Classification for Social Media Data Combined with Text Semantics: A Case Study of Public Opinion Analysis Related to COVID-19 with Microblogs
    Liang, Qin
    Hu, Chunchun
    Chen, Si
    ISPRS INTERNATIONAL JOURNAL OF GEO-INFORMATION, 2021, 10 (12)
  • [34] Fine-tuned Sentiment Analysis of COVID-19 Vaccine-Related Social Media Data: Comparative Study
    Melton, Chad A.
    White, Brianna M.
    Davis, Robert L.
    Bednarczyk, Robert A.
    Shaban-Nejad, Arash
    JOURNAL OF MEDICAL INTERNET RESEARCH, 2022, 24 (10)
  • [35] Glucose Variability Analysis in Two Large-Scale and Real-World Data Sets of Open-Source Automated Insulin Delivery Systems
    Cooper, Drew
    Reinhold, Bernd
    Shahid, Arsalan
    Lewis, Dana M.
    JOURNAL OF DIABETES SCIENCE AND TECHNOLOGY, 2023
  • [36] Comparison and Applicability Study of Analysis Methods for Social Media Text Data: Taking Perception of Urban Parks in Beijing as an Example
    Shang, Zhenyu
    Cheng, Kexin
    Jian, Yuqing
    Wang, Zhifang
    LANDSCAPE ARCHITECTURE FRONTIERS, 2023, 11 (05) : 8 - 29
  • [37] An integrated framework for flood disaster information extraction and analysis leveraging social media data: A case study of the Shouguang flood in China
    Hou, Huawei
    Shen, Li
    Jia, Jianan
    Xu, Zhu
    SCIENCE OF THE TOTAL ENVIRONMENT, 2024, 949
  • [38] Differing Content and Language Based on Poster-Patient Relationships on the Chinese Social Media Platform Weibo: Text Classification, Sentiment Analysis, and Topic Modeling of Posts on Breast Cancer
    Zhang, Zhouqing
    Liew, Kongmeng
    Kuijer, Roeline
    She, Wan Jou
    Yada, Shuntaro
    Wakamiya, Shoko
    Aramaki, Eiji
    JMIR CANCER, 2024, 10
  • [39] Application note: Validation of BovHEAT-An open-source analysis tool to process data from automated activity monitoring systems in dairy cattle for estrus detection
    Plenio, J.-L.
    Bartel, A.
    Madureira, A. M. L.
    Cerri, R. L. A.
    Heuwieser, W.
    Borchardt, S.
    COMPUTERS AND ELECTRONICS IN AGRICULTURE, 2021, 188
  • [40] Long-Term Glucose Forecasting for Open-Source Automated Insulin Delivery Systems: A Machine Learning Study with Real-World Variability Analysis
    Zafar, Ahtsham
    Lewis, Dana M.
    Shahid, Arsalan
    HEALTHCARE, 2023, 11 (06)