A Feasibility Study of Open-Source Sentiment Analysis and Text Classification Systems on Disaster-Specific Social Media Data

Cited by: 4

Authors:

Kejriwal, Mayank [1]

Fang, Ge [1]

Zhou, Ying [1]

Affiliations:

[1] Univ Southern Calif, Dept Ind & Syst Engn, Los Angeles, CA 90007 USA

Source:

2021 IEEE SYMPOSIUM SERIES ON COMPUTATIONAL INTELLIGENCE (IEEE SSCI 2021) | 2021

Keywords:

Crisis informatics; natural language processing; social media; sentiment analysis; text classification; TWITTER; DESIGN

DOI:

10.1109/SSCI50451.2021.9660089

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory]

Discipline classification codes:

081104; 0812; 0835; 1405

Abstract:

Crisis informatics is a multi-disciplinary area of research that has taken on renewed urgency due to the COVID-19 pandemic and the runaway effects of climate change. Due to scarce resources, technology, especially augmented artificial intelligence (AI), has the potential to play a meaningful role by using information management for facilitating better crisis response. In part, this is both due to improvements in the underlying technology, as well as an increasing willingness by stakeholders to release data and systems as open-source. Yet, it is still not clear from published literature if such established systems are truly useful on real-world crisis datasets (such as acquired from Twitter) that often contain noise and inconsistencies. In this paper, we explore this agenda by conducting a set of case studies, using real social media data collected during six disasters (including Hurricane Sandy and the Boston Marathon Bombings) and made publicly available on a crisis informatics platform. We apply established, independently developed AI tools, including a resource specifically designed for the crisis domain, to explore whether they yield useful insights that could be helpful to first-responders. Our results reveal that, while such insights can be obtained with relatively low effort, some caveats and best practices do apply, and sentiment analysis results (in particular) are not always consistent.

Pages: 8

47 items in total

[31] Fine-tuned Sentiment Analysis of COVID-19 Vaccine–Related Social Media Data: Comparative Study
Melton, Chad A.
White, Brianna M.
Davis, Robert L.
Bednarczyk, Robert A.
Shaban-Nejad, Arash
arXiv, 2022
[32] Bias Aware Lexicon-Based Sentiment Analysis of Malay Dialect on Social Media Data: A Study on The Sabah Language
Hijazi, Mohd Hanafi Ahmad
Libin, Lyndia
Alfred, Rayner
Coenen, Frans
PROCEEDINGS OF 2016 2ND INTERNATIONAL CONFERENCE ON SCIENCE IN INFORMATION TECHNOLOGY (ICSITECH) - INFORMATION SCIENCE FOR GREEN SOCIETY AND ENVIRONMENT, 2016: 356-361
[33] Evaluation of the Optimal Topic Classification for Social Media Data Combined with Text Semantics: A Case Study of Public Opinion Analysis Related to COVID-19 with Microblogs
Liang, Qin
Hu, Chunchun
Chen, Si
ISPRS INTERNATIONAL JOURNAL OF GEO-INFORMATION, 2021, 10 (12)
[34] Fine-tuned Sentiment Analysis of COVID-19 Vaccine-Related Social Media Data: Comparative Study
Melton, Chad A.
White, Brianna M.
Davis, Robert L.
Bednarczyk, Robert A.
Shaban-Nejad, Arash
JOURNAL OF MEDICAL INTERNET RESEARCH, 2022, 24 (10)
[35] Glucose Variability Analysis in Two Large-Scale and Real-World Data Sets of Open-Source Automated Insulin Delivery Systems
Cooper, Drew
Reinhold, Bernd
Shahid, Arsalan
Lewis, Dana M.
JOURNAL OF DIABETES SCIENCE AND TECHNOLOGY, 2023
[36] Comparison and Applicability Study of Analysis Methods for Social Media Text Data: Taking Perception of Urban Parks in Beijing as an Example
Shang, Zhenyu
Cheng, Kexin
Jian, Yuqing
Wang, Zhifang
LANDSCAPE ARCHITECTURE FRONTIERS, 2023, 11 (05): 8-29
[37] An integrated framework for flood disaster information extraction and analysis leveraging social media data: A case study of the Shouguang flood in China
Hou, Huawei
Shen, Li
Jia, Jianan
Xu, Zhu
SCIENCE OF THE TOTAL ENVIRONMENT, 2024, 949
[38] Differing Content and Language Based on Poster-Patient Relationships on the Chinese Social Media Platform Weibo: Text Classification, Sentiment Analysis, and Topic Modeling of Posts on Breast Cancer
Zhang, Zhouqing
Liew, Kongmeng
Kuijer, Roeline
She, Wan Jou
Yada, Shuntaro
Wakamiya, Shoko
Aramaki, Eiji
JMIR CANCER, 2024, 10
[39] Application note: Validation of BovHEAT-An open-source analysis tool to process data from automated activity monitoring systems in dairy cattle for estrus detection
Plenio, J. -L.
Bartel, A.
Madureira, A. M. L.
Cerri, R. L. A.
Heuwieser, W.
Borchardt, S.
COMPUTERS AND ELECTRONICS IN AGRICULTURE, 2021, 188
[40] Long-Term Glucose Forecasting for Open-Source Automated Insulin Delivery Systems: A Machine Learning Study with Real-World Variability Analysis
Zafar, Ahtsham
Lewis, Dana M.
Shahid, Arsalan
HEALTHCARE, 2023, 11 (06)