A lightweight and multilingual framework for crisis information extraction from Twitter data

Cited by: 0
Authors
Roberto Interdonato
Jean-Loup Guillaume
Antoine Doucet
Affiliations
[1] CIRAD
[2] TETIS
[3] Univ. of Montpellier
[4] APT
[5] Cirad
[6] CNRS
[7] Irstea
[8] L3I
[9] Université de La Rochelle
Keywords
Crisis management; Situational awareness; Informativeness ranking
DOI
Not available
Abstract
Obtaining relevant and timely information during crisis events is a challenging task that can be fundamental to handling the consequences of both unexpected events (e.g., terrorist attacks) and partially predictable ones (e.g., natural disasters). Even though microblogging-based online social networks (e.g., Twitter) have become an attractive data source in these emergency situations, overcoming the information overload deriving from mass events is not trivial. The aim of this work was to enable unsupervised extraction of relevant information from Twitter data during a crisis event, offering a lightweight alternative to learning-based approaches. The proposed crisis management framework integrates natural language processing and clustering techniques to produce a ranking of tweets relevant to a crisis situation based on their informativeness. Experiments carried out on six Twitter collections in two languages (English and French) demonstrated the significance and flexibility of our approach.
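The abstract does not say which natural language processing or clustering techniques the framework uses. As a purely illustrative sketch, and not the authors' method, the code below assumes TF-IDF weighting, greedy single-pass clustering by cosine similarity, and cluster size as a stand-in for informativeness. All function names and the 0.5 threshold are hypothetical.

```python
import math
import re
from collections import Counter


def tokenize(text):
    """Lowercase the tweet, drop URLs and @mentions, keep word-like tokens."""
    text = re.sub(r"https?://\S+|@\w+", " ", text.lower())
    return re.findall(r"\w+", text)


def tfidf_vectors(token_lists):
    """Build one sparse TF-IDF vector (a dict) per tweet, with smoothed IDF."""
    n = len(token_lists)
    df = Counter(tok for toks in token_lists for tok in set(toks))
    vectors = []
    for toks in token_lists:
        tf = Counter(toks)
        vectors.append(
            {t: c * (math.log((1 + n) / (1 + df[t])) + 1.0) for t, c in tf.items()}
        )
    return vectors


def cosine(a, b):
    """Cosine similarity between two sparse vectors stored as dicts."""
    dot = sum(w * b.get(t, 0.0) for t, w in a.items())
    norm_a = math.sqrt(sum(w * w for w in a.values()))
    norm_b = math.sqrt(sum(w * w for w in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def rank_tweets(tweets, threshold=0.5):
    """Return (score, tweet) pairs, highest score first.

    Assumption: a tweet's score is the size of its cluster, i.e. how many
    tweets report roughly the same content. This is a hypothetical proxy
    for informativeness, not the measure used in the paper.
    """
    vectors = tfidf_vectors([tokenize(t) for t in tweets])
    clusters = []  # each cluster is a list of tweet indices; first is the seed
    for i, vec in enumerate(vectors):
        for members in clusters:
            if cosine(vec, vectors[members[0]]) >= threshold:
                members.append(i)
                break
        else:
            clusters.append([i])  # no similar seed found: start a new cluster
    scored = [(len(members), tweets[i]) for members in clusters for i in members]
    # sorted() is stable, so ties keep their cluster and arrival order.
    return sorted(scored, key=lambda pair: -pair[0])


if __name__ == "__main__":
    sample = [
        "Flood in Montpellier, bridge closed near the station",
        "Bridge closed near the station after flood in Montpellier",
        "I love coffee",
        "Inondation à Montpellier: pont fermé",
    ]
    for score, tweet in rank_tweets(sample):
        print(score, tweet)
```

Because the sketch relies only on token overlap, it needs no language-specific resources, which is one simple way a pipeline could stay lightweight across English and French. It does not link tweets that describe the same event in different languages.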